Skip to main content
Royal Society Open Science logoLink to Royal Society Open Science
. 2017 Aug 23;4(8):170192. doi: 10.1098/rsos.170192

Chemical element transport in stellar evolution models

Maurizio Salaris 1,, Santi Cassisi 2,3
PMCID: PMC5579087  PMID: 28878972

Abstract

Stellar evolution computations provide the foundation of several methods applied to study the evolutionary properties of stars and stellar populations, both Galactic and extragalactic. The accuracy of the results obtained with these techniques is linked to the accuracy of the stellar models, and in this context the correct treatment of the transport of chemical elements is crucial. Unfortunately, in many respects calculations of the evolution of the chemical abundance profiles in stars are still affected by sometimes sizable uncertainties. Here, we review the various mechanisms of element transport included in the current generation of stellar evolution calculations, how they are implemented, the free parameters and uncertainties involved, the impact on the models and the observational constraints.

Keywords: stellar physics, mixing, convection, diffusion, rotation

1. Introduction

Almost a century ago Eddington wrote [1]:

‘It is reasonable to hope that in a not too distant future we shall be competent to understand so simple a thing as a star’.

During this time, the theory of stellar evolution has been developed and established, and its main predictions confirmed by a large number of empirical tests that have involved photometric, spectroscopic and asteroseismic observations (and solar neutrino flux measurements, after the discovery of neutrino oscillations). Results obtained from stellar model computations are nowadays widely used to develop a vast array of techniques to estimate distances, ages, star formation histories and chemical evolution of star clusters and galaxies, both resolved and unresolved [2].

Obtaining this kind of information from techniques rooted in stellar evolution calculations is a crucial step to address problems like understanding the mechanisms that drive the formation and evolution of galaxies. The accuracy of results gathered from stellar population analyses is tied to the accuracy of the current generation of stellar models; in this respect, one particularly thorny issue is how to treat the transport of chemical elements in stellar evolution calculations. The problem is that mixing and element transport do not arise from the solution of the equations of stellar structure and evolution, instead they have to be ‘added’ following recipes that often—as we will see—involve a number of free parameters and/or are subject to sizable uncertainties.

On the other hand, the temporal evolution of the chemical abundance profiles within stellar models is a main evolutionary driver, and can be in principle tested through spectroscopic observations of photospheric abundances of key elements, asteroseismic observations (study of non-radial pulsations of stars, which can test thermodynamical properties of stellar interiors that are also affected by the chemical composition), and more indirectly through the effect on star counts (sensitive to evolutionary timescales) and evolutionary paths in colour-magnitude-diagrams (CMDs) or Hertzsprung–Russell diagrams (HRDs), which are all affected by the internal chemical profiles.

The aim of this review is to discuss the various mechanisms of chemical element transport included in the current generation of stellar models, their effect on the evolutionary properties of the models, and the various prescriptions found in the literature, that often produce very different results. This will allow the reader to appreciate the main uncertainties involved, and what properties of stellar models are affected the most.

We start in §2 with a brief summary of the equations of stellar structure and evolution, and an overview of the main evolutionary properties of models of different initial masses, to set the stage for the discussion that follows. The next sections discuss the implementation in stellar modelling, the associated uncertanties and the observational constraints of processes like convection (§3), semiconvection (§4), thermohaline mixing (§5), atomic diffusion (§6), phase separation upon crystallization (§7) and rotationally induced element transport (§8). Section 9 discusses briefly an example of how a combination of several of the mechanisms discussed in the previous sections can explain the puzzling trend of photospheric Li abundances with effective temperature in open clusters, and conclusions follow in §10.

2. Stellar model computation

With stellar modelling we mean here the calculation of the run of physical (i.e. luminosity, temperature, density and specific heats) and chemical quantities, from the centre to the photosphere of a star of a given initial mass and chemical composition, and their evolution with time. Despite tremendous advances in computing power and computational techniques during the last decades, a full detailed modelling of a star by solving the equations of radiation hydrodynamics in three dimensions (3D) is still unfeasible, and this will be the case for the foreseeable future. This inability to model stars with multidimensional radiation hydrodynamics is a consequence of the extreme range of spatial and temporal scales1 that need to be resolved when calculating full evolutionary models covering all stages from the pre-main sequence (pre-MS) to the last white dwarf (WD) or pre-supernova phases. Current hydrodynamics computations are, however, starting to be able to provide some guidelines about mixing processes [3], as highlighted in the following sections.

For these reasons, complete stellar evolution computations still have to rely basically on the following ‘classical’ set of one-dimensional (1D) equations for spherical, non-rotating and non-magnetic stars (equation of continuity of mass, hydrostatic equilibrium,2 energy generation and energy transport, respectively) and Raphson–Newton solution methods [4]:

rm=14πr2ρ, 2.1
Pm=Gm4πr4, 2.2
Lm=ϵnϵν+ϵg 2.3
andTm=GmT4πr4P, 2.4

where the independent variable m is the mass enclosed within radius r, and T, L, P and ρ are, respectively, the temperature, luminosity, pressure and density at the layer specified by the value of m. The coefficient ϵν denotes the energy per unit time and unit mass carried away by neutrinos (that do not interact with the stellar gas), ϵn is the energy per unit time and unit mass produced by nuclear reactions. For a generic nuclear reaction nAA+nbbproducts involving elements A and b with mass fractions XA and Xb and atomic weights AA and Ab

ϵn=RAbQAb,

where QAb is the amount of energy released by a single reaction and RAb is the number of reactions per unit mass and unit time, given by

RAb=ρnA+nb1XAnAXbnbAAnAAbnbσvAbmHnA+nbnA!nb!, 2.5

where 〈σvAb is the reaction cross section.3

The coefficient ϵg represents the so-called gravitational energy produced per unit time and unit mass, and is given by

ϵg=(Uv)T,μvt+(UT)v,μTt+(Uμ)T,vμt+Pvt, 2.6

where U is the internal energy per unit mass, v=1/ρ is the specific volume and μ is the mean molecular weight of the stellar matter. The term (∂U/∂μ)T,v(∂μ/∂t) arises from the variation of U at constant temperature and volume due to the change of chemical abundances. Its contribution to the energy budget is negligible when nuclear reactions are efficient, but is very important in the case of WDs, where nuclear burnings are inefficient. When integrated over the whole stellar structure, ϵg is equal to the time variation of the internal energy plus the gravitational potential energy of the star [2,5].

For the case of radiative plus electron conduction energy transport, the gradient (dln(T)/dln(P)) is set to ∇rad

rad=316πacGκLPmT4, 2.7

where a is the radiation density constant, c is the speed of light, G is the gravitational constant and κ is the Rosseland opacity, including also the contribution of electron conduction when appropriate. In the case of convective energy transport, a theory of convection is needed to calculate the appropriate temperature gradient ∇conv (see §3).

These equations are complemented by a set of I equations (s=1,…,I) for the change of the mass fraction of the I chemical elements considered at the layer specified by m. Consider first the changes due just to nuclear reactions; an element s is produced by w reactions of the type

nhh+nkknps

and destroyed by l reactions

nds+njjnzz.

This provides the following equation for the variation of the abundance of s4

Xst=wρnh+nk1npXhnhXknkAsAhnhAknkσvhkmHnh+nk1nh!nk!lρnd+nj1ndXsndXjnjAsAsndAjnjσvsjmHnd+nj1nd!nj!. 2.8

The complete system of equations (equations (2.1)–(2.4) and I-times equation (2.7)) is solved at a given time t considering the ‘Lagrangian’ independent variable m, with r,L,P,T as unknowns, once the stellar mass and initial chemical composition are specified, and prescriptions for the equation of state of the stellar gas, Rosseland mean opacities, nuclear reaction cross section and energy generation rates, and neutrino production rates are provided. The chemical composition is usually denoted by X, Y , Z, that correspond to the mass fractions of H, He and all other elements collectively called ‘metals’, respectively. A relative distribution of the metal abundances also needs to be specified.

Notice that the chemical abundance profile enters explicitly equation (2.7) and ϵn, and affects the coefficients ϵν, ϵg, the opacities κ and the equation of state, which all depend on the chemical composition of the stellar matter.

Figure 1 shows an overview of the evolutionary paths of single stars with different initial masses (plus the main by-products of interacting binary evolution), as derived from complete stellar evolution models. This general evolutionary framework is solid and does not depend on the details of element transport modelling although the precise values of the various mass ranges do (and they depend also on the initial chemical composition). It constitutes a reference guideline for the discussions presented in the following sections.

Figure 1.

Figure 1.

General evolutionary paths of single stars with different initial masses. The exact values of the mass ranges depend on the initial chemical composition and the details of the adopted element transport mechanisms. We show also some final product of the evolution of interacting binaries (Novae, Type Ia supernovae). He-core WDs from single stars can form only on timescales much longer than a Hubble time, but are produced nowadays by interacting binary systems. There are observational indications that massive stars with initial mass above approximately 20M might not explode as supernovae, rather collapse directly to a black hole [6].

It is clear that chemical element transport does not arise naturally from the equations of stellar structure and evolution. This makes it necessary to first identify all possible mechanisms—in addition to the nuclear reactions—able to change the chemical abundances at a given mass layer, and then to develop formalisms for their implementation in the 1D equations (a major difficulty). The element transport mechanisms described below will add extra terms to the right-hand side of equation (2.7), in addition to the terms describing the effect of nuclear reactions.

3. Convection

Besides radiation and electron conduction, convection is the third fundamental mechanism for energy transport in stars. It involves organized large-scale motions of matter that in addition to carrying energy are also a very efficient source of mixing. It can be envisaged as a flux of matter from deeper—hence hotter—stellar layers moving vertically outwards into cooler layers, and material from cooler outer layers flowing down to hotter inner layers.

The implementation of this element (and energy) transport in stellar models requires first a criterion for the onset of the convective instability, and then a mathematical treatment to predict the main physical properties of convective regions.

3.1. Instabilities in non-rotating stars

Matter inside the stars is never at rest, but usually the gas is locally subject to small random perturbations around equilibrium positions. Under certain conditions, these small random perturbations can trigger large-scale motions that involve sizable fractions of the total stellar mass. These large-scale motions are called convection, and are the equivalent to the motion of water elements in a kettle heated from below.

The treatment of convection in stellar interiors is extremely complicated and requires the introduction of various approximations. This stems from the fact that the flow of gas in a stellar convective region is highly turbulent, forcing us to adopt simplified models that provide only mean approximate values for the properties of the flow of gas. The so-called mixing length theory (MLT) is the local, time-independent convection model almost universally used in stellar evolution calculations, which we will discuss below. Firstly, we show how a simplified linear analysis is sufficient to determine the main criteria for the onset of mixing (not only convective) in stellar interiors [7].

We consider a gas element at rest at a distance r from the star centre. This gas bubble will have a pressure P0, temperature T0, density ρ0 and mean molecular weight μ0 (the molecular weight is the mean mass of the gas particles in atomic mass units) equal to those of the environment, supposed to be in radiative equilibrium (in this section ‘radiative’ actually means ‘radiative plus conductive’) as shown in figure 2. If random motions displace the bubble by a small amount Δr away from the equilibrium position, the equation of motion for an element of unit volume can be written as (assuming the viscosity is negligible)

ρd2Δrdt2=gΔρ,

where Δρ is the difference (ρbubbleρsurr) between the bubble (supposed to have constant density) and the surroundings, and g is the local acceleration of gravity. One reasonable assumption is that the motion of the bubble is fast enough that all time derivatives of the mean stellar properties are equal to zero.

Figure 2.

Figure 2.

(a) Set-up of the simple analysis for the onset of instabilities (see text for details). (b) Stable situation. A blob of gas displaced vertically by a small amount oscillates around its equilibrium position. (c) Unstable situation. After the initial displacement, the blob of gas continues to rise as time goes on.

As a consequence of the displacement from its equilibrium position, two distinct physical situations can occur, as shown in figure 2. If the region is convectively stable, the displaced gas parcel experiences a restoring force that moves it back towards the original position, as illustrated in figure 2b. Indeed, the blob is subject to a stable oscillation around an equilibrium position with a frequency named the Brunt–Väisälä frequency (see below). If the bubble of material at position rr is less dense than the surrounding material, it will continue to be pushed upwards by buoyancy forces and the region is then said to be convectively unstable, as shown in figure 2c. Similarly, if the bubble displaced at a radial distance r−Δr is denser than the surrounding material, its motion will continue because it is heavier than the local environment.

To determine which solution applies to a given layer within a star, we assume that along the displacement Δr the bubble is always in pressure equilibrium with the surroundings, that is, ΔP=(PbubblePsurr)=0, and that the molecular weight of the bubble μbubble is always equal to its initial value μ0 (there is no matter exchange with the surroundings, hence the gas parcel retains its identity). The assumption of pressure equilibrium means that the motion of the bubble has to happen with a speed lower than the local sound speed. When there is a molecular weight gradient dμ/dρ throughout the region, the difference Δμ=(μbubbleμsurr) will be equal to Δμ=μ0−[μ0+(dμ/drr], hence Δμ=−(dμ/drr.

Using the relationships dln(P)=(1/P)dP and dln(μ)=(1/μ)dμ, one gets

Δμ=μ(dln(μ)dln(P))(dln(P)dr)Δr.

Differentiating with respect to time, one obtains

dΔμdt=μdln(μ)dln(P)dln(P)drdΔrdt.

The temperature difference ΔT=(TbubbleTsurr) depends on the difference between the temperature gradients in the bubble and in the surroundings and the rate of temperature change due to energy losses from the bubble (due, for example, to thermal diffusion) whose efficiency will depend on a parameter we denote as ζ, hence

ΔT=[(dTdr)ad(dTdr)rad]ΔrζΔTdt.

By introducing the notation (dln(T)/dln(P)) and differentiating with respect to time, one obtains

dΔTdt=Tdln(P)dr(adrad)dΔrdtζΔT.

If ΔP=0, and in the assumption that the differences ΔT, Δρ and Δμ are small, we obtain from the equation of state

χρΔρρ+χTΔTT+χμΔμμ=0,

where

χρ=(dln(P)dln(ρ))T,μ,χT=(dln(P)dln(T))ρ,μandχμ=(dln(P)dln(μ))ρ,T.

We have derived in this way the following set of four homogeneous equations for the four unknowns ΔT, Δρ, Δμ and Δr:

ρd2Δrdt2+gΔρ=0, 3.1
dΔμdt+μdln(μ)dln(P)dln(P)drdΔrdt=0, 3.2
dΔTdt+Tdln(P)dr(radad)dΔrdt+ζΔT=0 3.3
andχρΔρρ+χTΔTT+χμΔμμ=0. 3.4

One can search for solutions of the form Δx=Aent. By inserting into the respective equations this functional dependence for ΔT, Δρ, Δμ and Δr, a non-trivial solution is found when the determinant derived from the coefficients of AT, Aρ, Aμ and Ar is equal to zero, giving

n3+ζn2+[gχTχρdln(P)dr(radad+χμχTμ)]n+(ζgχμχρdln(P)drμ)=0, 3.5

where we have defined μdln(μ)/dln(P).

The condition that at least one of the n is real and positive or imaginary with a positive real part (unstable solutions) is given by the Hurwitz criterion, resulting in at least one of the following conditions to be satisfied (we recall that the pressure P always increases towards the star centre, χμ is negative, χT and χρ positive):

Δμ<0, 3.6
rad>adχμχTμL 3.7
andrad>ad. 3.8

If Δμ<0 (μ increasing towards the surface), the medium is always unstable. When the gas parcel is displaced upwards (downwards) by a small distance Δr, its density will be lower (higher) than the environment, and will continue to be pushed upwards (downwards) by buoyancy. The temperature difference between the displaced mass element and its surroundings suppresses or favours this displacement, depending upon the difference between ∇rad and ∇ad (more below). For the more common case of Δμ≥0 (heavier elements are usually synthesized by nuclear reactions in the central regions), we should consider equation (3.7)—the so-called ‘Ledoux criterion’—and equation (3.8)—the so-called ‘Schwarzschild criterion’. The second term on the right-hand side of equation (3.7) is positive for a positive ∇μ (the composition gradient has a stabilizing effect) so that if the gradients satisfy the Ledoux criterion, the Schwarzschild criterion is automatically satisfied, hence the Schwarzschild criterion determines the presence of an unstable medium.

These instability criteria are ‘local’, in the sense that they can be applied layer-by-layer without accounting for non-local effects that can be, however, relevant when dealing with convective mixing.

Figure 3 displays a qualitative sketch in the ∇μ−(∇rad−∇ad) diagram of the region where instabilities occur, divided into four quadrants. The only stable region in this diagram is the bottom left quadrant, where ∇μ≥0 and ∇rad<∇ad. All other quadrants are unstable regions, although the type of mixing involved depends on the exact values of the gradients. If ∇μ≥0 and ∇ad<∇rad<∇L (lower right quadrant), the instability is called ‘semiconvection’.

Figure 3.

Figure 3.

Sketch of the ∇μ−(∇rad−∇ad) stability plane with different regimes labelled (see text for details). The diagonal line dividing into half the top left and bottom right diagrams denotes ∇L.

The linear analysis in the semiconvective regime shows that if the gas bubble is displaced upwards and ζ=0, its internal temperature will be slightly larger compared to the surroundings, hence its lower density will favour a continuation of the displacement. On the other hand, μ of the raising bubble will be larger than the environment and in the semiconvective regime of the gradients this effect will prevail, causing the return of the mass element to its starting position. When ζ>0, the temperature of the element returning from above is smaller than its initial value at r, therefore the restoring force is larger upon returning than when the bubble was leaving its starting position. As a consequence, the bubble returns back to a radial location r with a larger velocity than when it left for the upwards motion. On the downward excursion, the same effect (with an opposite sign) will operate and the amplitude of these oscillations will progressively increase due to the increasing radiative losses during the oscillatory motion.

The growth of these oscillations (overstability) will lead to chemical mixing and thus decrease or destroy the stabilizing gradient ∇μ. Results from numerical simulations and laboratory experiments show a more complex picture [8,9], with in some cases, the formation of well-mixed fully convective layers, separated by stratified interfaces, but the linear analysis suffices to highlight the peculiarity of the mixing associated with semiconvection.

The efficiency (timescale) of semiconvective mixing is difficult to estimate and depends on the efficiency of the bubble energy losses; an analysis of the growth rate in the framework of the linear analysis shows that as a result of semiconvective mixing ∇rad∼∇ad [10], and this is another approach to treat semiconvective mixing in stellar evolution calculations, i.e. to impose a mixing efficiency such that ∇rad∼∇ad (actually this was the original way in which semiconvection was implemented [11]) in a semiconvective region (see §4).

If ∇μ<0 and ∇rad<∇L (top left quadrant), the instability is usually called a ‘thermohaline’ instability. If ζ=0, a gas bubble displaced downwards will have a larger μ than the environment, but also a larger temperature than the radiative environment. The combined effect is that the bubble density is lower than that of the environment, and buoyancy will push back the displaced gas element. However, if ζ>0, the energy loss will eventually decrease the bubble temperature enough to induce a further displacement downwards due to the effect of the larger μ.

This instability is controlled by the heat leakage of the displaced element, and it is observed, for example, when a layer of hot salt water lies over a layer of cold fresh water. Upon displacing downwards a blob of hot salt water, due to the fact that heat transfer by molecular collisions is typically faster than the motion of chlorine and sodium ions that cause the composition to equilibrate, the sinking blob will be able to come into thermal equilibrium with the surrounding medium faster than achieving composition equilibrium. Given that salt water is heavier than fresh water at the same temperature, the blob will continue to sink in the surrounding fresh water. As this motion continues, the medium develops ‘fingers’ of salt water reaching down into the fresh water. Inside a star, the role of salt can be played by a heavier element like helium, in a hydrogen-rich medium. This is an example of so-called ‘doubly diffusive instabilities’, because it involves the diffusion of two different components (particles and heat).

In general, a convectively unstable region mixes matter on very short timescales compared with evolutionary timescales (either nuclear or Kelvin–Helmholtz timescales), and the chemical profile in convective layers can always be assumed uniform to a very good approximation (instantaneous mixing approximation), with abundances of individual elements equal to values averaged over the whole convective region. If the convective region extends from mass layer m1 (inner boundary) to mass layer m2 (outer boundary) within the star, inside this region the abundance Xs of a generic element s is constant. At the boundaries (one or both of them), one may have a discontinuity between the homogeneous convective chemical profile and the profile in the radiative regions, for example, due to nuclear reactions (previous and/or present). Owing to these effects and just to give an example, the time evolution of Xs (in the approximation of instantaneous mixing) within an expanding convective shell is to a first order given by

dXsdt=1Δm[m1m2dXsdtdm+dm2dt(Xs2Xs)dm1dt(Xs1Xs)], 3.9

where Δm=m2m1, and Xs1, Xs2 are the abundances on the radiative side of the discontinuities at, respectively, the inner and outer boundary of the convective region. The first term in the integral describes the variation due to the nuclear burnings (if efficient), whereas the other two terms describe the change in composition when the boundaries of the convective zone move into surrounding regions of—in principle—inhomogeneous composition.

Exceptions to the validity of the instantaneous mixing in convective regions are the advanced evolutionary phases of massive stars about to explode as Type II supernovae (figure 1), the production of Li due to the Cameron–Fowler mechanism in the envelopes of red giant branch (RGB) stars [12] and proton ingestion episodes into the intershell convection zone of low-metallicity asymptotic giant branch (AGB) stars [13,14]. In this case, the nuclear-burning timescales in the convective regions are comparable to convective mixing timescales. This case is usually treated with a time-dependent convective mixing discussed in the next subsection.

Before closing this section, we just introduce a quantity that will appear often in the rest of the paper. In the case of ζ=0, gas elements in a stable region will oscillate around their equilibrium position with a frequency called the Brunt–Väisälä frequency, usually denoted with N. This can be derived easily from equation (3.5), and is equal to

N2=gdlnPdrχTχρ(radad+χμχTμ). 3.10

Another expression for this frequency (that we will use in the rest of the paper) involves the derivatives δ=(lnρ/lnT)P,μ and ϕ=(lnρ/lnμ)P,T, and the pressure scale height HPdr/dln(P)=P/(ρg) (where g is the gravitational acceleration), and is

N2=NT2+Nμ2=(gδHP(ad)+gϕHPμ). 3.11

Using δ and ϕ, the gradient ∇L (Ledoux gradient) can be rewritten as

L(ad+ϕδμ). 3.12

3.2. The mixing length theory of convection

From the point of view of the evolution of chemical abundances, the instantaneous mixing approximation does not require a model for stellar convection. However, this is necessary for calculating convective velocities and fluxes, and when a time-dependent description of convective mixing is required. The formalism almost universally used in stellar evolution calculations is the MLT [15], a simple, local, time-independent model, firstly applied to stellar modelling by Biermann [16]. The formulation by Böhm-Vitense [17] is usually employed in modern stellar evolution calculations.

The basic idea of the MLT is to assume that the stellar fluid is composed of identifiable convective elements that move vertically in the gravitational field between regions of higher and lower temperature. Indeed, there is no net mass flow, but the effect is an outward transport of energy. The MLT assumes a characteristic distance over which bubbles rise before dissipating, the so-called mixing length Λ (figure 4), and describes the motion of these bubbles over the characteristic scale Λ under some general assumptions:

  • — all bubbles have the same characteristic size that is of the same order as Λ;

  • — Λ is much smaller than any other length scale of physical significance in the star;

  • — the physical properties, i.e. temperature, density, pressure and chemical composition, of the bubbles differ only slightly from the surrounding medium;

  • — pressure equilibrium with the environment is maintained. This means that the velocities of the convective elements are small compared with the local sound speed in the local environment.

Figure 4.

Figure 4.

Schematic illustration of the MLT approximation to convective motion. The mixing length Λ corresponds to the characteristic radial distance scale over which rising and falling convective elements move before merging with the surrounding medium.

Λ is assumed to be equal to a multiple of the local pressure scale height Hp, i.e. Λ=αMLTHP, with αMLT—the mixing length parameter—being a constant to be empirically calibrated (usually by reproducing the solar effective temperature at the solar age with a solar model).

The need for a mean free path in the simple framework of the MLT can be easily explained as follows. Let us consider the cross sections of the rising and falling gas columns; if originally in a given convective layer the cross sections were the same, the rising gas (always in pressure equilibrium with the surroundings) will expand by a factor e after a distance equal to HP. This means that at this point within the star there is much less space available for the falling gas. On the other hand, the amount of falling material must be the same as the rising one, otherwise the star would either dissolve or concentrate all its mass in the interior, thus violating the hydrostatic equilibrium condition. The only solution is that after a distance Λ of the order of HP, part of the material stops and inverts its motion.

By relying on the previous assumptions, the MLT provides the following equations for the velocity of the convective elements vc, the convective flux Fc and the convective efficiency Γ (defined as the ratio between the excess heat content of a raising convective bubble just before its dissolution, and the energy radiated during its lifetime) [18]:

vc2=aΛ2gδ(c)Hp, 3.13
Fc=bρvccpTΛ(c)Hp 3.14
andΓcad=4cpρ2Λvcκc2aT3, 3.15

where ∇ is the temperature gradient of a rising (or falling) element of matter within the convective region, and ∇c is the average temperature gradient of all the matter at a given level within the convective zone (the quantity needed to solve the stellar structure equations). There are three additional free parameters besides αMLT, i.e. a, b and c, that are usually fixed a priori and define the MLT ‘flavour’ [19]5 There are four unknowns, namely vc, Fc, ∇c, ∇ and the three previous equations plus an additional equation arising from the fact that the total flux to be transported by radiation plus convection is known from the solution of the stellar evolution equation

L4πr2=4acgT4m3κPr2+Fc. 3.16

If convection is efficient in the deep stellar interiors, the MLT provides ad and velocities of the order of 1–100 m s−1, many orders of magnitude smaller than the local sound speed. On the contrary, in convective layers close to the surface the gradient is strongly superadiabatic and velocities are much larger, of the order of 1–10 km s−1, close to the local sound speed.

External or inner regions of a stellar model are convective in the following cases:

  • — Large values of the opacity κ. Given that ∇radκF, where F is the energy flux, ∇rad tends to increase above ∇ad. The radiative opacity generally increases with decreasing temperature—for a given chemical composition—hence this situation occurs most commonly in the cooler outer layers of stars.

  • — If the energy generation rate in the star is very sensitive to the temperature, then the energy flux F rises rapidly as r approaches zero in the stellar centre. This large heat flow can eventually cause ∇rad to increase above ∇ad. This situation occurs only in stellar cores, when the nuclear energy generation rate is very sensitive to temperature, as is the case of the H-burning CNO-cycle, He-burning and more advanced nuclear-burning stages.

  • — In ionization zones the adiabatic gradient can decrease below the typical value ∇ad∼0.4. Also, in these regions the radiative opacity usually increases. Owing to these effects, one can expect the ionization zones located in the outer layers to be convective.

When the convective mixing timescales are comparable to nuclear-burning timescales, a diffusive approach is usually employed to follow the chemical evolution [20]. This means that an extra term is added to the right-hand side of equation (2.7) for a given element s, i.e. the right-hand side of the following diffusion equation:

Xst|Mr=1ρr2r(Dcρr2Xsr), 3.17

where the diffusion coefficient associated with the convective transport is taken to be Dc=13αMLTvcHP, using the value of vc derived from the MLT (see [2123], for examples).

The MLT is very appealing for its mathematical simplicity (despite the free parameters involved) and the fact that it relies just on local quantities, hence it is easy to include in stellar evolution codes and does not affect the stability of the numerical solution of the stellar evolution equations. Once αMLT is calibrated on the Sun, the resulting HRDs (and CMDs) of the models generally reproduce the observations satisfactorily within the current errors.6 Comparisons of the effective temperature evolution of models calculated with a solar αMLT and with a calibration of αMLT obtained from 3D radiation hydrodynamics simulations of stellar envelopes [25] display an agreement within 30–50 K [26].7 On the other hand, helioseismic data (the study of non-radial oscillations of the Sun [28]) clearly point to shortcomings of the MLT description of the physical structure of convective regions [29].

An alternative local description of convection included in stellar evolution calculations with the ATON code [30] is presented in [31]. The main difference from the MLT is that the convective elements have a spectrum of sizes, and the scale length of the convective motions is set to be equal to the distance to the closest convective boundary. Stellar evolution tracks calculated with this convection model show a different evolution of the effective temperature Teff compared to calculations with the MLT, especially along the RGB phase. Differences increase with increasing initial metallicity. This convection model also suffers from shortcomings when compared with helioseismic results [29].

There are very sophisticated non-local Reynolds stress models that describe not only convection, but also (see below) semiconvection and overshooting [3237]. They introduce a large number of equations to be coupled to the stellar structure equations and several free parameters to be calibrated observationally, and are generally not included in current stellar evolution modelling.

Convection plays a major role in determining the evolutionary properties of stellar models. Figure 5 shows the HRD of models with (the 5M model) and without (the 1M model) convective cores during the MS. The shape of this evolutionary phase in the HRD is completely different between the two tracks, because of the different time evolutions of the H-profile in the burning region, as shown by figure 6. In the inner radiative layers of the 1M model, the H-abundance profile changes smoothly during the MS evolution, and at any given time the H abundance increases gradually from the core towards the more external layers where the burning becomes progressively less efficient. In the convective inner layers of the 5M model—where the burning takes place—the H-abundance profile is uniform, and the progressive retreat with time of the outer border of the convective core produces at a given time t a H-abundance profile flat in the innermost regions, then increasing outwards.

Figure 5.

Figure 5.

HRD of 1M and 5M stellar evolution models with the initial solar chemical composition. The 5M model is computed until the end of He-core burning. The 1M model stops during the RGB evolution. The dashed line displays the HRD of the same 5M model calculated with MS core overshooting (0.2Hp—see text for details).

Figure 6.

Figure 6.

(a) The evolution with time of the H-abundance profile (H abundance as a function of the local fractional mass) within the 1M model of figure 5, during the MS phase. (b) The same, but for the 5M model without core overshooting. The different numbers label a temporal sequence with decreasing values of the H abundance in the mixed core.

Surface convection (present everywhere along the HRD evolution, apart from the MS phase of the 5M model) is very important when comparing the surface abundances measured from spectroscopy with the models, because of the dredge-up phenomenon, whereby the fully mixed convective envelopes reach layers processed by nuclear burnings, hence altering their chemical composition during the RGB and AGB phases [2].

3.3. Convective overshooting

As already mentioned, the criteria for the onset of convection are local. The boundaries of the convective regions are fixed at the layer where the random motions of the gas do not get amplified. At this convective border, the acceleration of the gas elements is zero, but not the velocity. One expects, therefore, the chemically mixed region to extend into the formally stable layers in the case of both core and envelope convection. In real stars mixing beyond the formal boundary most probably results from the interplay of several physical processes [3840], grouped in stellar evolution modelling under the ‘umbrella’ term overshooting (or sometimes convective boundary mixing). These overshooting processes are modelled very crudely by introducing free parameters to be calibrated on observations of eclipsing binaries [41,42] (typically comparisons with masses and radii of both components under the assumptions that they are coeval) and open clusters [43] (the shape of the MS turn-off (TO) region).

The standard approach is to assume that overshooting into the stable layers does not affect their thermal structure, hence the temperature gradient stays radiative. The composition is then mixed instantaneously between the formal convective border and layers at a distance λHP from this border, where λ (λ<1) is a free parameter to be observationally calibrated, and HP is the pressure scale height at the convective border. This is the approach taken, for example, in the BaSTI [44], DSEP [45], Yale–Yonsei [46] codes.

A similar approach of instantaneous mixing is taken in [47]. The following integral criterion is employed to fix the extent of the convective core overshooting, inspired by a constraint on the maximum possible extension of the overshooting region developed by Roxburgh [48]

0rccFover(LradLN)1T2dTdrdr+rccrov(2Fover)(LradLN)1T2dTdrdr=0, 3.18

where Fover is a free parameter (between 0 and 1) to be calibrated against observations. The two luminosities Lrad=−((16πacT3r2)/(3κρ)) dT/dr and LN represent, respectively, the local radiative luminosity and the total luminosity produced by nuclear reactions within radius r, while rcc is the radius of the convective core boundary and rov is the radius of the outer boundary of the overshooting region, to be determined through this equation.

The various releases of Padua stellar evolution models [49] and the PARSEC code [50] consider instead an adiabatic gradient for the overshooting region (this is the case of penetrative convection according to [51], because the overshooting material is assumed to be able to change the entropy stratification), and the spatial size of the overshooting region is determined as follows [52]. Starting from each radial distance ri from the centre inside the formally convective region, the following equation (based on the MLT) is integrated outwards, up to r=ri+l, where lHp, Hp being the pressure scale height at the convective boundary, and λ (λ<1) a free parameter:

13vr3r=gκTχTχρFccPρgμχμχρΔμvr, 3.19

with Δμ=μ(r)−μ(ri). The convective flux Fc is determined as Fc=FFrad, where F=L/(4πr2) is the total energy flux and Frad=−((4acT3)/(3κρ)) dT/dr. In the overshooting region, where the actual gradient is set to adiabatic, Frad>F hence Fc<0, mimicking the fact that the convective elements penetrating into the stable layers are formally cooler than the surrounding medium.

At any given r, the convective elements originated in the range between rl and r will display a range of velocities, and the maximum value vm attained at each r is taken to derive the run of vm(r) as a function of r. The border of the instantaneously mixed region is then taken at the radial distance for which vm=0.

In all these approaches, the free parameters that determine the spatial extent of the convective core overshooting region needs to decrease in the regime of small convective cores. The reason is that HP as r0, hence the smaller the size of the formally convective core, the larger the size of the actual mixed core (including overshooting region), with no smooth transition of the shape of MS evolutionary tracks from the convective core to the radiative core case. Observations of the MS TO of open clusters of various ages confirms the need to decrease to zero the extent of the overshooting regions when the stellar mass decreases, in the regime of small convective cores (masses below approx. 1.5M—see, e.g. [47]). This means that the assumed trend of λ (or Fover) with mass, for masses with small convective cores, introduces an additional degree of freedom, albeit affecting only a restricted range of stellar masses. In this context, following an integral constraint on the maximum possible extent of overshooting regions [48,53], it has been proposed to limit the extent of the overshooting region to 15% of the radius of the formally convective core to generate a smooth transition from models with convective cores to models with radiative cores on the MS [54].

Additionally, one can find in the literature two very different approaches to the mixing beyond the formal convective boundary.

  • (i) A diffusive approach [55] used, for example, in the STARS [56], MESA [57,58] and GARSTEC [59] codes. The element transport beyond the formal convective boundary is described as a diffusive process (avoiding the instantaneous mixing approximation) based on results of 2D radiation hydrodynamics simulations of shallow stellar surface convection zones [60]
    Xit|Mr=1ρr2r(Dovρr2Xir). 3.20
    The diffusion coefficient Dov is given by
    Dov=Dcexp(2zfHp), 3.21
    where Dc is the diffusion coefficient inside the convective region (Dc=(13)αMLTvcHP), z is the distance from the convective boundary, HP is the pressure scale height at the convective boundary and f is a dimensionless free parameter. Typical calibrated values of f are approximately 0.01–0.02.
  • (ii) Modelling of the mixing as ‘turbulent entrainment’, following the simulations by Meakin & Arnett [61], as employed in the stellar evolution calculations by Staritsin [62]. The motion of matter in the zone of convective instability is turbulent, and the rising turbulent flow spreads horizontally near the boundary of the turbulent region. The interface between the convective region and the stable layers moves through the stable region due to the continuous involvement of new layers in the turbulent motion. The velocity Ve of the penetration of the convective turbulent boundary into the stable layers is determined by the ‘turbulent-entrainment’ law written as
    VeVt=ARiBn, 3.22
    where Vt is a typical turbulent velocity at the boundary (that can be taken, for example, from the MLT), A and n are parameters that characterize the entrainment (A∼0.027 and n∼1.05 according to the simulations by Meakin & Arnett [61]), and RiB is the so-called bulk Richardson number defined as
    RiB=lΔbVt2, 3.23
    with l denoting the typical size of the eddies doing the entrainment (some fraction of the pressure scale height Hp at the boundary of the mixed region) and
    Δb=ΔhN2dr. 3.24

    This integral is performed across a region of thickness Δh that contains the convective boundary, N2 being the the Brunt–Väisälä frequency. When Ve is determined, the distance d over which this boundary shifts during an evolutionary time step Δt is given by d=VeΔt.

Despite the uncertainties involved in its parametrization, the treatment of overshooting is very important because it affects the evolutionary properties of the models. For example core overshooting during the MS produces brighter models, an increased MS lifetime (because more fuel is available), larger He-core masses that induce a brighter and shorter-lived He-burning phase, and also less extended loops in the HRD (figure 5). Overshooting below convective envelopes can alter the surface abundances after the dredge-up episodes [63], and affect the luminosity of the RGB bump (see §5) and also the extension of the loops in the HRD (in the case of the 5M model in figure 5, the loop during the core He-burning phase would become more extended with overshooting also from the convective envelope). Age estimates of young–intermediate age clusters are obviously affected by the amount of overshooting included in the models, i.e. the larger the overshooting region the older is the age estimate of a given cluster. Overshooting also plays an important role during C-burning in super AGB stars, affecting the propagation of the carbon-burning flame [64]. Too large overshooting at the base of the convective C-burning region can prevent the flame from reaching the centre, thus producing a hybrid CONe core, and eventually a CONe WD.

It has also been shown that the diffusive approach to mixing in the overshooting region provides different results in terms of evolutionary times compared to instantaneous mixing, because of a slower addition of extra fuel from the overshooting region when this scheme is implemented [23].

4. Semiconvection

After convection, semiconvection (called ‘double-diffusive convection’ in oceanography) is probably the most significant element transport mechanism in non-rotating stellar evolution models. Below are the two major cases where semiconvective transport is efficient, and how this is implemented in stellar models.8

4.1. H-burning phase with convective cores

There is a large body of literature that addresses the issue of semiconvection in massive stars with convective cores during the MS [10,11,6668]. In a nutshell, layers left behind by shrinking convective cores during the MS will have a hydrogen abundance that increases with increasing radius (the chemical composition of each layer is determined by the composition of the convective core at the moment the layer has detached from the retreating mixed region). They are characterized by ∇ad<∇rad<∇L, and a treatment of semiconvective mixing is needed (figure 7). There is also a narrow range of masses around 1.5M (the values depending on the initial chemical composition) with increasing mass of the convective core during part of the MS, where a narrow semiconvective region forms right above the fully mixed core [69].

Figure 7.

Figure 7.

Profiles of the hydrogen mass fraction X and the radiative, adiabatic and Ledoux gradient as a function of the fractional mass coordinate, within a 1.5M model during the MS phase. From left to right, different tones of grey mark convective, semiconvective and radiative layers, respectively (courtesy of V. Silva-Aguirre).

The efficiency of element transport in this semiconvective region has important consequences for the morphology of the TO of the tracks—very efficient semiconvection mimics the case of overshooting beyond the formal convective boundary and no semiconvection [69]—and the post-MS evolution of massive stars [70,71] because it changes the H-profile above the H-exhausted region. To provide a guideline, for massive stars inefficient mixing favours core He-ignition on the cool (red) side of the HRD, while efficient mixing favours the ignition on the hot (blue) side of the HRD (and increases MS evolutionary timescales). One could think that comparisons with the observed ratio of blue to red supergiants (B/R ratio) could provide strong constraints on the efficiency of semiconvective mixing in massive stars. However, other factors like the extent of the overshooting region (that acts in the direction to reduce the size of the semiconvective layers), and rotation will affect the HRD location (and eventually loops) during the He-burning phase [72]. At the moment no set of theoretical models seems to be able to match observational constraints on the B/R ratio, that is also strongly affected by mass transfer in binaries, which make up most of the massive star population.

Semiconvective mixing is included in these evolutionary calculations following various recipes. A traditional approach was to iterate the composition in the individual semiconvective layers until ∇rad=∇ad locally [11,70,71]. More modern calculations adopt again a diffusive approach, including a chosen semiconvective diffusion coefficient DSC. The coefficient from [10] is implemented in the codes STERN [73], MESA and GARSTEC. It is derived from a linear local stability analysis [7], assuming the MLT to determine the velocity of the gas elements (using αMLT=1.5), and is given by

DSC=αSCK6cpρadL, 4.1

where K=(4acT3)/(3κρ) is the thermal conductivity, ∇ is the actual temperature gradient and the free parameter αSC determines the mixing timescale (larger αSC correspond to shorter mixing timescales). Values of αSC currently used are of the order of approximately 0.04–0.1, calibrated on empirical constraints [22,72]. The value of ∇ in the semiconvective region is determined from

L=Lrad(1+(LSCLrad))andLSCLrad=αSCad2(L)[(ad)β(83β)3224ββ2μ],} 4.2

where β is the ratio of the gas pressure to the total pressure (gas plus radiation), L is the total luminosity and Lrad is the radiative luminosity, which can be written as

Lrad=163πacGmT4κP.

Larger values of αSC produce semiconvective temperature gradients ∇ increasingly close to ∇ad.

Another expression for DSC has been derived in the assumption of layering of the semiconvective region, with nearly uniform composition in each layer, separated by thin boundary layers within which the chemical elements are transported by molecular diffusion alone [74]:

DSC=DsKT(4β3)radadμ. 4.3

where KT=K/(ρcP) is the thermal diffusivity, β is the ratio of the gas pressure to the total pressure (gas plus radiation) and Ds is the diffusion coefficient of He due just to the He abundance gradient (see §6). For a chemical composition of essentially two elements (in our case H and He) with atomic numbers Z1 and Z2, masses m1 and m2 and number densities n1 and n2,

Ds=316n(2KBTπm)1/2(2KBTZ1Z2e2)21ln(Λ),

where m=m1m2/(m1+m2), n=n1+n2, KB is the Boltzmann constant and

Λ=1+(4λDKBTZ1Z2e2)2,

with λD=(KBT/(4πne e2))1/2 (Debye length) and ne the electron density.

Typically, Ds is very small (smaller by about eight orders of magnitude) compared with K/(cpρ) (the so-called thermal diffusivity) and the predicted semiconvective transport is very inefficient.

The code KEPLER [75,76] implements a different diffusion coefficient for the semiconvective transport:

DSC=αSCKTDcDc+αSCKT, 4.4

where Dc is the diffusion coefficient the layer would have if the Schwarzschild criterion is used (fully efficient convection, Dc=13αMLTvcHP) and αSC is a free parameter usually fixed to 0.1 in KEPLER calculations [77].

Finally, results from recent 3D hydrodynamics simulations of layered semiconvective regions [9] have been transposed into a diffusion coefficient DSC implemented in the MESA code. The result is that in stellar conditions the mixing obtained with this coefficient is very fast and is essentially equivalent to calculations performed with instantaneous mixing in the semiconvective region [78].

4.2. Core He-burning phase in low–intermediate-mass stars

Another important evolutionary stage where semiconvection plays a major role is the core He-burning phase of low- and intermediate-mass stars, which has been widely discussed in the literature (see [7982] and references therein). If we consider a low-mass star (initial mass below approx. 2M) after the core He-flash, He-burning is efficient in a convective core with central values log(Tc)8.07±0.01, log(ρc)4.25±0.05, and a chemical composition of almost pure He. The opacity is dominated by electron scattering, but an important contribution (about 25% of the total) comes also from free–free absorption. The transformation of He to C due to nuclear burning increases the free–free opacity, hence κ, and within the convective core ∇rad increases, developing a discontinuity of ∇rad at the inner convective boundary (figure 8).

Figure 8.

Figure 8.

Time evolution of ∇rad as a function of the mass enclosed within radius r inside a core He-burning model, if the discontinuity of ∇rad is maintained. Increasing numbers define a sequence of increasing time (see text for details).

This discontinuity of the radiative gradient is clearly unphysical. In fact, in convective regions far from the surface ∇c∼∇ad, and given that in the MLT Fc∼(∇rad−∇c), there would be an increasing convective flux developing on the inner side of the convective boundary. According to [83], the presence of this discontinuity is due to an incorrect application of the Schwarzschild criterion when an MLT picture of convection is considered, for a proper implementation requires always ∇rad=∇ad at the convective side of the boundary. Another interpretation is that even a very small amount of overshooting suffices to reach the neutral ∇rad=∇ad condition at the convective boundary, by means of a self-driving mechanism.

Let us consider an overshooting such that just one radiative layer is mixed. This layer will become fully convective on very short timescales, with locally ∇rad>∇ad because of the amount of carbon mixed from the layers below. This means that it is now the formal boundary of a new enlarged convective core. A small overshooting to mix the next radiative layer will have the same effect, and so on until finally ∇rad=∇ad at the convective side of the boundary. Another possibility is atomic diffusion [84] (see §6), driven by the gradient in carbon abundance between the fully mixed convective core and the surrounding radiative layers. As carbon diffuses into the radiative He-rich layers, the local opacity increases, ∇rad becomes larger than ∇ad and these layers become convective.

This diffusion is able to ‘extend’ the convective core on short timescales (short compared to nuclear timescales), so that ∇rad=∇ad is satisfied at the convective boundary [84]. Yet another possibility to extend the formally convective core to attain ∇rad=∇ad at the convective side of the boundary is the shear instability [85] (see also §8). At the formal boundary of the convective region (where the discontinuity of ∇rad appears), we have two fluids of mean molecular weight μ1 and μ2, respectively, whose surface of separation is perpendicular to the gravity field, and have a relative velocity vtang tangential to the separation surface. In this situation vtangvc, the velocity of the overturning convective elements when they reach the formal convective boundary, as obtained from the MLT.9 In this case, a mixed transition region should appear whose width is

z=vc24g(1μ1/μ2). 4.5

By denoting with tmix the typical time to mix this region of width z, and assuming that tmix is of the order of the characteristic time of convection at the boundary of the core (as suggested by Castellani et al. [79]), i.e. tmix=Λ/vc, with ΛHP, the velocity of advancement of the border of the mixed core is vpz/tmixzvc/Λ. Combining these relations with equation (4.5) provides

vp=vc34gΛ(1μ1/μ2). 4.6

It has been shown in [79] using an MLT approach that an overshooting length z≈5 cm guarantees an advancement of the fully mixed core with speed vp103[1(ad/rad)]cmyr1. This is sufficient to enlarge the mass of the fully mixed core to the point where ∇ad=∇rad) on timescales shorter than nuclear timescales. By adopting the values of g, vc, λ and μ1/μ2 adopted by Castellani et al. [79], equation (4.5) and (4.6) provide z∼3 cm, and vp≈103 cm yr−1, consistent with results in [79].

In practical terms, at every computational time step, one can place the boundary of the fully mixed convective region at the layer where ∇rad=∇ad (figure 9).

Figure 9.

Figure 9.

Similar to figure 8, qualitative sketch of the change in the fully mixed core when the discontinuity of ∇rad at its edge is maintained (a), and when the convective boundary is set where ∇rad=∇ad (b) (see text for details).

After this early phase, when the extension of the convective core follows the increase of ∇rad, the radiative gradient profile starts to show a minimum (profile 3 in figure 10), as a consequence of the progressive outward shift of the convective boundary. The presence of this minimum depends on the complex behaviour of the physical quantities involved in the definition of ∇rad, such as opacity, pressure, temperature and local energy flux. In this situation, outward mixing to eliminate the ∇rad discontinuity will induce a general decrease of the radiative gradient in the whole convective core (due to an average increase of He and consequent decrease of C—see profile 4 in figure 10). The radiative gradient will eventually decrease and become equal to ∇ad at the location of the minimum of ∇ad (profile 5 in figure 10).

Figure 10.

Figure 10.

Similar to figure 9, sketch of the time evolution of the radiative gradient profile near the boundary of the convective core during the central He-burning stage. (ac) The sequence of events which lead to the formation of the semiconvective zone (c). Increasing numbers denote a sequence of increasing time (see text for details).

The semiconvective region is the region between the ‘neutral’ ∇rad=∇rad point and the overlying formally convective shell whose upper boundary still displays a discontinuity in ∇rad. In fact, a full instantaneous mixing between the convective core located inside the minimum and the external convective shell would have the consequence of decreasing the radiative gradient in the whole mixed region. However, due to the presence of this minimum, ∇rad in a portion of the mixed core would become lower than the ∇ad, i.e. it would not be convective. A solution for this inconsistency is to impose a partial mixing in the formally convective shell (see, e.g. [86] for an example of implementation), such that the final chemical composition—shown in figure 11—satisfies the condition ∇rad=∇ad (profile 6 in figure 10). The mass location of the minimum of the radiative gradient moves outwards with time, because of the evolution of the chemical abundances caused by nuclear burning. As a final result, the C-enriched region increases its size outwards.

Figure 11.

Figure 11.

He-abundance profile inside the convective core and the semiconvective zone at various levels of central He depletion, for a low-mass core He-burning model. The dashed line marks the location of the fully mixed core boundary.

The effects of semiconvection on the evolution of these models are the following: the evolutionary tracks perform more extended loops in the HRD, and the central He-burning phase lasts longer because of a larger amount of fuel to burn.

When the central He abundance has decreased to about Y ∼0.10, α-captures by C nuclei tend to overcome C production by 3α reactions, thus He-burning becomes mainly a 12C+α production of oxygen, whose opacity is even larger than that of 12C. This causes an increase in the size of the semiconvective region and, in turn, more fresh helium is transferred into the core, which is now nearly He-depleted. Even a small amount of He added to the mixed core enhances the rate of energy production, thus the luminosity increases, driving an increase in the radiative gradient. As a consequence, a phase of enlarged mixed zone starts, the so-called breathing pulse. After this breathing pulse, the star readjusts to burn steadily in the core the fresh He driven there by semiconvection. Detailed calculations show that a few breathing pulses are expected before the complete exhaustion of He in the core. The evolutionary effects of the breathing pulses are the following: the models perform a loop in the HRD at each pulse, the He-burning lifetime is slightly increased and the mass of the CO-core at He exhaustion is increased.

Empirical constraints—mainly the number ratio between horizontal branch (HB—core He-burning phase in low-mass stars) and AGB stars in Galactic globular clusters (GGCs)—suggest that the efficiency of the breathing pulses phenomenon is very low, if any [87,88]. Therefore, they are usually inhibited in stellar model computations by using ad hoc numerical assumptions [82,89]. Typically, during the late stages of core He-burning one forces the extension of the mixed region not to lead to an increase in the central He abundance from one model to the next [87]. Another option is to set to zero the gravitational term in the energy generation equation for the inner regions. In this way, the breathing pulses are also effectively inhibited [82].

This brief discussion highlights clearly the difficulty in modelling core mixing in HB stars. Notice that the standard treatment is usually instantaneous mixing, which may not be adequate in a semiconvective regime. None of the diffusive semiconvective formalisms employed for semiconvection related to H-burning is usually applied to this situation, although the diffusive mixing employed in the STARS code still leads to the onset of breathing pulses during the core He-burning phase. It is also worth recalling that with the inclusion of an extended overshooting region (approx. 1 HP) beyond the layer where the ∇rad discontinuity develops, the fully mixed core is always so large that the need to include semiconvective layers disappears [90] (breathing pulses still seem to appear also with large overshooting). An important consequence of including semiconvection, or large overshooting, and/or breathing pulses, is that the CO profile in the final CO core changes, with important consequences for the cooling times of the final WD stage.10

With increasing stellar mass, the weight of the free–free opacity decreases (because of higher core temperatures) and eventually all these problems disappear in the regime of intermediate-mass stars (masses above a few solar masses).

5. Thermohaline mixing

In recent years, the role played by thermohaline mixing in stellar evolution has been widely explored in connection with low-mass RGB evolution.11 When the convective envelope deepens after the TO, the surface chemical composition is altered due to the dredge-up of H-burning processed matter—the so-called first dredge-up—that increases the abundance of N and He, and decreases C and the 12C/13C ratio. The first dredge-up is completed when the convective region reaches its maximum extension, approaching closely (but not reaching) the H-burning shell around the inert (and electron degenerate) He-core. From this moment on the receding convective boundary is not expected to modify further the surface abundances along the RGB. Spectroscopic observations of metal-poor Galactic halo stars provide, however, compelling evidence for an additional mixing process occurring when RGB stars reach the luminosity of the RGB bump (figure 12), which causes a sudden drop of the isotopic ratio 12C/13C, a decrease in Li and C, and an increase in N. This mixing affects approximately 95% of low-mass stars, regardless of whether they populate the halo field or clusters [94,95], and thermohaline mixing has been proposed to explain these observations.12

Figure 12.

Figure 12.

H-abundance profile within a low-mass stellar model at the start (dashed line) and at the completion (solid line) of the first dredge-up, respectively. The horizontal coordinate displays the local value of the mass. At the completion of the dredge-up, the He-core mass is equal to approximately 0.2M (it was approx. 0.14M at the start of the dredge-up) and the H-abundance discontinuity left over by the fully mixed convective envelope at its maximum extension is located 0.25M away from the centre. The varying H abundance in the layers between the He-core and the discontinuity has been produced during core H-burning along the MS. When the H-burning shell reaches the H-abundance discontinuity the RGB model experiences a temporary drop in luminosity (that in a stellar population produces a local increase in star counts, the so-called ‘RGB bump’, before evolving again with increasing L when the discontinuity is crossed.

In low-mass stars, the main H-burning mechanism during the MS is the p-p chain that, due to its weak dependence on temperature, is efficient also in stellar layers far from the star centre. As a consequence, 3He accumulates in a broad zone outside the main energy production region. During the first dredge-up, this 3He is mixed within the convective envelope, with the consequence that, during the following RGB evolution, the layers above the H-discontinuity left over by the receding convective envelope at its maximum extension will have a uniform 3He abundance, larger than the initial one.

When the shell advances towards the surface during the RGB evolution, in the outer wing above the point of maximum burning efficiency there is a narrow region where 3He is processed through the reaction 3He(3He,2p)4He. In this nuclear reaction two nuclei transform into three and the mean mass per nucleus—the molecular weight μ—decreases. This leads to a small local decrease of μ when moving from the surface towards the centre of the star. As long as the H-burning shell advances through layers below the H-discontinuity—when the star evolves before the RGB bump, see figure 12—this effect is negligible because the shell is moving in a region with a large positive gradient, due to the H profile left over at the end of the MS. However, when the H-burning shell enters the region of uniform H abundance above the discontinuity, the local inversion of the μ profile, of the order of one part in 104, becomes important [97]. This situation corresponds to the conditions for thermohaline mixing (see [98] and references therein).

Thermohaline mixing is usually included in stellar evolution codes as a diffusive process that tends to erase the molecular weight inversion, with a diffusion coefficient derived from a linear analysis [99,100]

Dth=CthKcpρ(ϕδ)μ(radad), 5.1

where Cth=(83)π2α2, α being a free parameter related to the aspect ratio (length/width) of the mixing elements.13

As shown in figure 13, evolutionary calculations show that thermohaline mixing extends between the outer wing of the H-burning shell and the inner boundary of the convective envelope, merging with the outer convection in a short time (approx. 30 Myr for a model with mass of the order of 1M). Therefore, depending on the mixing efficiency (hence the choice of Cth), a significant amount of nuclear-processed matter in the hotter layers of the H-burning shell can be dredged up to the surface during the remaining RGB evolution, helping to explain the spectroscopic data.

Figure 13.

Figure 13.

Location of the mass boundaries as a function of time (zero point taken at the completion of the first dredge-up) of the convective envelope, H-burning shell and the zone mixed by the thermohaline instability, for a low-mass RGB stellar model.

Comparisons with observations require Cth∼100−300 from Li observations in GGCs, and Cth∼1000 for C in globulars, clearly mutually inconsistent. On the other hand, models in [92,98] are able to match the 12C/13C isotopic ratio, C and N abundances in halo RGB field stars with Cth∼1000; the same choice of Cth allows also to match measurements of the carbon isotopic ratio and the [N/C] ratio in open clusters, and Li abundances in field disc stars. Results from 3D hydrodynamics simulations (rescaled to match stellar conditions) add an additional source of uncertainty, for they predict an efficiency equivalent to just Cth≈10 [102,103].

A recent detailed numerical analysis [101] has shown that the surface chemical abundances predicted by stellar models accounting for thermohaline mixing depend on numerical assumptions like spatial and time resolution adopted in the computations. As a consequence, the predicted surface chemical abundances should be treated with caution until a firmer assessment on how to treat thermohaline mixing in model computations is achieved.

6. Atomic diffusion

Microscopic effects related to collisions among the gas particles induce a slow element transport within radiative regions. It is possible to show from first principles that individual ions are forced to move under the influence of pressure as well as temperature gradients, which both tend to displace the heavier elements towards the centre of the star, and of concentration gradients that oppose the above processes. Radiation—which does not have a major effect in the Sun [104]—pushes the ions towards the surface, whenever the radiative acceleration imparted to an individual ion species is larger than the gravitational acceleration. The speed of the diffusive flow depends on the collisions with the surrounding particles, as they share the acquired momentum in a random way. It is the extent of these ‘collision’ effects that dictates the timescale of element diffusion within the stellar structure, once the physical and chemical profiles are specified.

The most general treatment for the element transport in a multi-component fluid associated with diffusion is provided by the Burgers equations [105]. They are obtained assuming the gas particles have approximate Maxwellian velocity distributions, the temperatures are the same for all particle species, the mean thermal velocities are much larger than the diffusion velocities and magnetic fields are unimportant, and they can be written as

dpidr+ρi(ggrad,i)niZ¯ieE=jiNKij(wjwi)+jiNKijzijmjrimirjmi+mj, 6.1

including the heat flow equations,

52niKBT=52jiNzijmjmi+mj(wjwi)25KiiziirijiNKij(mi+mj)2(3mi2+mj2zij+0.8mimjzij)ri+jiNKijmimj(mi+mj)2(3+zij0.8zij)rj. 6.2

In addition, there are the constraints of electric current neutrality,

iZ¯iniwi=0 6.3

and local mass conservation,

iminiwi=0. 6.4

In the above 2N+2 equations pi, ρi, ni, Z¯i and mi denote the partial pressure, mass density, number density, mean charge and mass for species i, respectively. The total number of species (including electrons) is N. The 2N+2 unknown variables are the N diffusion velocities wi, the N heat fluxes ri, the gravitational acceleration g (the comparison of the derived g with the known value from the integration of the stellar structure equations provides an important check for the consistency of the results) and the electric field E. The coefficients Kij, zij, zij and zij have to be specified, together with the radiative accelerations grad. Several stellar evolution codes—in one form or another—use the routine by Thoul et al. [106] to solve the Burgers equations and calculate the velocities of the various chemical species. These diffusion velocities can then be inserted as an advection term in the equation for the time evolution of the mass fraction abundance Xi

Xit|Mr=1ρr2r(ρr2Xiwi), 6.5

where we show on the right-hand side just the contribution of diffusion to the time evolution of the abundance Xi of element i.

In Burgers’ formalism, the effect of collisions between ions is represented by the so-called resistance coefficients, i.e. the matrices K, z, z, z′′, whose precise evaluation is essential to estimate correctly the diffusion timescales for the various elements. These resistance coefficients can be expressed in terms of the so-called reduced collision integrals Ωij(l,s) according to the following relationships:

KijKij0=4Tij2Ωij(1,1)ln(Λij2+1), 6.6
zij=11.2Ωij(1,2)Ωij(1,1), 6.7
zij=2.56Ωij(1,2)4.8Ωij(1,3)Ωij(1,1) 6.8
andzij=2Ωij(2,2)Ωij(1,1), 6.9

where

Tij=KBTλD|ZiZje2| 6.10

and

Kij0=232μijπ(KBT)3(ZiZje2)2ninjln(Λij2+1), 6.11

with λD being the Debye length; Λij=4Tij the plasma parameter (ln(Λij) is called Coulomb logarithm); μij=mimj/(mi+mj) the reduced mass; and Zi, mi and ni the charge number, mass and particle number density of species i, respectively.

The collisions between particles of species i,j in the stellar plasma determine the values of the Ωij(l,s) integrals; the physics of the collisions is specified by some form of the Coulomb interaction. This, as a first approximation, can be described by a pure Coulomb potential with a long-range cut-off distance, typically equal to λD. Using this truncated pure Coulomb potential, the resistance coefficients originally computed by Burgers become [105]

KijKij0=2lnΛijCE±π2/4ln(Λij2+1), 6.12
zij=0.6, 6.13
zij=1.3 6.14
andzij2, 6.15

where CE is Euler’s constant and the ± signs denote repulsive (+) and attractive (−) pairs of particles, respectively. More accurate collision integrals have been obtained by considering a Debye–Hückel type of potential

Vij(r)=ZiZje2rer/λD, 6.16

where r is the particle distance. The results are provided either in tabulated form [107] or as fitting formulae [108112], and some of them are compared in figure 14, as a function of Φij=ln(ln(1+Λij2)). Given that the calculations shown in the figure (apart from Burgers’ results) make all the same physical assumptions, results are similar, but they all differ significantly from Burgers’ results when moving towards higher densities (lower values of Φij), where his approximations are no longer adequate.

Figure 14.

Figure 14.

Resistance coefficients K (a), z (b), z (c) and z′′ (d) as originally computed by Burgers [105] (dotted lines), [107] (solid lines), [108] (short-dashed line), [110] (dashed-dotted lines) and [109] (long-dashed line) as a function of Φij=ln(ln(1+Λij2)). Dark black lines denote the case of a repulsive potential, while the brighter ones denote attractive forces (not computed in [108,109]). The dots represent the actual values computed by Muchmore [108], not his fitting formulae.

Some of these authors argued that while λD is a suitable screening distance at low densities, the Debye sphere loses its significance in denser plasmas, and that a more appropriate screening distance is in this case the mean interionic distance. Hence, they suggested to use the larger value between λD and the mean interionic distance (in the Sun, the former has always been larger than the latter). Whatever the choice for the actual screening distance, its value has to be employed as λD in equation (6.10) to compute the appropriate Λij for determining the collision integrals. The effect of quantum corrections on the resistance coefficients has been included in [111] while the calculations [112] account for very recent developments in ionic transport properties in strongly coupled plasmas [113].

As for the radiative levitation of element i, the main physical interactions that drive the transfer of momentum from the radiation field to ions are bound–bound and bound–free transitions, while it is practically ineffective for fully ionized elements. The radiative acceleration can be written as [114]

grad,i=μκμicl4πr2γi 6.17

where μ is the mean atomic weight, μi is the atomic weight of element i, l is the local luminosity and r is the radius [114]. The dimensionless quantity γi depends on the monochromatic opacity data

γi=(σi(u)[1exp(u)]ai(u))duifiσi(u),

where u=()/(KBT), σi is the cross section for absorption or scattering of radiation by element i, ai accounts for the fact that in bound–free transitions only a fraction of the momentum of the ionizing photons is transferred to the ion, the rest being transferred to the electron lost by the ion, and fi is the number fraction of element i. The Opacity Project provides a set of codes called OPSERVER that enables the calculation of grad,i [115].

Calculations of precise grad,i values involve carrying out the integration over about 104 u values for each atomic species [116]. Given that these calculations have to be repeated at each layer in the stellar model and at each time step during the computation of an evolutionary sequence, this explains why, to date, there are only few extended sets of stellar models that include also the effect of radiative levitation. One has also to note that the Rosseland mean opacity entering the stellar structure equations as well as equation (6.17) has to be continuously recalculated at each mass layer, not just in the nuclear-burning regions—this is in principle true also for calculations including only the other diffusive processes listed above—to be fully consistent with the composition changes.

As an example, figure 15 displays the total radiative acceleration of iron in a stellar envelope. When the radiative acceleration is larger than gravity below a convective envelope—if convection is present, the effect is obviously negligible because of the much faster timescales of convective mixing—the element can diffuse towards the surface. Figure 16 displays the velocity profiles of H (moving upwards), He and CNO (sinking) within a solar model, derived from the solution of the Burgers equations.

Figure 15.

Figure 15.

Total radiative acceleration of Fe as a function of the temperature in a stellar envelope with the labelled Teff and chemical composition, for three values of the surface gravity log(g), equal to 3.8 (short-dashed line), 4.0 (dotted line) and 4.2 (solid line), respectively. The run of the local gravity with temperature is displayed by long-dashed lines.

Figure 16.

Figure 16.

Diffusion velocity of H, He, C, N, O (the lines displaying the diffusion velocities of C, N and O almost overlap), as a function of the local fraction of the total radius, within a solar model. The velocity is in units of solar radius over the typical diffusion timescale for the Sun, that is of the order of 6×1013 yr. The base of the convective envelope is at R/R∼0.71.

The diffusion velocities are always dominated by the effect of pressure gradients. The radiative (upwards) acceleration of C, N and O is only at most about 5% of the local acceleration of gravity just below the convective envelope boundary [104], and decreases fast moving towards the centre. The local sharp increase of the settling velocity of C and the corresponding small decrease for N at r/R∼0.15–0.20 are due to the effect of abundance gradients when these elements attain the equilibrium abundances of the CN cycle (C decreases, while N increases).

6.1. The effect of atomic diffusion on stellar models

Atomic diffusion (sometimes denoted as microscopic diffusion, and hereafter simply diffusion) has a major direct effect on the MS evolution, the chemical stratification of the external layers of hot horizontal branch stars14 and WDs, and the internal chemical stratification of cold WDs. However, some properties of other evolutionary phases are also indirectly affected, as discussed below [84,117121].

The impact of diffusion on stellar models will be discussed mainly in the context of low-mass stars, due to their evolutionary timescales comparable to the diffusion timescales during the MS phase. One has also to take into account that massive hot stars, where in principle radiative levitation can be extremely efficient, experience strong mass loss and rotational mixings (see later on in this section and §8) that tend to limit or completely erase the effect of diffusion.

If we consider the evolution of a typical low-mass star with a convective envelope on the MS, the surface abundances of metals and He tend to decrease because of diffusion from the bottom boundary of the shrinking convective envelope (that maintains a uniform chemical profile due to the shorter convective timescales compared to diffusion) being replaced by hydrogen. However, if the radiative acceleration on some ion species is larger than the local gravitational acceleration below the convective envelope, these elements are slowly pushed into the convective zone and their surface abundance increases. In general, the variation of the surface abundances during the MS phase is dictated by the interplay between radiative levitation and the sedimentation due mainly to pressure gradients (often denoted as gravitational settling). This variation of the surface abundances reaches a maximum around the TO.

When, after the TO, the convective envelope starts to deepen, it will rehomogenize an increasingly larger fraction of the stellar mass. Once the star reaches the RGB and the first dredge-up is completed, the abundance changes previously developed are almost completely erased, apart for the very inner layers, where metals and He were sinking during the MS. In general, the smaller (in mass) the convective envelope, the larger is the change of surface abundances during the MS, because of the smaller diluting buffer of matter with the initial chemical composition.

Figure 17 compares the HRD of low-mass, metal-poor evolutionary tracks, calculated with and without the inclusion of atomic diffusion. The track with diffusion increasingly diverges from the no-diffusion one moving along the MS. Its TO is fainter and cooler, while the two tracks coverge again along the RGB, where the effect of diffusion is practically erased by the fully mixed deep convective zone. The MS evolutionary times are also different, for the track with diffusion has a shorter lifetime because of the slow increase of He at the centre at the expenses of H during the MS phase.15 These changes of TO luminosity and MS lifetime at a given stellar mass affect also theoretical isochrones, hence age determinations of old stellar populations based on the TO luminosity decrease by about 10%.

Figure 17.

Figure 17.

Comparison of the HRD of two 0.82M, Y =0.25, Z=0.001, evolutionary tracks calculated with (solid line) and without (dashed line) atomic diffusion.

Figure 18 shows the pattern of variations of TO surface abundances, for a typical stellar model at the TO of a metal-poor globular cluster. These models have thin convective envelopes that maximize the effect of diffusion.

Figure 18.

Figure 18.

Difference between the TO surface abundances of the most relevant elements, and their initial values, for a low-mass metal-poor model (see labels). The element abundances are given as log(N/H), e.g. as logarithms of their number fraction to hydrogen [122]. These abundance variations are shown for models calculated with no mass loss, and two different mass loss rates during the MS evolution.

It is straightforward to notice the selective effect of diffusion, due to radiative levitation. Elements like He, Li, C, N and O sink below the convective layers and their surface abundances are severely depleted up to a factor of about ten. Other elements like Mg, Si, S and Fe are pushed into the convective envelope by the radiation pressure and their surface abundances increase during the MS. The size of these abundance variations decrease in models with larger convective envelopes. For example, a 0.8M model with initial [Fe/H]=−0.7 will show uniformly depleted abundances for all elements heavier than H at the TO, at a level of just 0.1–0.15 dex.

After the first dredge-up is completed, these abundance variations basically disappear. The He abundance post dredge-up is slightly lower in the model with diffusion (ΔY ∼0.01), the RGB bump magnitude decreases by approximately 0.07 mag, the He-core mass at the He-flash increases by a few 0.001M. As a consequence, the RGB tip luminosity increases at the level of just approximately 0.01 mag. The abundances of metals (other than CNO) in the He-core are only a few 0.01 dex more abundant than originally, while they (and He) are less abundant than originally by about the same amount around the core.

Moving to the following HB phase, the zero age horizontal branch (ZAHB) luminosity decreases by approximately 0.02 mag compared to the no-diffusion case, driven by the lower amount of He in the envelope, but the major effect is on the surface abundances during the following HB evolution, as shown in figure 19. At Teff above approximately 6000 K, the surface chemical composition of the models starts to be altered by diffusion; the selective effect increases during the HB evolution and is more pronounced for lower HB masses that evolve at higher Teff. The abundance of He tends to decrease, while overall the surface metal content increases. Figure 19 displays, as an example, the predicted behaviour of the surface Fe abundance in metal-poor HB models. One can notice an increase in surface Fe by a factor of approximately 10 within the first 10 Myr, and up to a factor of approximately 100 after 60 Myr.

Figure 19.

Figure 19.

(a) Displaying the surface [Fe/H] values measured in a sample of stars in the metal-poor GGCs M15, M68 and M92, as a function of their Teff. The evolution (dotted segments for an HB age between 0 and 10 Myr, solid segments for ages between 30 and 60 Myr) of the surface [Fe/H] for HB models with the labelled masses including diffusion is also displayed [120]. (b) Displaying the projected rotation velocities for the cluster stars in the top panel. The red points (with error bars) identify stars with a different behaviour regarding the correlation between Teff, rotation velocity and surface [Fe/H] (see text for details).

These surface abundance changes are again erased by the deepening envelope convection for all HB masses that evolve to the following AGB phase. The subsequent hot WD phase sees diffusion altering again in a major way the surface abundances. Owing to the high surface gravity, gravitational settling of metals (down to the edge of the electron degenerate CO core) in the doubly layered H and He (DA type) or He-dominated (non-DA type) envelopes is very fast. Timescales depend on the dominant element, the envelope thickness, Teff and gravity, but they are at most of the order of 106 yr [119]. Finally, in cool WDs, when the electron degenerate CO core is in the liquid phase, the slow gravitational settling of 22Ne lenghtens the cooling times of faint WDs from high metallicity progenitors by up to approximately 1 Gyr, about 10% or more of the total cooling time of the models [123127]. This element is the most abundant impurity present in the CO core, and originates from 14N during the core He-burning phase. Owing to the two additional neutrons hosted by the 22Ne nucleus relative to 12C and 16O ions, this element experiences a net downward gravitational force and slow settling in the liquid region of the core (inhibited in the solid regions, due to the expected sudden increase of viscosity). This regime is much denser than the case treated by the Burgers equations, and the 22Ne diffusion velocity w22Ne is determined from a diffusion coefficient D22Ne estimated from molecular dynamics calculations for a strongly coupled plasma [128]:

w22Ne=2mpgD22NeKBT,

where mp is the proton mass and g is the local acceleration of gravity.

The most recent estimate of the diffusion coefficient D22Ne in the liquid phase provides

D22NeD00.53[Z¯Z22Ne]2/3(1+0.22Γ)exp(0.135Γ0.62). 6.18

In this equation, Z¯ denotes the average atomic number of the degenerate core chemical composition, and Γ the Coulomb parameter (1<Γ<180 in the liquid phase) defined as

Γ=Z5/3¯e2aeT, 6.19

Here, Z5/3¯ is an average over the ion charges, T is the temperature and the electron sphere radius ae is equal to (3/4πne)1/3 with ne=Z¯n the electron density and n the ion density.

In addition,

D0=3ωpa2Γ4/3

with ωp the plasma frequency

ωp=[4πe2Z¯2nM¯]1/2,

with M¯ denoting the average mass of the ions.

The diffusion of 22Ne increases the energy budget of the WD (hence increases the cooling times) through the contribution (∂U/∂μ)T,V(∂μ/∂t) to the gravitational energy generation coefficient ϵg in the equations of stellar structure (see equation (2.5)).

Moving to stars with initial masses in the range approximately 1.5–3.0M, which display vanishing convective surface layers during the MS, calculations with diffusion show the development of surface abundance variations already during the pre-MS [129]. During the MS phase, they display large surface abundance variations (like He and Ca underabundances, iron-peak element overabundances); moreover, local enhancements of iron and nickel abundances below the thin surface convective layers lead to extra convective zones that, in some cases, can trigger pulsations through the iron-induced κ-mechanism [130].

6.2. Inhibition of the efficiency of atomic diffusion

As already mentioned, diffusion is an element transport mechanism that comes from basic physics principles, and should be efficient in stars. Helioseismic observations tell us that diffusion is efficient in the Sun; its inclusion in stellar models improves the match of the inferred sound speed profile, as well as the present depth of the convective envelope and its He abundance [131,132]. The very efficient diffusion of metals from the WD envelopes is also confirmed by observations, whereby for the vast majority of WDs the chemical composition of the surface is either pure H or pure He.16 Also the diffusion of Ne in the CO liquid cores of WDs is indirectly favoured by the comparison of TO and WD ages for the old metal-rich open cluster NGC6791. The effect of Ne diffusion increases the age obtained from the WD luminosity function, ensuring agreement with the TO age [126].

However, existing spectroscopic measurements of surface element abundances (e.g. Fe) in a sample of GGCs contradict the predictions of stellar models that include uninhibited diffusion [120,133136]. In a star cluster, the initial mass of the stars evolving in post-MS stages is essentially constant, and measurements of chemical abundances at the TO and the base of the RGB should display a difference, due to the effect of diffusion on evolutionary tracks discussed before, which is maximized at the TO and essentially disappears along the RGB. For example, in the case of NGC6397, with [Fe/H]∼−2.1 along the RGB, the TO Fe abundance should be approximately 0.3 dex lower, and the Ca abundance approximately 0.1 dex larger than on the RGB according to models calculated with diffusion; observations show instead TO Fe and Ca abundances approximately 0.15 and approximately 0.1 dex lower than on the RGB, respectively [137].

This points to a reduction of the effect of diffusion on the surface abundances by some competing mechanism. Nothing of course can be said about the efficiency of diffusion in the inner layers.17 The same qualitative result is found in more metal-rich clusters like NGC6752 and M4, and also on the HB of a number of clusters, as shown by figure 19. In this figure, one can clearly see that for HB stars hosted by three metal-poor GGCs, the surface Fe abundance increases abruptly when Teff goes above approximately 11 000 K, while theory predicts Fe enhancements (compared with the initial value [Fe/H]=−2.3 dex typical of these clusters) at much lower temperatures.

Also comparisons of models in the mass range between approximately 1.5 and 3.0M, calculated including diffusion with surface abundances measured in A and F stars, show that diffusion must be moderated by some competing mechanism [129,139,140]. For example, in samples of A and F stars belonging to the Hyades, Pleiades and Coma Berenices Galactic clusters, observed abundances of elements like Na, Fe and Ni display a general pattern and star-to-star scatters that point to diffusion being moderated to different degrees by some competing process [141].

Finally, the observed constant surface Li abundance observed in field halo stars with Teff above approximately 5500–6000 K and [Fe/H] below approximately −1.5 dex—the so-called Spite plateau [142]—is also problematic for models including diffusion, because they predict for these stars a progressive decrease of surface Li with increasing Teff, which is not observed [143,144].

6.2.1. Rotational mixing

As we will see in §8, rotational mixing can, in principle, counteract the effect of diffusion, and this has been invoked to explain the delayed (in terms of Teff) onset of diffusion in hot HB stars [145]. This inference is related to the distribution of observed projected rotational velocities for the hot HB stars of figure 19, as shown in the lower panel of the same figure. All observed stars with Teff above approximately 11 000 K that show spectroscopically the signature of diffusion are very slow rotators, whereas at lower Teff rotation rates are on average higher. Estimates (not based on fully consistent stellar evolution models) of the competing effects of meridional circulation and diffusion, show that moderation/inhibition of diffusion below Teff∼11 000 K by rotational mixing is possible [145]. Interestingly, in support of this idea figure 19 shows three stars with Teff>11 000 K and large projected rotation velocities with no enhancement of surface Fe, at odds with the other slow-rotating objects at the same temperatures. The link between (partial) inhibition of diffusion and rotation is however probably not justified for A and F stars, which often show slow rotation velocities and surface abundances clearly affected by a reduced efficiency of diffusion, compared with the model predictions.

6.2.2. Mass loss

Another process that can moderate diffusion in the stellar outer layers is mass loss [122,146,147]. Assuming that mass loss (hereafter ML) is spherical and unseparated (chemical composition of the wind equal to the photospheric composition), its effect is simply to peel off the outer layers of a model. Simple mass conservation constraints, coupled to the fact that the structure of the star is unaffected by the amount of mass lost between two consecutive computational time steps, translate into the appearance of an outward flowing interior velocity due to the wind [148] given by

vw(r)=M˙4πr2ρmrM, 6.20

where ρ is the local density at radial distance r from the centre, mr is the mass interior to r, M* is the total mass and M˙ (negative) is the mass loss rate.

Figure 18 displays the effect of ML on the surface abundances of a typical model at the TO of a metal-poor GGC. For elements affected mainly by gravitational settling, the effect of ML is to increase their surface abundance compared with the pure diffusion case. The reason is that with the chosen mass loss rates, below the convective envelope vw counteracts the settling velocity, hence the degree of depletion is reduced. Larger negative values of M˙ imply larger positive vw and less diffusion from the envelope. In the case of elements supported by radiative levitation in some layers below the convective envelope, the effect of ML on the surface abundances is more complex, and this is due to the interplay between vw and the depth at which grad is larger than the local gravitational acceleration g. For example, let us assume an ML rate M˙=1013 and an age of 10 Gyr; if vw is larger than the settling velocity down to ΔM=10−3M* from the surface, and in those layers grad,i>g for a generic element i, after 10 Gyr the wind will have advected to the surface matter at that location, and the surface abundance of element i will be enhanced compared with the initial value.

This complex behaviour of the elements supported by radiative levitation can be seen in figure 18. For an element like Mg an ML rate M˙=1013Myr1 transforms the surface overabundance of about 0.10 dex into an underabundance of about 0.4 dex. A further increase of M˙ reduces the underabundance to about just 0.1 dex. In the case of Ca, the overabundance caused by diffusion is instead increased by an ML rate M˙=1013Myr1, and then transformed into an underabundance when M˙=1012Myr1.

ML rates of the order of ≈10−12M yr−1 are required to explain the Spite plateau and the behaviour of surface abundances in the metal-poor GGC NGC6397 [122,146]. These rates are unlikely, being much larger than the solar current mass loss rate M˙=2×1014Myr1.

Rates of the order of ≈10−13M yr−1 are invoked to explain at least some of the surface abundance patterns seen in A and F stars [148].

6.2.3. Thermohaline mixing

The modifications of the chemical abundance profiles caused by diffusion lead to variations of the local mean molecular weight. For example, He always sinks, introducing a stabilizing contribution to the local μ-gradient. On the other hand, heavy elements supported by radiative levitation will lead to a destabilizing μ-gradient (increasing outwards). If this second effect prevails, these layers are subject to thermohaline mixing (see §5), which will tend to erase the abundance changes, and hence to moderate the effect of diffusion.

Published calculations of models for A and F stars, including both diffusion and thermohaline mixing, show that thermohaline mixing (employing the diffusion coefficient derived by Brown et al. [149]) plays an important role in moderating the effect of diffusion on the surface abundances, although the surface abundances of the models do not yet match observations [140]. Both processes have been included also in calculations to explain the observed population of carbon-enhanced metal-poor stellar stars [150].

6.2.4. Generic turbulence

A pragmatic solution adopted in several investigations is to add by hand some turbulence able to counteract the efficiency of atomic diffusion in the outer layers. Turbulence here means simply an additional ad hoc term in the chemical evolution equations that acts towards suppressing the development of chemical abundance gradients, not explicitly connected to a specific physical mechanism. Equation (6.5), which showed the contribution of diffusion to the time evolution of the abundance Xi of element i, is therefore modified to include an additional term

Xit|Mr=1ρr2r(Dturbρr2Xir)1ρr2r(ρr2Xiwi), 6.21

where Dturb is the chosen turbulent diffusion coefficient.

The literature presents at least three different choices for Dturb. To date, the only formulation employed to interpret several different sets of data is the following [118,139,151]:

Dturb,Rich=fD(He)0(ρρ0)n, 6.22

where

D(He)0=[3.3×1015T2.54ρln(1+1.125×1016T3ρ)]0

is the He diffusion coefficient18 at a certain reference depth in the approximation of trace amount in an ionized hydrogen plasma. The subscript 0 indicates the reference depth in terms of either a reference density ρ0 or, more often, a reference temperature T0. In this latter case, ρ0 is the density at the layer where T=T0. The turbulent diffusion coefficient Dturb is f times the He diffusion coefficient at the reference depth 0, and varying as ρn.

A choice of parameters f=400, log(T0)=5.5, n=−3 (denoted as T5.5D400-3) allows to match the abundances of A and F stars, while a T6.0D400-3 or T6.09D400-3 choice of parameters in equation (6.22) reproduces the flatness of the Li Spite plateau [118]. Spectroscopy of stars in NGC6397, NGC6752 and M4, three GGCs spanning a range of about 1 dex in initial metallicity, suggests choices between T6.0D400-3 and T6.2D400-3 to match the observations [133,135,152]. Interestingly, for the approximately 4 Gyr old, solar metallicity open cluster M67, predictions from stellar models with uninhibited diffusion are generally consistent with observed abundances [153], although the predicted abundance variations are small.

Two additional forms for Dturb have been proposed, but not widely applied, to comparisons with spectroscopic observations. The first one applied to low-mass star models with convective envelopes is

Dturb,VdB=f(ρBCZ/ρ)3(1(MBCZ/Mtot))n, 6.23

with f=15 and n=1.5 [154], where BCZ denote quantities at the lower boundary of the convective envelope and Mtot is the total mass of the model. Owing to the steep power-law dependence of the density ratio, Dturb becomes negligible in the nuclear-burning regions, ensuring that the assumed turbulence will not affect the inner chemical profiles resulting from nucleosynthesis and gravitational settling.

The other proposed parametrization is proportional to the radiative viscosity [155]

Dturb,visc=f4aT415cκρ2, 6.24

where f is a free parameter that is constrained in the range f=10.2+2.0 by helioseismic observations, and spectroscopy of Hyades stars and OB stars (in this latter case assuming no competing effect from ML).

Figure 20 compares the different formulations for Dturb discussed above, for the outer layers of a 0.8M, [Fe/H]=−1.3 model, in the latter phase of its MS evolution. In the case of Dturb,Rich, we display both the T6.0D400-3 and T6.2D400-3 results, the ones that allow a match to results from spectroscopic observations, as discussed before. Notice how when moving the reference temperature from log(T0)=6.0 to log(T0)=6.2, the entire profile of Dturb,Rich shifts to deeper layers, thus counteracting the efficiency of diffusion in deeper stellar regions. Also, Dturb,VdB (displayed with the choice of parameters recommended in [154]) appears to be very similar to Dturb,Rich for the T6.2D400-3 choice. On the other hand, both absolute values and trends of Dturb,visc (for f=1) with mass depth are very different from the other cases.

Figure 20.

Figure 20.

Comparison of Dturb,Rich solid line for the T6.0D400-3 choice of free parameters, dotted line for T6.2D400-3), Dturb,VdB (dashed line) and Dturb,visc (dotted-dashed line) as a function of the mass location within a 0.8M, [Fe/H]= −1.3 model, in the latter phase of its MS evolution. The vertical thin line marks the bottom of the convective envelope, while the thin dashed line displays the diffusion coefficient of He (see text for details).

For the sake of comparison, we also show the diffusion coefficient of He (that is never supported by radiative levitation) at the edge of the convective envelope and below. All Dturb choices displayed in the figure are able to counteract the diffusion of He from the bottom of the convective zone. Even Dturb,visc, that is the smallest turbulent diffusion coefficient in the figure, matches the one for He, hence is able to inhibit its settling below the convective envelope.

7. Phase separation in white dwarfs

Besides convection in the non-degenerate envelopes and the diffusion of 22Ne in the liquid core, another important process that redistributes the chemical abundances in cold WDs is the so-called ‘chemical separation’ upon crystallization [156]. In a nutshell, when a layer with given C and O abundances in the liquid phase crystallizes (we assume for the moment the core being made of just C and O, given that the sum C+O is always more than approximately 95% by mass in any realistic WD model), the equilibrium composition in the solid phase—given by the phase diagram of the CO mixture—is very likely to be different. This will cause a gradual change of the overall chemical profiles within the CO core that impacts the energy budget and cooling times, as detailed below.

Figure 21b displays a phase diagram for the CO mixture [158], with the regions of liquid and solid phase labelled. For heuristic purposes, let us assume a C mass fraction XC=0.50 uniformly throughout the core (hence the oxygen mass fraction XO is also approx. 0.50). When the mixture starts to crystallize at the centre (the value of the Coulomb parameter Γ=180 for the onset of crystallization is reached first in the centre, because of higher densities), the chemical composition in the solid phase is determined as follows. One needs to draw a vertical line in the phase diagram of figure 21 with the horizontal coordinate equal to 0.50, which runs through the region belonging to the liquid phase until it intersects the upper line describing the phase diagram. The vertical coordinate of the intersection point gives the crystallization temperature of the WD centre (corresponding to T∼1.3TC, where TC denotes the crystallization temperature of a pure C mixture). From this point, one has to draw a horizontal line that will intersect the lower segment of the phase diagram in correspondence with a carbon abundance XC∼0.30. This is the equilibrium abundance of carbon in the solid phase (the corresponding oxygen abundance is XO=1−XC).

Figure 21.

Figure 21.

(a) Oxygen profile (in mass fraction) of a 0.61 M WD at the beginning of the thermal pulsing phase of the progenitor (dotted line), and the same after rehomogenization due to the μ inversion (dotted-dashed line) and after complete crystallization (solid line) [157]. (b) Phase diagram of the CO mixture adopted to calculate the profiles in the left panel [158]. TC denotes the crystallization temperature of a pure C composition, XC the C mass fraction.

Given that XC in the now-crystallized centre is lower than the initial value, conservation of mass requires that the carbon abundance in the liquid phase at the crystallization boundary is increased (hence XO is decreased) with respect to the original value. This means that right above the crystallized boundary the molecular weight is now lower than in the overlying layers still in the liquid phase (where the ratio XO/XC is higher). An increase in molecular weight with increase in distance from the centre in the liquid phase causes an instability to develop, and the resulting fast-mixed region extends outwards in mass as long as the new uniform average XC value is higher than the abundance in the next, unperturbed layer (in this case, it will reach the edge of the CO core) [159,160]. After mixing is completed, the liquid phase will have an overall enhanced XC abundance compared to the value before the crystallization of the central layer.

Let us now suppose that the new value of XC at the boundary of the solid core is equal to 0.55 when this layer crystallizes—at a lower temperature than the core because of lower density. The abundance in the solid phase can be derived in the same way as before, and it is now equal to XC∼0.35. This implies again that right outside the newly crystallized layer the abundance XC must be higher (and XO lower) than in the overlying layer. The instability in the liquid phase ensues again and the cycle is repeated (mixing in the liquid phase eventually stopping when the carbon abundance of the newly crystallized layer becomes lower than or equal to the overlying layers still in the liquid phase) until the whole degenerate core is crystallized. The final profile of XC and XO after crystallization is completed is no longer homogeneous; XO will display central values higher than in the liquid phase which decrease from the centre outwards, while the opposite is true for XC.

This variation of the XO/XC profile (hence the local value of the molecular weight μ) in the WD core during crystallization due to the CO phase diagram has an important impact on the WD cooling times, because of the term (∂U/∂μ)T,v(∂μ/∂t) in ϵg [157]. This means that more energy is available to be released and cooling times are longer. Depending on the thickness and chemical composition of the non-degenerate layers surrounding the CO core (that regulate the energy release), delays induced by chemical separation upon crystallization can reach approximately 10% for the coolest objects.

Figure 21a displays the evolution of the O-profile for a realistic 0.61M WD model [157]. The first stage of this temporal sequence is represented by the profile in the electron degenerate core at the beginning of the thermal pulse phase of the AGB progenitor, built during the core He-burning. One can notice the local maximum in XO is about 0.25M away from the centre. The associated local maximum of the molecular weight μ causes an instability that homogenizes the internal layers during the liquid phase of the cooling [157], generating the profile that is maintained until the start of core crystallization. The final very different XO profile is attained when the whole core is crystallized, and it is produced by the chemical separation upon crystallization.

The most recent re-evaluation of the phase diagram for a binary CO mixture is displayed in figure 22 [161]. Compared to the widely used phase diagram of figure 21, it causes slightly smaller abundance variations for a given initial chemical profile in the liquid phase.

Figure 22.

Figure 22.

The most recent re-evaluation of the phase diagram for a CO mixture [161].

Within the CO core there are also small amounts of other metals—so-called minor species, with mass fractions of at most the order of the initial progenitor metallicity—that have been either processed through the previous burning phases (i.e. Ne), or are unchanged since the formation of the progenitor (i.e. Fe). Owing to the extreme complexity of calculating a multi-component phase diagram, a ternary CONe mixture is often assumed to behave as an effective binary mixture composed of neon (or iron) plus an element of average charge 〈Z〉=7 determined from the C and O abundances. Ne is important, given that 22Ne is the most abundant species after C and O, with a mass fraction X22NeZ, where Z is the initial metallicity of the progenitor (Z∼0.02 for the solar metallicity). A detailed phase diagram for a three-component mixture CONe shows that the effect of Ne on the final WD cooling times is negligible, when the separation of carbon and oxygen is accounted for [162].

8. Rotation and rotational element transport mechanisms

Basic considerations about the angular momentum evolution in a contracting protostellar cloud, observations of the solar magnetic field and stellar spectroscopy (and now also asteroseismology) dictate/show that stars do rotate. Observations of MS O- and B-type stars in the Galaxy and the Magellanic Clouds reveal average projected rotational velocities of the order of 150 km s−1 [163166], with values up to 350–400 km s−1. Average rotational velocities of the order of 150 km s−1 are observed also in MS A- and F-type stars [167], fast decreasing when moving to later spectral types [168], while giant stars display typically slow projected rotational velocities below approximately 10 km s−1 [169].

All fast rotators among O-stars show surface He-enrichments not predicted by standard non-rotating models [170]. Also, high rotational velocities along the MS are associated with enhancements of surface N that cannot be reproduced by non-rotating stellar models [171,172].

For a long time, rotation has not been an ingredient of standard stellar models, due to the increase in complexity and uncertainty—free parameters—related to the inclusion of rotation. Moreover, the basic principle that the explanation relying on the smallest number of hypotheses is the one to be preferred, coupled to the many successes of non-rotating stellar models, means that rotation has been generally considered only a second-order effect. It is, however, clear that it can have important effects on the structure and evolution of stars, mainly through its effect on the evolution of the chemical stratification.

8.1. One-dimensional modelling of rotating stars

The physical basis for the inclusion of rotation in detailed stellar evolution computations was laid down 50 years ago [173,174] and more recently hydrodynamical treatments of the problem have appeared [175177]. Detailed stellar evolution calculations and comprehensive libraries of stellar models are, however, still (and for the foreseeable future) possible only with standard 1D modelling, and the basic assumptions to simulate the average mechanical and thermal distortions induced by rotation without basically altering the standard equations, are the following19 :

  • (i) Roche approximation, i.e. the gravitational potential Ψ is the same as if the total mass of the star were concentrated at the centre.

  • (ii) The angular velocity Ω and chemical composition are constant along isobars, e.g. surfaces of constant pressure (shellular rotation). This follows the assumption that turbulence is anisotropic, with a stronger transport in the horizontal (tangential to an isobar) direction than in the vertical (perpendicular to an isobar) one [178]. The angular velocity in this case depends very weakly on the colatitude, hence ΩΩ(r), meaning that it varies according only to the radial coordinate of the isobars.

  • (iii) The mean radius rP of an isobar is defined by VP=(4π/3)rP3, where VP is the volume inside the isobar.

With these assumptions, the set of equations of stellar structure for a rotating star can be written in 1D as [179,180]

rPmP=14πrP2ρ¯,PmP=GmP4πrP4fp,LPmP=ϵnϵν+ϵgandT¯mP=GmPT4πrP4PP} 8.1

with ∇P being the appropriate temperature gradient dln(T¯)/dln(P). In the case of radiative transport,

P,rad=3κ16πacGPT4LPmPfTfp. 8.2

When comparing these equations to equations (2.1)–(2.4), one can notice that they are formally identical, apart from the form factorsfP and fT. The only difference is the interpretation of the variables. In the case of rotating models, rP is the mean radius of an isobar defined by the value P of the pressure, ρ¯ and T¯ are the volume-averaged density and temperature between two contiguous isobars (the difference with averages on the isobars is negligible if the mass grid of the models is dense enough), and LP and mP are the mass and luminosity inside a given isobar. Calculations of the energy generation coefficients, adiabatic gradient and opacity make use of ρ¯ and T¯. The equation of state is expressed in terms of P=P(ρ¯,T¯) and the chemical composition on the isobar P.

Analogous to the case of non-rotating models the system of equations is solved considering the ‘Lagrangian’ independent variable mP, with rP,LP,P,T¯ as unknowns.

The form factors fP and fT are defined, respectively, as

fP=4πrP4GmPSP1g1 8.3

and

fT=(4πrP2SP)21gg1. 8.4

The quantities 〈g〉 and 〈g−1〉 are average values of the gravity g over an isobar P with surface area SP. For a generic variable f this average is defined as

f=1SPP=const.fdσ, 8.5

where dσ is an infinitesimal element of the isobar surface P. At the non-rotation limit, fP and fT converge to unity and the equations are reduced to their non-rotating form.

To solve equations (8.1), one needs to evaluate fP and fT, and this requires the calculation of the surfaces of isobars, e.g. surfaces where ΨP is constant, with

ΨP=Φ+12Ω2r2sin2θ, 8.6

where Φ is the Roche gravitational potential, r is the radial distance from the centre and θ is the colatitude (θ=0 at the poles; see e.g. [181] for an example of how to calculate integrals (8.5) for a given ΨP). It is through the calculation of fP and fT that the angular velocity profile enters the equation of stellar evolution. It is worth noticing that all this formalism works also in the case of a ‘conservative’ potential (for example, the simple case of solid body rotation), e.g. if the centrifugal acceleration can be derived from a potential [182,183], which is not the case for shellular rotation. A potential of this type is used, for example, in [182]

Ψ=Φ12Ω2r2sin2θ. 8.7

In this case, instead of isobaric surfaces one reads equipotential surfaces, and the equations (including fP and fT) are the same as the shellular case.

8.2. Chemical element and angular momentum transport

A tricky issue for modelling rotating stars with 1D stellar models is how to describe the transport of angular momentum—that determines the evolution of Ω—and the chemical mixing associated with rotation. These are two facets of the same problem, for rotation triggers hydrodynamical instabilities and large-scale motions of the gas in radiative regions, which result in transport of both angular momentum and chemical elements. In-depth discussions about rotation-driven instabilities can be found in [180,184,185]. Here, we just give a brief overview, focussing on the actual implementation in stellar evolution calculations.

The Eddington–Sweet meridional circulation is one of the major instabilities caused by rotation. In simple terms, we can compare a non-rotating star in radiative equilibrium with its solid body rotating counterpart. The equipotential surfaces of the non-rotating star are spherical, while in the case of a solid body rotator they are rotational ellipsoids, and two contiguous equipotential surfaces will diverge in distance from each other at the equator. Given that the effective gravity g is proportional to the gradient of the potential (that is normal to the equipotential surfaces), it will vary with latitude on an equipotential surface. As a consequence, the temperature will be hotter at the poles and cooler at the equator—as demonstrated by von Zeipel almost a century ago, the energy flux is proportional to the local value of g (the von Zeipel theorem)—preventing the star from maintaining hydrostatic equilibrium.

The solution to this paradox is to invoke large-scale mass motions that transport energy, the so-called meridional circulation, moving material inwards from the equator and upwards along the rotational axis towards the poles. The timescale for this mixing process—the ‘Eddington–Sweet’ timescale—was estimated to be

tESGM2LRGMΩ2R3, 8.8

where L, M and R are the stellar luminosity, mass and radius, respectively, Ω the angular velocity and the first term is the Kelvin–Helmholtz thermal timescale. These early estimates of tES were much shorter than the MS lifetime of stars, even for modest rotation rates, hence rotating stars should be fully mixed, contradicting observational data. The presence of chemical abundance stratifications (μ-gradients) in the interior of rotating stars can, however, increase tES considerably compared to the early estimates. A widely used model of the meridional circulation included in stellar evolution codes (that will be used later when discussing the implementation of rotational mixing) develops the circulation velocity vector U into two components [178]:

U=U2(r)P2(cos(θ))er+V2(r)dP2(θ)dθeθ, 8.9

where r is the radial coordinate, θ is the colatitude and P2 is the Legendre polynomial of order 2. The radial components of these velocities, which will be used to treat chemical element transport, are related through

1rddr[ρr2U2(r)]6ρV2(r)=0. 8.10

Another important effect of meridional circulation is that it advects also angular momentum. Local variations of angular velocity with time due to this angular momentum transport, plus the effect of contraction and expansion of the stellar layers (and eventually angular momentum loss from mass loss) will generate—starting, for example, from solid body rotation, usually assumed for pre-MS fully convective stars—a variation of angular rotation velocity with depth in radiative regions. Hence a ‘shear’ develops between neighbouring layers that leads to instabilities. This stems from the fact that the minimum energy state of a rotating fluid is solid body rotation, and if the star develops differential rotation, it is possible to extract energy by homogenizing the velocities through transport of material.

The strong thermal and molecular weight radial stratification in radiative zones tends to oppose the homogenization of the rotational velocities, while it is reasonable to assume that no restoring forces oppose horizontal displacements. As a consequence, horizontal shear (along an isobar) is expected to generate a strong turbulence on short dynamical timescales, justifying the assumption of shellular rotation. Eventually, thermal diffusion and horizontal shear can reduce the stabilizing effect of the vertical (radial) thermal and chemical stratification and induce element and angular momentum transport if the Richardson number Ri satisfies this criterion:

RiN2(du/dz)2<Ricrit=14, 8.11

where u is the velocity of the fluid elements and z designates the vertical direction.20

The Solberg–Høiland instability, plus additional instabilities (i.e. Goldreich–Schubert–Fricke, ABCD instabilities) that can develop when equipotentials and isobars do not coincide—so-called baroclinic instabilities [180]—do potentially contribute to the transport of angular momentum and chemicals.21 Some of these instabilities are implemented in calculations of rotating stellar models (see §8.2.2), even though their treatment is considered to be much more uncertain than the case of meridional circulation and shear instabilities [184,185].

In the case of radiatively driven winds, the von Zeipel result has important consequences on the mass loss rates to be employed in the calculation of rotating stellar models, because in general it causes an increase in the mass loss efficiency compared to the case of a non-rotating counterpart. Various prescriptions of the mass loss enhancement due to rotation can be found in the literature. As an example, we report the prescription employed in the MESA code [58]:

dMdt(Ω)=dMdt(Ω=0)(1(1Ω/Ωcrit))ζ, 8.12

with ζ=0.43, and Ω2crit=(1−L/LEdd)GM/R3 and LEdd=4πcGM/κ averaged over a certain optical depth range.

8.2.1. Advective/diffusive implementation

Meridional circulation and shear instability are considered as the main mechanisms for angular momentum transport and rotational mixing in a number of stellar evolution codes, e.g. STAREVOL [187,188], CESTAM [189], the Geneva stellar evolution codes [179,190] and the FRANEC code [191], that implement advective+diffusive transport of angular momentum and diffusive rotational chemical mixing.

The transport equation for a chemical element with mass fraction Xi is written as22

Xit|Mr=1ρr2r(ρr2DchemXir), 8.13

where Dchem is the sum of the vertical shear diffusion coefficient Dshear and the effective diffusion coefficient Deff, which accounts for the combined effect of the strong horizontal shear diffusion Dh and of the meridional currents:

Deff=130|rU22(r)|2Dh. 8.14

The equation for the transport of angular momentum is written as [192]

t(r2Ω)|Mr=15ρr2r(ρr4ΩU2(r))+1ρr2r(ρDangr4Ωr), 8.15

where Dang denotes the diffusion coefficient for angular momentum. The second term on the right-hand side of equation (8.15) is a diffusion term, similar in its form to equation (8.13), while the first term is an advective term, modelling the transport by a velocity current. Equation (8.13) does not contain an advective term, following the author of [193] who showed that the combined effect of turbulence and circulation currents is equivalent to a diffusion process for the element transport.

There are various expressions in the literature for U2. The complete description by Maeder & Zahn [194] in the case of shellular rotation provides

U2(r)=Pρ¯g¯CPT¯1(ad+(ϕ/δ)μ)[LM(EΩ+Eμ)+T¯CPδΘt]. 8.16

The barred symbols indicate averages over the isobar; EΩ and Eμ are complicated terms that depend on the angular velocity profile and mean molecular weight fluctuations on an isobar (see [180] for the complete formulae and the derivation); Θ describes the density fluctuation on an isobar; and M here denotes the reduced mass

M=MrP(1Ω22πGρm), 8.17

where ρm is the mean density inside the considered isobar surface, and L is the luminosity within the same surface.

In the case of convective regions, the transport of angular momentum is very uncertain, as the interaction between convection and meridional currents is not well understood. Traditionally, one can choose between the following two limiting cases. If convection inhibits meridional currents, angular momentum is redistributed very efficiently by convection, as in the case of chemical elements, and the result will be solid body rotation, as in the solar convective envelope. If meridional currents dominate—one can hypothesize that this is the case applicable to large, rarefied RGB envelopes, where convective elements may collide elastically rather than inelastically as in the solid body case—it is the specific angular momentum that is expected to be uniform. Hydrodynamical simulations have shown that the extended deep convective envelopes of red giant stars are likely to undergo radial differential rotation, with an angular velocity profile of the form Ω(r)∝r−0.5 [195]. In general, solid body rotation is usually prescribed in convective regions.

Angular momentum losses due to stellar winds need also to be accounted for. In the assumption that the mass loss is spherically symmetric, the rate of angular momentum loss during a given time step will be approximately equal to the average specific angular momentum at the surface of the star multiplied by the assumed mass loss rate.

To discuss the effect of rotation and rotational mixing on stellar evolution models, we consider as an example models from [196], calculated with the Geneva code. Figure 23 compares the HRD of 9, 15 and 32M solar initial chemical composition models with and without rotation, from the zero age main sequence (ZAMS) to the end of core C-burning. In these calculations, Dh has been taken as

Dh=1chr|2V2(r)αU2(r)|, 8.18

from [178], assuming ch=1 and α=12(dln(r2Ω)/dlnr). We express Dshear as

Dshear=fenergHPgδKT[(φ/δ)μ+(adrad)](9π32ΩdlnΩdlnr)2 8.19

from [197], where fenerg=1 (the fraction of the excess energy in the shear that contributes to mixing).

Figure 23.

Figure 23.

Comparison of the HRD of 9, 15 and 32M solar initial composition models, with (solid line) and without (dashed line) rotation, from the Geneva calculations [196].

The models are initialized as solid bodies at the ZAMS, and then evolved according to the prescriptions for the angular momentum and chemical transport described above. The prescribed initial equatorial rotation velocity is 0.4 times the critical velocity (when the centrifugal acceleration in the equatorial plane exactly compensates the gravitational acceleration) of the corresponding stellar mass. This is equal to values between approximately 250 and 300 km s−1 for the three displayed masses.

The effect of rotation on the tracks (that of course depends on initial velocities) is striking. Models calculated with rotation evolve to higher luminosities during the MS and stay more luminous also during the following evolutionary phases. This is mainly due to the consequences of the chemical element transport associated with rotation. In general, rotational mixing works in the direction of erasing chemical gradients within the star, and hence during the MS, this causes a continuous slow ingestion of fresh hydrogen into the H-depleted convective core, as well as a slow transport towards the surface of elements whose abundances are increased by H-burning (e.g. He and 14N). The effect of the ingestion of H in the convective core is to increase the MS lifetime compared to the non-rotating counterpart, and to produce slightly larger He-cores at the end of core H-burning.

To this purpose, figure 24 displays the internal profiles (taken during the MS when central H is reduced to—from (a,c) to (b,d)—50% and 10% by mass, respectively) of H and N abundances, angular velocity and chemical diffusion coefficients for shear and meridional circulation in solar initial composition, 15M rotating and non-rotating evolutionary models from [191], similar to the Geneva calculations. One can see very clearly the effect of rotational mixing, which smooths out gradients in the N- and H-abundance profiles. Also, shear mixing dominates in the external layers, where the gradient of angular velocity gets progressively steeper, while meridional circulation is the more efficient chemical transport process close to the edge of the convective core. Notice also the flat angular velocity profile in the convective core, due to the solid body assumption.

Figure 24.

Figure 24.

Internal profiles of H and N abundances, angular velocity and chemical diffusion coefficients for shear and meridional circulation, respectively, as a function of mass, within 15M solar initial composition models from [191], with the labelled initial rotation rates. The different panels refer to two MS stages, when the central H mass fraction is equal to 50% (a,c) and 10% (b,d), respectively (courtesy of M. Limongi).

The different paths in the HRD followed by rotating and non-rotating models together with the different lifetimes lead also to different mass loss histories and hence to different total masses at the end of the MS and during the post-MS phases. This impacts, for example (for a fixed convective mixing scheme), the occurrence of ‘loops’ in the HRD, as the blue or red location of a model during the giant phase depends on thickness of the H-rich envelope (thick enough envelopes keep the models at the red side of the HR, while when their masses decrease below a threshold value, the models move to the blue). During the He-burning phase, the slow ingestion of fresh He in the core lowers the final C/O ratio and increases the mass of the CO core [191], affecting the pre-supernova structure of the massive models.

Figure 25a shows the evolution of the angular velocity Ω taken at the surface and at the centre of the 15M models. After the ZAMS model with enforced solid body rotation is left to evolve, there is a readjustment of the rotational profile at the very beginning of the evolution, until the equilibrium rotational profile is reached, and the angular velocity starts to evolve under the action of the transport mechanisms. The general trend is to transfer angular momentum from the contracting core towards the external layers, but this is counterbalanced by the mass loss that removes angular momentum from the surface and eventual expansions of the convective envelopes that slow down the surface.

Figure 25.

Figure 25.

(a) Displaying the evolution of the surface and central angular velocity as a function of Teff for the 15M models in figure 23. (b) Displaying the evolution of the ratio of the polar and equatorial radius.

For the model shown in figure 25, the net effect is a constant decrease of the surface angular velocity along the whole evolution, whereas the centre of the star displays a moderate increase of angular velocity during the MS, followed by a plateau and a sharp increase during the giant phase. The models are never very far from spherical symmetry, as shown in figure 25b, which displays the ratio of the polar to the equatorial radius along the whole evolution.

The effect of element transport on the surface abundances (in mass fractions) of some key elements (He, C, N, O) for the same 15M evolution is displayed in figure 26. Notice that in the non-rotating models the abundances change only due to the dredge-up during the red giant phase. Rotating models display instead abundance changes already during the MS, due to rotational mixing that tends to erase chemical gradients. This explains the increase of 14N and He, whose abundance increase in the central regions due to CNO H-burning, and the depletion of 12C and 16N, caused by the decreased O and N equilibrium abundances, compared to the initial scaled solar values. The abundances remain almost constant during the fast transition to the giant phase, and then change again due to the dredge-up.

Figure 26.

Figure 26.

Evolution with Teff of the surface abundances (in mass fractions) of the labelled elements, for the rotating (solid lines) and non-rotating (dashed lines) models in figure 23.

The detailed behaviour of the evolution of the angular momentum and chemical abundance profiles is, however, strongly dependent on the precise choice of the diffusion coefficients in equations (8.13) and (8.15). We can see, for example, that the coefficient employed in these calculations contain parameters like ch and fenerg that are set to fixed constant values not derived from first principles.

It is extremely interesting to analyse also the case of the 1M rotating (ZAMS equatorial velocity equal to 50 km s−1) and non-rotating models for the same initial chemical composition. Both calculations account also for atomic diffusion, but without including radiative levitation, whose effect is practically negligible at this metallicity. Figure 27 compares their HRDs,23 which are virtually identical along the MS, while the rotating SGB is slightly brighter and the following RGB phase slightly hotter. The MS lifetime of the rotating model is just approximately 4% longer than the non-rotating case.

Figure 27.

Figure 27.

As figure 23 but for 1M models.

Figure 28 compares the evolution with Teff of the surface abundances of key elements affected by rotational mixing. In the case of non-rotating models, the signature of efficient diffusion during the MS is very clear (see §6). The abundances of all elements decrease, reaching a minimum around the TO, and then increase when convection deepens, before the signature of the first dredge-up can be seen for C, N and He. When rotation is included, the abundances stay constant along the MS, almost equal to the initial values, showing that rotational mixing (for the chosen initial rotational velocity at this metallicity) strongly inhibits the effect of diffusion from the convective envelopes. Along the RGB of the rotating models displays stronger enhancements of N and He, and a larger depletion of C compared to the non-rotating counterparts.

Figure 28.

Figure 28.

As figure 26 but for 1M models.

There are also alternative expressions for Dshear and Dh, in addition to the ones employed in the calculations we are discussing, namely:

  • Dshear from [198]
    Dshear=fenergHPgδ(KT+Dh)[(φ/δ)μ(1+K/Dh)+(adrad)](9π32ΩdlnΩdlnr)2, 8.20
    with K, fenerg and φ as in [197].
  • Dh from [199]
    Dh=Ar(rΩV2(r)|2V2(r)αU2(r)|)1/3, 8.21
    with α as in [178] and A=0.002.
  • Dh from [200]
    Dh=(β10)1/2(r2Ω)1/2(r|2V2(r)αU(r)|)1/2, 8.22
    with α as in [178] and β=1.5×10−6.

The effect of using these various combinations of Dshear and Dh in the Geneva code has been investigated recently [201].24 They found that MS lifetimes, the evolution of surface velocities and the angular momentum of the core have a weak dependence on the choice of these diffusion coefficients. The shape of the evolutionary tracks, the surface enrichment (for a fixed initial rotation velocity), the blue-to-red evolution in the HRD and the extension of the blue loops are, however, significantly affected by the choice of Dshear and Dh.

8.2.2. Diffusive implementation

There is an alternative approach to include transport of chemicals and angular momentum, used in codes like the Yale evolutionary code [203], KEPLER, STERN and MESA. In this case, the temporal evolution of angular momentum and chemical abundances due to rotation is described by a set of two diffusion equations, computationally easy to implement:

Ωt|Mr=1ρr4r(ρr4νΩr) 8.23

and

Xit|Mr=1ρr2r(ρr2DXir), 8.24

where ν and D are, respectively, the total turbulent viscosity and the total diffusion coefficient defined as a sum of all diffusion coefficients associated with all the transport processes taken into account. Each of these diffusion coefficients is built as the product of the velocity v and the path length l of the redistribution currents with

l=min(r,|rlnv|). 8.25

By |r/lnv| is denoted the velocity scale height. The timescale associated with the redistribution over the path scale is simply l2/D=l/v.

This description is usually employed to include within the same formalism also convection and semiconvection, treated as diffusive processes. Appropriate diffusion coefficients for convective and rotational transports are adopted, often with free parameters to be calibrated against some sets of observations, given that these coefficients arise from order-of-magnitude considerations. Rotational mixing processes include at least meridional circulation, and shear, plus eventually additional rotational instabilities [58,76] not usually included in the codes that employ the advective/diffusive implementation.

The viscosity ν is prescribed by

ν=Dconv+Dsemiconv+i,rotinstDi 8.26

and the diffusion coefficient is usually written as the following sum [76]:

D=Dconv+Dsemiconv+fc×(i,rotinstDi), 8.27

where fc is one of the efficiency parameters entering the diffusive formalism and it is calibrated on observations like the solar lithium abundance (fc=0.046 in the models [203]), or the main trend of the observed nitrogen surface abundances with the projected rotational velocity for the nitrogen-enriched fast rotators in the LMC (fc=0.0228 in the models [204]).

An example of these implementations is given in the following, for the meridional circulation [76,203]. The characteristic velocity of these currents in a radiative region is assumed to be

vES0=ad(δ(ad)Ω2r3L(GM)2(2ϵr2L2r2M34πρr), 8.28

with ϵ denoting the energy generation rate per unit mass.

The counteracting effect of μ-gradients is accounted for according to

vμ=fμψHPδ(ad)τKH|μ|μ, 8.29

ψ=(dln(ρ)/dln(μ))P,T,τKH is the local thermal timescale and fμ is a free parameter that allows to vary the sensitivity of meridional circulation to μ gradients.

The effective value of the meridional circulation velocity used to determine the associated diffusion coefficient is finally calculated as vES=|vES0|−|vμ| if vES0>vμ, or vES=0 otherwise.

Figures 29 and 30 compare results for the initial solar composition 15M models discussed in figures 23, 25 and 26, with results of a MESA calculation (that employs this diffusive implementation for the transport of angular momentum and chemicals) for the same mass, solar initial chemical composition and approximately the same initial rotational velocity [205]. The HRD shows that these rotating models are almost equivalent to the non-rotating Geneva calculations. However, the evolution of the surface chemical elements shown in figure 30 displays clear signatures of rotational mixing, although the quantitative effect is very different from the Geneva results. The enhancement of the surface abundances during the MS is more moderate in the MESA calculations (even accounting for the slightly different initial abundances of some elements), and He is almost unchanged. Even the effect of the dredge-up seems to differ between the calculations.

Figure 29.

Figure 29.

Comparison of the HRD of the Geneva models in figure 23 (same line styles) with the rotating counterpart (dotted line) from MESA calculations [58].

Figure 30.

Figure 30.

Comparison of the evolution of the surface abundances (in mass fractions) as a function of Teff of the labelled elements, for the two sets of rotating models in figure 29. Solid lines refer to the Geneva calculations, dotted lines to the MESA models.

It is difficult to determine the exact cause of these differences, given that also other elements of the model input physics (apart from the implementation of rotational transports) differ between the two sets of calculations. Very importantly, MESA calculations include also the effect of atomic diffusion (and radiative levitation), however, moderated by some additional turbulence [205]. But taken at face value, this comparison may give an idea of the uncertainties in the outputs of the current generation of rotating models.

8.3. Additional effects

There are at least two additional processes that may affect substantially the evolution of rotating stars, for they modify the transport of angular momentum and, directly or indirectly, also the transport of chemical elements. Their implementation in stellar model calculations is still uncertain and they are not generally included in the current generation of stellar models.

8.3.1. Gravity waves

Inside a star, the density changes monotonically with the depth and the molecular weight typically increases in the direction of increasing gravity. When a gas element in this stable stratification is perturbed, the competition between buoyancy and gravity gives rise to an oscillatory motion around the equilibrium position and creates a so-called ‘internal gravity wave’ (IGW).

These IGWs are expected to be generated by the injection of kinetic energy from a turbulent region into an adjacent stable region, as observed both in two-dimensional (2D) and 3D simulations of convective mixing, e.g. by convective overshooting in a stable region and bulk excitation or excitation by Reynolds stresses inside the convection zone [206208]. The IGWs penetrate into the radiation zone, transporting angular momentum that is deposited where they are dissipated through heat diffusion by photon exchange, which produces an ‘attenuation factor’ (τ) proportional to the thermal diffusivity and inversely proportional to the IGW frequency ν and amplitude. It is by shaping the internal rotation profile that IGWs contribute indirectly to the mixing of chemical elements.

Angular momentum transport by gravity waves is seldom included in evolutionary stellar model calculations. The current approximate treatment implemented in some stellar evolution calculations [184] expresses the solution of the equations describing the propagation of IGWs in a rotating star in terms of Legendre polynomials. At each point within a radiative region, the total angular momentum ‘luminosity’25 associated with the IGW propagation can be written as

LJ(r)=wavesLJ,ν,,m(rc)exp[τ(r,σ,)], 8.30

where rc denotes the radiation/convection interface, ℓ and m represent, respectively, the spherical order and the azimuthal number of the Legendre polynomial, ν the frequency of the IGW when launched from the convective zone, σ=νm(Ω(r)−Ω(c)) is the local wave frequency measured in the co-rotating frame with angular velocity Ω(r)—Ω(c) being the angular velocity of the solid body rotating convective zone—and

LJ,ν,,m=4πr2FJ(ν,,m)=4πr22mνFE(,ω). 8.31

Here, ℱJ(ν,ℓ,m) is the mean flux of angular momentum carried by a monochromatic wave with emission frequency ν. The amplitude of the waves is assumed to be [209]

FE(ν,)=ν24πdrρ2r2[(ξrr)2+(+1)(ξhr)2]×exp[hν2(+1)2r2]vc3Λ41+(ντΛ)15/2 8.32

with ξr and [ℓ(ℓ+1)]1/2ξh being, respectively, the vertical and horizontal displacement wave functions normalized to unit energy flux at the edge of the considered convection zone, vc the convective velocity, Λ=αMLTHP the radial size of a convective element, τΛΛ/vc the convective timescale and hν=λmin{1,(2ντΛ)3/2}. The radial (kr) and horizontal (kh) wave numbers are related by

kr2=(N2ω21)kh2=(N2ω21)(+1)r2. 8.33

The local damping rate τ(r,σ,ℓ) can be written as

τ(r,σ,)=[(+1)]3/2rrc(KT+νv)NNT2σ4(N2N2σ2)1/2drr3, 8.34

where NT2 is the thermal part of the Brunt–Väisälä frequency and νv is the vertical shear turbulent viscosity

νv=85Ricrit(r(dΩ/dr))2NT2/(KT+νh)+Nμ2/νh, 8.35

with Nμ2 denoting the molecular weight stratification part of the Brunt–Väisälä frequency, and νh the horizontal shear turbulent viscosity, which can be set to Dh, and Ricrit=14.

The deposition of angular momentum is then given by the radial derivative of ℒJ. Given that, as a first approximation, only the radial dependency of IGW transport is considered, all quantities required are evaluated from horizontal (on isobars) averages. The angular momentum evolution due only with IGW transport is given by

ρddt[r2Ω]=±38π1r2LJ(r)r. 8.36

The + or − sign in front of the angular momentum luminosity corresponds to prograde (m>0) or retrograde (m<0) waves.

It is clear that in the absence of differential damping for inward- and outward-travelling waves, there will be no angular momentum transport. This is the case, for example, of solid body rotation. However, differential rotation naturally filters the IGWs that propagate within the star. Considering, for example, the realistic case of a low-mass star with interior layers—see e.g. Figure 31 for the 1M solar chemical composition Geneva rotating model discussed before—that rotate faster than the convective envelope. In the inner radiative regions, we have Ω(r)>Ω(c) and for prograde m>0 waves σ=νm(Ω(r)−Ω(c)) is shifted to lower frequencies when travelling from the edge of the convective envelope towards the interior layers. The radial damping length of the waves given approximately by

ld=2r3σ4[l(l+1)]3/2NNT2KT, 8.37

progressively decreases for these prograde waves. The retrograde m<0 waves are instead boosted to higher frequencies, and their damping length increases, propagating further within the star compared to the prograde waves. As a consequence, prograde waves are absorbed before they can propagate far into more rapidly rotating layers of a star, while retrograde waves pass through, and upon dissipation they deposit their negative angular momentum (they contribute with a negative sign in equation (8.36)), spinning down the rapidly rotating layers. The star then tends to evolve towards solid body rotation.

Figure 31.

Figure 31.

Evolution with Teff of the central and surface angular velocity, for 1M initial solar chemical composition rotating models, calculated with the Geneva code [196].

The angular momentum transport associated with IGWs is considered to be a possibility to explain the solar rotation profile, which is much flatter than generally predicted by rotating models that do not include IGWs [210]. Also, recent asteroseismic observations [211,212] show that the core of low-mass RGB stars do not rotate faster than the surface as much as predicted by current rotating stellar models. The angular momentum redistribution by IGWs may provide one way to reconcile theory with observations. Recent multidimensional detailed hydrodynamical simulations of generation and propagation of IGWs within stars, confirm their ability to transport efficiently angular momentum [213,214].

8.3.2. Magnetic fields

Another way to favour the redistribution of angular momentum within a rotating star is to consider the effect of internal magnetic fields. They are generally implemented following the dynamo mechanism presented in [215], which envisages the creation of magnetic fields in the radiative regions of differentially rotating stars at the expense of the shear, due to the so-called Tayler–Spruit instability.26

The STERN code, which uses a diffusive implementation of element and angular momentum transports, accounts for the effect of magnetic fields as follows [217].

Defining by q=dlnΩ/dlnr the shear, the effective radial viscosity produced by the magnetic field can be written as

νre=νe0νe1νe0+νe1f(q), 8.38

where

νe0=r2Ωq2(ΩNμ)4 8.39

and

νe1=r2Ωmax[(ΩNT)1/2(κr2NT)1/2,q2(ΩNT)4], 8.40

Denoting by qmin the minimum rotational gradient necessary for the dynamo to operate,

f(q)=1qminq,(q>qmin) 8.41

and

f(q)=0,(qqmin). 8.42

The effective diffusivity for transport of chemical elements is given as

De=De0De1De0+De1f(q), 8.43

where

De0=r2Ωq4(ΩNμ)6 8.44

and

De1=r2Ωmax[(ΩNT)3/4(KTr2NT)3/4,q2(ΩNT)6]. 8.45

According to [215] qmin is given by

qmin=q0+q1 8.46
q0=(NμΩ)7/4(ηr2Nμ)1/4 8.47
andq1=(NTΩ)7/4(ηr2NT)1/4(ηKT)3/4, 8.48

with η=7×1011ln(Λ)T3/2 denoting the Spitzer magnetic diffusivity and ln(Λ) the Coulomb logarithm (see §6).

These viscosities and diffusivities have been included as additional terms to the diffusion coefficients of, respectively, angular momentum and chemicals, in both the STERN [204,217] and KEPLER [218] codes, which use a diffusive implementation of element and angular momentum transports. In this latter code, modifications are applied to the viscosities and diffusivities derived before in the case of semiconvective regions, or where the thermohaline mixing is efficient, which means in regions where the timescale of mixing is much longer than in convective layers. In the case of semiconvection, a dynamo effective viscosity is defined as νre=νe0f(q) and the final expression for the viscosity in the semiconvective region is set to be νe=νreνSC, where νSC=(1/3)Hpvconv and vconv is the convective velocity derived from the MLT. The effective diffusion coefficient for the transport of elements is set to (De+DSC), with DSC being the diffusion coefficient due to semiconvection. In the thermohaline mixing region De is set to De1, νe to νe1 and qmin to q1.

The inclusion of internal magnetic fields decreases the rotation velocity contrast between core and envelope during the MS, approaching near solid body rotation. This affects both the shape of the evolutionary tracks as well as the abundance profiles within the models. Figure 32 displays, as an example, the effect on the surface chemical abundances at selected evolutionary stages in the calculations with the code KEPLER [218], for 15M solar metallicity models with a ZAMS equatorial rotation velocity of 200 km s−1.

Figure 32.

Figure 32.

In each panel, from left to right, the abundances (mass fractions—filled circles) correspond to the MS stage when the central H mass fraction is 0.35, to central He-burning with a 0.50 mass fraction of He and to the pre-supernova stage. Solid lines correspond to rotating models without magnetic fields and dashed lines to rotating models with magnetic fields (see text for details).

Internal magnetic fields generated by the Tayler–Spruit instability have been also included in the Geneva code [219]—which implements the advective/diffusive formalism for rotational transports—in a slightly different form. Denoting by x the ratio

x=(ωAΩ)2,

where ωA is the Alfvén frequency of a magnetic field of intensity B, the solution of the equation

r2Ωq2KT(NT2+Nμ2)x4r2Ω3KTx3+2Nμ2x2Ω2q2=0 8.49

provides the unknown quantity ωA. The diffusion coefficient for the vertical transport of angular momentum is then given by

νre=Ωr2q(ωAΩ)3(ΩN), 8.50

and the diffusion coefficient for the transport of chemicals by

De=r2Ωq2(ωAΩ)6. 8.51

Once ωA is calculated, the condition on the minimum shear for the dynamo to work is tested as

q>(NΩ)7/4(Der2N)1/4. 8.52

If this condition is not realized, a formalism for the treatment of low rotation rates—expanding upon work in [215]—is implemented. We notice that, in this treatment, the Brunt–Väisälä frequency is rewritten as

N2=De/KT(De/KT)+2(NT2+Nμ2), 8.53

to account for the presence of a magnetic diffusivity De.

A set of 15M solar metallicity models calculated with this implementation [219] also predict an almost solid body rotation during the MS, with obviously a negligible element transport by shear mixing. The transport of elements due to the magnetic fields is also negligible, while the meridional circulation is strongly enhanced by the flat rotational profile. The net effect during the MS is an enhancement of the surface abundance variations compared to rotating models without internal magnetic fields. This is different from the result of figure 32 for a KEPLER model with the same mass, initial chemical composition and very similar initial rotation rate. In this latter case, the variation of the surface abundances during the MS is almost negligible compared to the case without magnetic fields.

Finally, we mention very briefly the effect of magnetic braking due to magnetized winds.27 Winds with magnetic fields—in the case of surface magnetic fields, whatever their origin is—exert a braking torque that is significantly larger than that for non-magnetic cases [220,221]. The reason is that the magnetic field connects the mass lost from the surface of the star to the envelope, and when the wind finally decouples from the magnetic field, it has the same angular velocity as the surface but a much larger moment of inertia. This increases the amount of angular momentum lost, compared to the case with no magnetic fields, and it is accepted as the reason for the slow rotation rate of low-mass MS stars. The effect on the chemical element transport is indirect, through the variation of the star rotational profile. Surface magnetic fields are observed mainly in low-mass stars with convective envelopes, where they are expected to be generated by a dynamo mechanism, and this type of magnetic braking is implemented mainly in models of low-mass stars. There are various prescriptions in the literature that require the calibration of free parameters on observations, given the lack of knowledge about the surface magnetic fields [222]. For example, the rate of angular momentum loss due to magnetized winds employed in [222] is a variation of [223,224]:

dJdt=KWΩΩsat2(RR)1/2(MM)1/2forΩΩsat 8.54

and

dJdt=KWΩ3(RR)1/2(MM)1/2forΩ<Ωsat, 8.55

where KW=2×1048 in cgs units to reproduce the solar case and Ωsat is a free parameter.

9. An example of possible synergy among several element transport processes

After the description of all major element transport mechanisms included in modern stellar evolution calculations, we show just an example of how their synergy might explain some puzzling observations of chemical abundances in star clusters. Figure 33 displays the trend of the surface Li abundance as a function of Teff for a sample of MS stars in the approximately 600 Myr old Hyades open cluster [225,226]. It is easy to notice the so-called Li-dip around 6600 K, a sharp local minimum of the Li abundance, which cannot be explained by standard stellar evolution models including only convective mixing (the Li-dip is observed also in other open clusters of different ages, at similar temperature).

Figure 33.

Figure 33.

Li abundances measured in a sample of Hyades MS stars, as a function of their Teff [225,226]. Inverted open triangles denote upper limits. The four different regions marked in the diagram are discussed in the text.

Lithium is a very fragile element that is burned at temperatures of approximately 2.5×106 K. In solar metallicity MS models calculated with just convective mixing, and Teff between approximately 6400 and 7500 K, the Li burning temperature is well below the base of the convective envelope, hence the Li abundance is constant down to the radiative layers where the temperature reaches approximately 2.5×106 K. This implies that these models—which also are not expected to experience pre-MS Li depletion—predict a constant surface Li with Teff in this temperature range, contrary to observations.

To explain these observations of Li abundances, we outline below the scenario proposed by Talon & Charbonnel [227], which involves the combination of several of the element transport mechanisms discussed above.28 To this purpose, we have divided the temperature range covered by the data of figure 33 into four contiguous ranges:

  • (i) Region 1. These stars have shallow convective envelopes, not efficient at generating a surface magnetic field via a dynamo process, hence the surface is not slowed down by magnetic winds. As a result, rotational mixing is expected to be just about sufficient to counteract atomic diffusion of Li below the convective envelope (e.g. counteracts the creation of a Li gradient right below the convective boundary where Li diffused from the convective region accumulates).

  • (ii) Region 2. Moving towards lower temperatures, the convective envelopes deepen, and dynamo-generated weak surface magnetic fields are expected now to start slowing down the outer layers. The increased shear strenghtens the rotational mixing and Li depletion increases with decreasing Teff. This happens because the more vigorous mixing is trying now to erase the gradient between the Li-burning region (no Li) and Li-rich convective layers.

  • (iii) Region 3. Stars on the cool side of the Li-dip have even deeper convective envelopes, that sustain a very efficient dynamo and slow down even more the external layers. At the same time, these convective layers are now very efficient at generating IGWs that redistribute momentum (driving the star towards solid body rotation) and reduce the efficiency of rotational mixing, inducing an increase in surface Li with decreasing Teff. Calculations based on the approximations described before [227] show that the expected efficiency of IGW-induced angular momentum transport has the required dependence on stellar mass (Teff).

  • (iv) Region 4. At these low Teff, the convective envelope is deep enough to reach Li burning temperatures, causing an increasingly larger depletion with decreasing Teff.

Measurements of projected rotation velocities in this cluster display a decreasing average velocity with decreasing Teff, consistent with an increased efficiency of magnetic braking when moving towards lower stellar masses [229].

10. Conclusion

It is clear that the description of complex physical processes like turbulent convection, semiconvection, thermohaline mixing and rotation, implemented in stellar evolution calculations is necessarily simplified, with the predictive power in some cases hampered by the use of several free parameters of uncertain calibration. Also, the effect of interactions among the various instabilities in rotating stars—which usually are considered as independent—has to be fully explored [230].

It is also fair to say that, in many cases, the approaches used in stellar evolution models are probably reaching their limits, and further developments of multidimensional hydrodynamical simulations are crucial to progress in the field. As we have discussed, there are already some constraints that current hydrodynamical simulations pose to the element transport processes efficient in stars. Even though there is clearly still a long way to go to complete stellar interior modelling with numerical hydrodynamics, physical insights provided by computer simulations are invaluable in improving our understanding of element transport processes [3,231]. Just to give two examples, recent results from hydrodynamical simulations provide new indications about the way to go to replace the MLT in stellar modelling [232], as well as how to implement a more consistent description of overshooting in different regimes [40].

At the same time, the booming field of asteroseismology is starting to provide new information on the efficiency of element transport processes, that can at the very least be used to add further constraints to our current recipes. Two perfect examples are recent works on mixing in low-mass core He-burning stars [233,234], and angular momentum transport in RGB stars [235,236].

The hope is that a synergy between clues from hydrodynamical simulations and the powerful constraints coming from asteroseismic analyses will help to improve our description of element transport in stellar model computations and improve their predicting power.

Acknowledgements

The authors are grateful to Marco Limongi, Victor Silva-Aguirre and David Hyder for preparing some of the figures.

Footnotes

1

For example, main sequence core convection of massive short-lived models involves ≈109 turnover times to complete the exhaustion of fuel. The Reynolds number in stars is Re>109–1010, much larger than the effective Reynolds number, approximately 104 in the current best-resolved simulations.

2

In the case of pre-supernova fast evolutionary phases, the equation of hydrostatic equilibrium usually contains an extra term (14πr2)(2r/t2) called the acceleration term.

3

If A and b are identical particles, the factor nA!nb! has to be replaced with (nA+nb)!.

4

A term −Xs/τd has to be added to the right-hand side of equation (2.7) if element s decays with decaying constant τd.

5

The values assumed in the classical and widely employed MLT flavour [17] are a=18, b=12 and c=24. Solar models calculated with this choice of the MLT flavour usually require αMLT of the order of 1.6–2.0, the exact value depending on the specific physics inputs and treatment of the outer boundary conditions in the calculations. Remarkably enough, different choices of these three parameters found in the literature provide the same results when αMLT is recalibrated on the Sun.

6

But see also [24] who claim that αMLT should vary with stellar properties, using data from the Kepler satellite.

7

Another calibration of αMLT from an independent suite of 3D radiation hydrodynamics simulations of stellar envelopes [27] has not been implemented yet in stellar evolution calculations.

8

Semiconvection can also be important during Si-burning [65] in massive stars.

9

Even a difference in rotational velocity between the convective core and the overlying radiative layers can have a similar effect. In this case, equation (4.5) becomes z=vtang2/(4g(1μ1/μ2)), where vtang is now the difference in rotational velocity between the convective core (expected to rotate like a solid body) and the overlying radiative layers. Equation (4.6) becomes vp=(vtang2vmix)/(4gΛ(1(μ1/μ2)))

10

Even the current uncertainties in the 12C(α,γ)16O reaction rate can affect the CO profiles, and the HB/AGB star count ratio.

11

Thermohaline mixing due to the same reasons as along the RGB is also efficient during the HB and AGB evolution of low-mass stars, according to some calculations [91]. An upper limit of 1.5M[91] or 2.2M[92] is found for RGB stars (the same 1.5M limit is found by Cantiello & Langer [91] for HB and AGB stars) to develop thermohaline mixing. The efficiency of the mixing in general tends to decrease with increasing mass. Thermohaline mixing is also important when carbon has ignited off-centre in the core of super-AGB stars, for it affects significantly the propagation of the flame [93].

12

About 5% of the post-bump RGB objects investigated do not show these abundance variations. A strong magnetic field can potentially inhibit thermohaline mixing [96].

13

The calculations by Cantiello & Langer [91] and the MESA calculations by Lattanzio et al. [101] consider the constant Cth as Cth(32)αth, where αth is in this case the free parameter.

14

We discussed in §4 the effect of diffusion on the HB mixed cores.

15

The surface abundance variations tend to increase the envelope opacities which, in turn, would tend to increase the MS lifetime. But the dominant process is the decrease of available fuel, and the net effect is an approximately 10% decrease of the MS lifetime.

16

Metals observed in a minority of WD spectra are ascribed to the effect of accretion.

17

If diffusion is inhibited only from the outer convective layers of low-mass stars, the effect on the interiors can still affect the age determination of clusters and field stars [138].

18
The He settling velocity wHe can be related to a diffusion coefficient D(He) through
D(He)KBTwHemHeg,
where mHe is the mass of the He nucleus.
19

It may seem surprising to be able to calculate 1D rotating models, given that the centrifugal forces reduce the effective gravity according to the latitude and introduce deviations from spherical symmetry.

20

Hydrodynamical simulations show that shear mixing is already efficient for values of Ri higher than 14[186].

21

The Solberg–Høiland criterion for stability is essentially the Ledoux criterion when rotation is present. For a moderate outward decrease of the angular velocity, rotation favours stability compared to the Ledoux criterion.

22

From now on, for simplifying the notation, we will denote by r the radius of the isobar rP.

23

The model with rotation is not calculated up to the tip of the RGB, but from independent calculations with rotation we know that He-core masses at He-ignition are larger in rotating models. This causes a brighter tip of the RGB and a brighter HB luminosity compared to the non-rotating case.

24

See also [202] for a similar analysis.

25

The total angular momentum luminosity is defined as the average angular momentum flux transported by IGWs through a surface of radius r.

26

The efficiency of this mechanism is, however, questioned by recent numerical simulations [216].

27

The complex angular momentum evolution during the pre-MS phase due to influence of the surrounding disc, is generally not modelled by stellar evolution calculations.

28

Other scenarios to explain the Li-dip are described in [228].

Data accessibility

This article has no additional data.

Authors' contributions

M.S. has designed the outline of the review. Both authors have agreed on the content, shared the writing and gave their final approval for the publication.

Competing interests

We declare we have no competing interests.

Funding

S.C. acknowledges support from INAF PRIN-INAF2014 (PI: S.C.), and from grants: AYA2013-42781-P and AYA2016-77237-C3-1-P from the Spanish Ministry of Economy and Competitiveness (MINECO).

References

  • 1.Eddington AS. 1926. The internal constitution of the stars. Cambridge, UK: Cambridge University Press. [Google Scholar]
  • 2.Salaris M, Cassisi S. 2005. Evolution of stars and stellar populations. Berlin, Germany: Wiley-VCH. [Google Scholar]
  • 3.Arnett WD, Meakin C. 2016. Key issues review: numerical studies of turbulence in stars. Rep. Prog. Physics 79, 102901 (doi:10.1088/0034-4885/79/10/102901) [DOI] [PubMed] [Google Scholar]
  • 4.Henyey LG, Forbes JE, Gould NL. 1964. A new method of automatic computation of stellar evolution. Astrophys. J. 139, 306 (doi:10.1086/147754) [Google Scholar]
  • 5.Kippenhahn R, Weigert A, Weiss A. 2012. Stellar structure and evolution. Berlin, Germany: Springer-Verlag; (doi:10.1007/978-3-642-30304-3) [Google Scholar]
  • 6.Smartt SJ. 2015. Observational constraints on the progenitors of core-collapse supernovae: the case for missing high-mass stars. Publ. Astron. Soc. Aust. 32, e016 (doi:10.1017/pasa.2015.17) [Google Scholar]
  • 7.Kato S. 1966. Overstable convection in a medium stratified in mean molecular weight. Publ. Astron. Soc. Jpn. 18, 374–383. [Google Scholar]
  • 8.Merryfield WJ. 1995. Hydrodynamics of semiconvection. Astrophys. J. 444, 318–337. (doi:10.1086/175607) [Google Scholar]
  • 9.Wood TS, Garaud P, Stellmach S. 2013. A new model for mixing by double-diffusive convection (semi-convection). II. The transport of heat and composition through layers. Astrophys. J. 768, 157 (doi:10.1088/0004-637X/768/2/157) [Google Scholar]
  • 10.Langer N, Fricke KJ, Sugimoto D. 1983. Semiconvective diffusion and energy transport. Astron. Astrophys. 126, 207–208. [Google Scholar]
  • 11.Schwarzschild M, Härm R. 1958. Evolution of very massive stars. Astrophys. J. 128, 348 (doi:10.1086/146548) [Google Scholar]
  • 12.Cameron AGW, Fowler WA. 1971. Lithium and the s-PROCESS in red-giant stars. Astrophys. J. 164, 111–114. (doi:10.1086/150821) [Google Scholar]
  • 13.Cassisi S, Castellani V, Tornambe A. 1996. The evolutionary properties and peculiar thermal pulses of metal-deficient low-mass stars. Astrophys. J. 459, 298–306. (doi:10.1086/176893) [Google Scholar]
  • 14.Stancliffe RJ, Dearborn DSP, Lattanzio JC, Heap SA, Campbell SW. 2011. Three-dimensional hydrodynamical simulations of a proton ingestion episode in a low-metallicity asymptotic giant branch star. Astrophys. J. 742, 121 (doi:10.1088/0004-637X/742/2/121) [Google Scholar]
  • 15.Prandtl L. 1925. Bericht über Untersuchungen zur ausgebildeten Turbulenz. Z. Angew. Math. Mech. 5, 136–139. [Google Scholar]
  • 16.Biermann L. 1932. Untersuchungen über den inneren Aufbau der Sterne. IV. Konvektionszonen im Innern der Sterne. (Veröffentlichungen der Universitäts-Sternwarte Göttingen, Nr. 27.) Mit 5 Abbildungen. Z. Astrophys. 5, 117. [Google Scholar]
  • 17.Böhm-Vitense E. 1958. Über die Wasserstoffkonvektionszone in Sternen verschiedener Effektivtemperaturen und Leuchtkräfte. Mit 5 Textabbildungen. Z. Astrophys. 46, 108. [Google Scholar]
  • 18.Tassoul M, Fontaine G, Winget DE. 1990. Evolutionary models for pulsation studies of white dwarfs. Astrophys. J. Suppl. Ser. 72, 335–386. (doi:10.1086/191420) [Google Scholar]
  • 19.Salaris M, Cassisi S. 2008. Stellar models with the ML2 theory of convection. Astron. Astrophys. 487, 1075–1080. (doi:10.1051/0004-6361:200810253) [Google Scholar]
  • 20.Cloutman LD, Eoll JG. 1976. Comments on the diffusion model of turbulent mixing. Astrophys. J. 206, 548–554. (doi:10.1086/154411) [Google Scholar]
  • 21.Weaver TA, Zimmerman GB, Woosley SE. 1978. Presupernova evolution of massive stars. Astrophys. J. 225, 1021–1029. (doi:10.1086/156569) [Google Scholar]
  • 22.Langer N, El Eid MF, Fricke KJ. 1985. Evolution of massive stars with semiconvective diffusion. Astron. Astrophys. 145, 179–191. [Google Scholar]
  • 23.Ventura P, Castellani M. 2005. Time dependent mixing in He-burning cores: the case of NGC 1866. Astron. Astrophys. 430, 1035–1047. (doi:10.1051/0004-6361:20041347) [Google Scholar]
  • 24.Bonaca A, et al. 2012. Calibrating convective properties of solar-like stars in the Kepler field of view. Astrophys. J. Lett. 755, L12 (doi:10.1088/2041-8205/755/1/L12) [Google Scholar]
  • 25.Trampedach R, Stein RF, Christensen-Dalsgaard J, Nordlund Å, Asplund M. 2014. Improvements to stellar structure models, based on a grid of 3D convection simulations. II. Calibrating the mixing-length formulation. Mon. Not. R. Astron. Soc. 445, 4366–4384. (doi:10.1093/mnras/stu2084) [Google Scholar]
  • 26.Salaris M, Cassisi S. 2015. Stellar models with mixing length and T(τ) relations calibrated on 3D convection simulations. Astron. Astrophys. 577, A60 (doi:10.1051/0004-6361/201525812) [Google Scholar]
  • 27.Magic Z, Weiss A, Asplund M. 2015. The Stagger-grid: a grid of 3D stellar atmosphere models. III. The relation to mixing length convection theory. Astron. Astrophys. 573, A89 (doi:10.1051/0004-6361/201423760) [Google Scholar]
  • 28.Christensen-Dalsgaard J. 2002. Helioseismology. Rev. Mod. Phys. 74, 1073–1129. (doi:10.1103/RevModPhys.74.1073) [Google Scholar]
  • 29.Paterno L, Ventura R, Canuto VM, Mazzitelli I. 1993. Helioseismological test of a new model for stellar convection. Astrophys. J. 402, 733–740. (doi:10.1086/172173) [Google Scholar]
  • 30.Ventura P, D’Antona F, Mazzitelli I. 2008. The ATON 3.1 stellar evolutionary code. A version for asteroseismology. Astrophys. Space Sci. 316, 93–98. (doi:10.1007/s10509-007-9672-8) [Google Scholar]
  • 31.Canuto VM, Mazzitelli I. 1991. Stellar turbulent convection: a new model and applications. Astrophys. J. 370, 295–311. (doi:10.1086/169815) [Google Scholar]
  • 32.Canuto VM. 1992. Turbulent convection with overshooting—Reynolds stress approach. Astrophys. J. 392, 218–232. (doi:10.1086/171420) [Google Scholar]
  • 33.Canuto VM. 1993. Turbulent convection with overshooting: Reynolds stress approach. II. Astrophys. J. 416, 331–354. (doi:10.1086/173238) [Google Scholar]
  • 34.Xiong DR, Cheng QL, Deng L. 1997. Nonlocal time-dependent convection theory. Astrophys. J. Suppl. Ser. 108, 529–544. (doi:10.1086/312959) [Google Scholar]
  • 35.Li Y, Yang JY. 2007. Testing turbulent convection theory in solar models. I. Structure of the solar convection zone. Mon. Not. R. Astron. Soc. 375, 388–402. (doi:10.1111/j.1365-2966.2006.11319.x) [Google Scholar]
  • 36.Yang JY, Li Y. 2007. Testing turbulent convection theory in solar models. II. Solar p-mode oscillations. Mon. Not. R. Astron. Soc. 375, 403–414. (doi:10.1111/j.1365-2966.2006.11320.x) [Google Scholar]
  • 37.Canuto VM. 2011. Stellar mixing. I. Formalism. Astron. Astrophys. 528, A76 (doi:10.1051/0004-6361/201014447) [Google Scholar]
  • 38.Andrássy R, Spruit HC. 2013. Overshooting by convective settling. Astron. Astrophys. 559, A122 (doi:10.1051/0004-6361/201321793) [Google Scholar]
  • 39.Andrássy R, Spruit HC. 2015. Overshooting by differential heating. Astron. Astrophys. 578, A106 (doi:10.1051/0004-6361/201425125) [Google Scholar]
  • 40.Viallet M, Meakin C, Prat V, Arnett D. 2015. Toward a consistent use of overshooting parametrizations in 1D stellar evolution codes. Astron. Astrophys. 580, A61 (doi:10.1051/0004-6361/201526294) [Google Scholar]
  • 41.Stancliffe RJ, Fossati L, Passy JC, Schneider FRN. 2015. Confronting uncertainties in stellar physics: calibrating convective overshooting with eclipsing binaries. Astron. Astrophys. 575, A117 (doi:10.1051/0004-6361/201425126) [Google Scholar]
  • 42.Valle G, Dell’Omodarme M, Prada Moroni PG, Degl’Innocenti S. 2016. Calibrating convective-core overshooting with eclipsing binary systems: the case of low-mass main-sequence stars. Astron. Astrophys. 587, A16 (doi:10.1051/0004-6361/201527389) [Google Scholar]
  • 43.Demarque P, Sarajedini A, Guo XJ. 1994. The gap in the color-magnitude diagram of NGC 2420: a test of convective overshoot and cluster age. Astrophys. J. 426, 165–169. (doi:10.1086/174052) [Google Scholar]
  • 44.Pietrinferni A, Cassisi S, Salaris M, Castelli F. 2004. A large stellar evolution database for population synthesis studies. I. Scaled solar models and isochrones. Astrophys. J. 612, 168–190. (doi:10.1086/422498) [Google Scholar]
  • 45.Dotter A, Chaboyer B, Jevremović D, Kostov V, Baron E, Ferguson JW. 2008. The Dartmouth stellar evolution database. Astrophys. J. Suppl. Ser. 178, 89–101. (doi:10.1086/589654) [Google Scholar]
  • 46.Demarque P, Woo JH, Kim YC, Yi SK. 2004. Y 2 Isochrones with an improved core overshoot treatment. Astrophys. J. Suppl. Ser. 155, 667–674. (doi:10.1086/424966) [Google Scholar]
  • 47.VandenBerg DA, Bergbusch PA, Dowler PD. 2006. The Victoria-Regina stellar models: evolutionary tracks and isochrones for a wide range in mass and metallicity that allow for empirically constrained amounts of convective core overshooting. Astrophys. J. Suppl. Ser. 162, 375–387. (doi:10.1086/498451) [Google Scholar]
  • 48.Roxburgh IW. 1989. Integral constraints on convective overshooting. Astron. Astrophys. 211, 361–364. [Google Scholar]
  • 49.Girardi L, Bressan A, Bertelli G, Chiosi C. 2000. Evolutionary tracks and isochrones for low- and intermediate-mass stars: from 0.15 to 7 M, and from Z=0.0004 to 0.03. Astron. Astrophys. Suppl. 141, 371–383. (doi:10.1051/aas:2000126) [Google Scholar]
  • 50.Bressan A, Marigo P, Girardi L, Salasnich B, Dal Cero C, Rubele S, Nanni A. 2012. PARSEC: stellar tracks and isochrones with the PAdova and TRieste stellar evolution code. Mon. Not. R. Astron. Soc. 427, 127–145. (doi:10.1111/j.1365-2966.2012.21948.x) [Google Scholar]
  • 51.Zahn JP. 1991. Convective penetration in stellar interiors. Astron. Astrophys. 252, 179–188. [Google Scholar]
  • 52.Bressan AG, Chiosi C, Bertelli G. 1981. Mass loss and overshooting in massive stars. Astron. Astrophys. 102, 25–30. [Google Scholar]
  • 53.Roxburgh IW. 1992. Limits on convective penetration from stellar cores. Astron. Astrophys. 266, 291–293. [Google Scholar]
  • 54.Woo JH, Demarque P. 2001. Empirical constraints on convective core overshoot. Astron. J. 122, 1602–1606. (doi:10.1086/322122) [Google Scholar]
  • 55.Herwig F. 2000. The evolution of AGB stars with convective overshoot. Astron. Astrophys. 360, 952–968. [Google Scholar]
  • 56.Schroder KP, Pols OR, Eggleton PP. 1997. A critical test of stellar evolution and convective core ‘overshooting’ by means of zeta Aurigae systems. Mon. Not. R. Astron. Soc. 285, 696–710. (doi:10.1093/mnras/285.4.696) [Google Scholar]
  • 57.Paxton B, Bildsten L, Dotter A, Herwig F, Lesaffre P, Timmes F. 2011. Modules for experiments in stellar astrophysics (MESA). Astrophys. J. Suppl. Ser. 192, 3 (doi:10.1088/0067-0049/192/1/3) [Google Scholar]
  • 58.Paxton B, et al. 2013. Modules for experiments in stellar astrophysics (MESA): planets, oscillations, rotation, and massive stars. Astrophys. J. Suppl. Ser. 208, 4 (doi:10.1088/0067-0049/208/1/4) [Google Scholar]
  • 59.Weiss A, Schlattl H. 2008. GARSTEC—the Garching stellar evolution code: the direct descendant of the legendary Kippenhahn code. Astrophys. Space Sci. 316, 99–106. (doi:10.1007/s10509-007-9606-5) [Google Scholar]
  • 60.Freytag B, Ludwig HG, Steffen M. 1996. Hydrodynamical models of stellar convection: the role of overshoot in DA white dwarfs, A-type stars, and the Sun. Astron. Astrophys. 313, 497–516. [Google Scholar]
  • 61.Meakin CA, Arnett D. 2007. Turbulent convection in stellar interiors. I. Hydrodynamic simulation. Astrophys. J. 667, 448–475. (doi:10.1086/520318) [Google Scholar]
  • 62.Staritsin EI. 2013. Turbulent entrainment at the boundaries of the convective cores of main-sequence stars. Astron. Rep. 57, 380–390. (doi:10.1134/S1063772913050089) [Google Scholar]
  • 63.Salaris M, Pietrinferni A, Piersimoni AM, Cassisi S. 2015. Post first dredge-up [C/N] ratio as age indicator: theoretical calibration. Astron. Astrophys. 583, A87 (doi:10.1051/0004-6361/201526951) [Google Scholar]
  • 64.Doherty CL, Gil-Pons P, Siess L, Lattanzio JC.2017. Super-AGB Stars and their role as Electron Capture Supernova progenitors. (http://arxiv.org/abs/1703.06895. )
  • 65.Woosley SE, Heger A, Weaver TA. 2002. The evolution and explosion of massive stars. Rev. Mod. Phys. 74, 1015–1071. (doi:10.1103/RevModPhys.74.1015) [Google Scholar]
  • 66.Chiosi C, Summa C. 1970. On the evolution of OB stars from the main sequence to the helium exhaustion phase. Astrophys. Space Sci. 8, 478–496. (doi:10.1007/BF00651340) [Google Scholar]
  • 67.Stothers R. 1970. Internal structure of upper main-sequence stars. Mon. Not. R. Astron. Soc. 151, 65 (doi:10.1093/mnras/151.1.65) [Google Scholar]
  • 68.Simpson EE. 1971. Evolutionary models of stars of 15 and 30 M. Astrophys. J. 165, 295 (doi:10.1086/150898) [Google Scholar]
  • 69.Silva Aguirre V, Ballot J, Serenelli AM, Weiss A. 2011. Constraining mixing processes in stellar cores using asteroseismology. Impact of semiconvection in low-mass stars. Astron. Astrophys. 529, A63 (doi:10.1051/0004-6361/201015847) [Google Scholar]
  • 70.Stothers R, Chin CW. 1975. Stellar evolution at high mass with semiconvective mixing according to the Ledoux criterion. Astrophys. J. 198, 407–417. (doi:10.1086/153615) [Google Scholar]
  • 71.Stothers R, Chin CW. 1976. Stellar evolution at high mass with semiconvective mixing according to the Schwarzschild criterion. Astrophys. J. 204, 472–480. (doi:10.1086/154191) [Google Scholar]
  • 72.Langer N, Maeder A. 1995. The problem of the blue-to-red supergiant ratio in galaxies. Astron. Astrophys. 295, 685–692. [Google Scholar]
  • 73.Yoon SC, Langer N. 2005. Evolution of rapidly rotating metal-poor massive stars towards gamma-ray bursts. Astron. Astrophys. 443, 643–648. (doi:10.1051/0004-6361:20054030) [Google Scholar]
  • 74.Spruit HC. 1992. The rate of mixing in semiconvective zones. Astron. Astrophys. 253, 131–138. [Google Scholar]
  • 75.Weaver TA, Zimmerman GB, Woosley SE. 1978. Presupernova evolution of massive stars. Astrophys. J. 225, 1021–1029. (doi:10.1086/156569) [Google Scholar]
  • 76.Heger A, Langer N, Woosley SE. 2000. Presupernova evolution of rotating massive stars. I. Numerical method and evolution of the internal stellar structure. Astrophys. J. 528, 368–396. (doi:10.1086/308158) [Google Scholar]
  • 77.Sukhbold T, Woosley SE. 2014. The compactness of Presupernova stellar cores. Astrophys. J. 783, 10 (doi:10.1088/0004-637X/783/1/10) [Google Scholar]
  • 78.Moore K, Garaud P. 2016. Main sequence evolution with layered semiconvection. Astrophys. J. 817, 54 (doi:10.3847/0004-637X/817/1/54) [Google Scholar]
  • 79.Castellani V, Giannone P, Renzini A. 1971. Overshooting of convective cores in Helium-burning horizontal-branch stars. Astrophys. Space Sci. 10, 340–349. (doi:10.1007/BF00704092) [Google Scholar]
  • 80.Sweigart AV, Gross PG. 1974. Horizontal-branch evolution with semiconvection. I. Interior evolution. Astrophys. J. 190, 101–108. (doi:10.1086/152852) [Google Scholar]
  • 81.Robertson JW, Faulkner DJ. 1972. Semiconvection in the core-Helium phase of stellar evolution. Astrophys. J. 171, 309–315. (doi:10.1086/151283) [Google Scholar]
  • 82.Dorman B, Rood RT. 1993. On partial mixing zones in horizontal-branch stellar cores. Astrophys. J. 409, 387–403. (doi:10.1086/172671) [Google Scholar]
  • 83.Gabriel M, Noels A, Montalbán J, Miglio A. 2014. Proper use of Schwarzschild Ledoux criteria in stellar evolution computations. Astron. Astrophys. 569, A63 (doi:10.1051/0004-6361/201423442) [Google Scholar]
  • 84.Michaud G, Richer J, Richard O. 2007. Horizontal branch evolution and atomic diffusion. Astrophys. J. 670, 1178–1187. (doi:10.1086/522488) [Google Scholar]
  • 85.Chandrasekhar S. 1961. Hydrodynamic and hydromagnetic stability. Oxford, UK: Clarendon. [Google Scholar]
  • 86.Castellani V, Chieffi A, Tornambe A, Pulone L. 1985. Helium-burning evolutionary phases in population II stars. I. Breathing pulses in horizontal branch stars. Astrophys. J. 296, 204–212. (doi:10.1086/163437) [Google Scholar]
  • 87.Caputo F, Chieffi A, Tornambe A, Castellani V, Pulone L. 1989. The ‘Red Giant Clock’ as an indicator for the efficiency of central mixing in horizontal-branch stars. Astrophys. J. 340, 241–248. (doi:10.1086/167387) [Google Scholar]
  • 88.Cassisi S, Salaris M, Irwin AW. 2003. The initial Helium content of galactic globular cluster stars from the R-Parameter: comparison with the cosmic microwave background constraint. Astrophys. J. 588, 862–870. (doi:10.1086/374218) [Google Scholar]
  • 89.Cassisi S, Castellani V, degl’Innocenti S, Salaris M, Weiss A. 1999. Galactic globular cluster stars: from theory to observation. Astron. Astrophys. Suppl. 134, 103–113. (doi:10.1051/aas:1999126) [Google Scholar]
  • 90.Straniero O, Domínguez I, Imbriani G, Piersanti L. 2003. The chemical composition of white dwarfs as a test of convective efficiency during core helium burning. Astrophys. J. 583, 878–884. (doi:10.1086/345427) [Google Scholar]
  • 91.Cantiello M, Langer N. 2010. Thermohaline mixing in evolved low-mass stars. Astron. Astrophys. 521, A9 (doi:10.1051/0004-6361/201014305) [Google Scholar]
  • 92.Charbonnel C, Lagarde N. 2010. Thermohaline instability and rotation-induced mixing. I. Low- and intermediate-mass solar metallicity stars up to the end of the AGB. Astron. Astrophys. 522, A10 (doi:10.1051/0004-6361/201014432) [Google Scholar]
  • 93.Siess L. 2009. Thermohaline mixing in super-AGB stars. Astron. Astrophys. 497, 463–468. (doi:10.1051/0004-6361/200811362) [Google Scholar]
  • 94.Gratton RG, Sneden C, Carretta E, Bragaglia A. 2000. Mixing along the red giant branch in metal-poor field stars. Astron. Astrophys. 354, 169–187. [Google Scholar]
  • 95.Angelou GC, D’Orazi V, Constantino TN, Church RP, Stancliffe RJ, Lattanzio JC. 2015. Diagnostics of stellar modelling from spectroscopy and photometry of globular clusters. Mon. Not. R. Astron. Soc. 450, 2423–2440. (doi:10.1093/mnras/stv770) [Google Scholar]
  • 96.Charbonnel C, Zahn JP. 2007. Inhibition of thermohaline mixing by a magnetic field in Ap star descendants: implications for the Galactic evolution of 3He. Astron. Astrophys. 476, L29–L32. (doi:10.1051/0004-6361:20078740) [Google Scholar]
  • 97.Eggleton PP, Dearborn DSP, Lattanzio JC. 2006. Deep mixing of 3He: reconciling big bang and stellar nucleosynthesis. Science 314, 1580–1583. (doi:10.1126/science.1133065) [DOI] [PubMed] [Google Scholar]
  • 98.Charbonnel C, Zahn JP. 2007. Thermohaline mixing: a physical mechanism governing the photospheric composition of low-mass giants. Astron. Astrophys. 467, L15–L18. (doi:10.1051/0004-6361:20077274) [Google Scholar]
  • 99.Ulrich RK. 1972. Thermohaline convection in stellar interiors. Astrophys. J. 172, 165 (doi:10.1086/151336) [Google Scholar]
  • 100.Kippenhahn R, Ruschenplatt G, Thomas HC. 1980. The time scale of thermohaline mixing in stars. Astron. Astrophys. 91, 175–180. [Google Scholar]
  • 101.Lattanzio JC, Siess L, Church RP, Angelou G, Stancliffe RJ, Doherty CL, Stephen T, Campbell S. 2015. On the numerical treatment and dependence of thermohaline mixing in red giants. Mon. Not. R. Astron. Soc. 446, 2673–2688. (doi:10.1093/mnras/stu2238) [Google Scholar]
  • 102.Denissenkov PA, Merryfield WJ. 2011. Thermohaline mixing: does it really govern the atmospheric chemical composition of low-mass red giants? Astrophys. J. Lett. 727, L8 (doi:10.1088/2041-8205/727/1/L8) [Google Scholar]
  • 103.Traxler A, Garaud P, Stellmach S. 2011. Numerically determined transport laws for fingering (‘thermohaline’) convection in astrophysics. Astrophys. J. Lett. 728, L29 (doi:10.1088/2041-8205/728/2/L29) [Google Scholar]
  • 104.Turcotte S, Richer J, Michaud G, Iglesias CA, Rogers FJ. 1998. Consistent solar evolution model including diffusion and radiative acceleration effects. Astrophys. J. 504, 539–558. (doi:10.1086/306055) [Google Scholar]
  • 105.Burgers JM. 1969. Flow equations for composite gases. New York, NY: Academic Press. [Google Scholar]
  • 106.Thoul AA, Bahcall JN, Loeb A. 1994. Element diffusion in the solar interior. Astrophys. J. 421, 828–842. (doi:10.1086/173695) [Google Scholar]
  • 107.Mason EA, Munn RJ, Smith FJ. 1967. Transport coefficients of ionized gases. Phys. Fluids 10, 1827–1832. (doi:10.1063/1.1762365) [Google Scholar]
  • 108.Muchmore D. 1984. Diffusion in white dwarf stars. Astrophys. J. 278, 769–783. (doi:10.1086/161847) [Google Scholar]
  • 109.Iben I Jr, MacDonald J. 1985. The effects of diffusion due to gravity and due to composition gradients on the rate of hydrogen burning in a cooling degenerate dwarf. I–the case of a thick helium buffer layer. Astrophys. J. 296, 540–553. (doi:10.1086/163473) [Google Scholar]
  • 110.Paquette C, Pelletier C, Fontaine G, Michaud G. 1986. Diffusion coefficients for stellar plasmas. Astrophys. J. Suppl. Ser. 61, 177–195. (doi:10.1086/191111) [Google Scholar]
  • 111.Schlattl H, Salaris M. 2003. Quantum corrections to microscopic diffusion constants. Astron. Astrophys. 402, 29–35. (doi:10.1051/0004-6361:20030230) [Google Scholar]
  • 112.Zhang QS.2016. Numerical integral of resistance coefficients in diffusion. (http://arxiv.org/abs/1611.03115. )
  • 113.Daligault J, Baalrud SD, Starrett CE, Saumon D, Sjostrom T. 2016. Ionic transport coefficients of dense plasmas without molecular dynamics. Phys. Rev. Lett. 116, 075002 (doi:10.1103/PhysRevLett.116.075002) [DOI] [PubMed] [Google Scholar]
  • 114.Hu H, Tout CA, Glebbeek E, Dupret MA. 2011. Slowing down atomic diffusion in subdwarf B stars: mass loss or turbulence?. Mon. Not. R. Astron. Soc. 418, 195–205. (doi:10.1111/j.1365-2966.2011.19482.x) [Google Scholar]
  • 115.Seaton MJ. 2005. Opacity Project data on CD for mean opacities and radiative accelerations. Mon. Not. R. Astron. Soc. 362, L1–L3. (doi:10.1111/j.1365-2966.2005.00019.x) [Google Scholar]
  • 116.LeBlanc F, Michaud G, Richer J. 2000. Opacity sampling in radiative acceleration calculations. Astrophys. J. 538, 876–884. (doi:10.1086/309167) [Google Scholar]
  • 117.Cassisi S, degl’Innocenti S, Salaris M. 1997. The effect of diffusion on the red giant luminosity function ‘bump’. Mon. Not. R. Astron. Soc. 290, 515–520. (doi:10.1093/mnras/290.3.515) [Google Scholar]
  • 118.Richard O, Michaud G, Richer J, Turcotte S, Turck-Chièze S, VandenBerg DA. 2002. Models of metal-poor stars with gravitational settling and radiative accelerations. I. Evolution and abundance anomalies. Astrophys. J. 568, 979–997. (doi:10.1086/338952) [Google Scholar]
  • 119.Koester D. 2009. Accretion and diffusion in white dwarfs. New diffusion timescales and applications to GD 362 and G 29-38. Astron. Astrophys. 498, 517–525. (doi:10.1051/0004-6361/200811468) [Google Scholar]
  • 120.Michaud G, Richer J, Richard O. 2011. Horizontal branch evolution, metallicity, and sdB stars. Astron. Astrophys. 529, A60 (doi:10.1051/0004-6361/201015997) [Google Scholar]
  • 121.Cassisi S, Salaris M. 2013. Old stellar populations: how to study the fossil record of galaxy formation. Berlin, Germany: Wiley-VCH. [Google Scholar]
  • 122.Vick M, Michaud G, Richer J, Richard O. 2013. Population II stars and the Spite plateau. Stellar evolution models with mass loss. Astron. Astrophys. 552, A131 (doi:10.1051/0004-6361/201220726) [Google Scholar]
  • 123.Bravo E, Isern J, Canal R, Labay J. 1992. On the contribution of Ne-22 to the synthesis of Fe-54 and Ni-58 in thermonuclear supernovae. Astron. Astrophys. 257, 534–538. [Google Scholar]
  • 124.Deloye CJ, Bildsten L. 2002. Gravitational settling of 22Ne in liquid white dwarf interiors: cooling and seismological effects. Astrophys. J. 580, 1077–1090. (doi:10.1086/343800) [Google Scholar]
  • 125.Althaus LG, García-Berro E, Renedo I, Isern J, Córsico AH, Rohrmann RD. 2010. Evolution of white dwarf stars with high-metallicity progenitors: the role of 22Ne diffusion. Astrophys. J. 719, 612–621. (doi:10.1088/0004-637X/719/1/612) [Google Scholar]
  • 126.García-Berro E, Torres S, Althaus LG, Renedo I, Lorén-Aguilar P, Córsico AH, Rohrmann RD, Salaris M, Isern J. 2010. A white dwarf cooling age of 8 Gyr for NGC 6791 from physical separation processes. Nature 465, 194–196. (doi:10.1038/nature09045) [DOI] [PubMed] [Google Scholar]
  • 127.Camisassa ME, Althaus LG, Córsico AH, Vinyoles N, Serenelli AM, Isern J, Miller Bertolami MM, García-Berro E. 2016. The effect of 22NE diffusion in the evolution and pulsational properties of white dwarfs with solar metallicity progenitors. Astrophys. J. 823, 158 (doi:10.3847/0004-637X/823/2/158) [Google Scholar]
  • 128.Hughto J, Schneider AS, Horowitz CJ, Berry DK. 2010. Diffusion of neon in white dwarf stars. Phys. Rev. E 82, 066401 (doi:10.1103/PhysRevE.82.066401) [DOI] [PubMed] [Google Scholar]
  • 129.Vick M, Michaud G, Richer J, Richard O. 2011. Abundance anomalies in pre-main-sequence stars: stellar evolution models with mass loss. Astron. Astrophys. 526, A37 (doi:10.1051/0004-6361/201015533) [Google Scholar]
  • 130.Charpinet S, Fontaine G, Brassard P, Chayer P, Rogers FJ, Iglesias CA, Dorman B. 1997. A driving mechanism for the newly discovered class of pulsating subdwarf B stars. Astrophys. J. Lett. 483, L123–L126. (doi:10.1086/310741) [Google Scholar]
  • 131.Bahcall JN, Pinsonneault MH, Wasserburg GJ. 1995. Solar models with helium and heavy-element diffusion. Rev. Mod. Phys. 67, 781–808. (doi:10.1103/RevModPhys.67.781) [Google Scholar]
  • 132.Villante FL, Serenelli AM, Delahaye F, Pinsonneault MH. 2014. The chemical composition of the Sun from Helioseismic and Solar Neutrino data. Astrophys. J. 787, 13 (doi:10.1088/0004-637X/787/1/13) [Google Scholar]
  • 133.Korn AJ, Grundahl F, Richard O, Mashonkina L, Barklem PS, Collet R, Gustafsson B, Piskunov N. 2007. Atomic diffusion and mixing in old stars. I. Very large telescope FLAMES-UVES observations of stars in NGC 6397. Astrophys. J. 671, 402–419. (doi:10.1086/523098) [Google Scholar]
  • 134.Lind K, Korn AJ, Barklem PS, Grundahl F. 2008. Atomic diffusion and mixing in old stars. II. Observations of stars in the globular cluster NGC 6397 with VLT/FLAMES-GIRAFFE. Astron. Astrophys. 490, 777–786. (doi:10.1051/0004-6361:200810051) [Google Scholar]
  • 135.Mucciarelli A, Salaris M, Lovisi L, Ferraro FR, Lanzoni B, Lucatello S, Gratton RG. 2011. Lithium abundance in the globular cluster M4: from the turn-off to the red giant branch bump. Mon. Not. R. Astron. Soc. 412, 81–94. (doi:10.1111/j.1365-2966.2010.17884.x) [Google Scholar]
  • 136.Gruyters P, et al. 2016. Atomic diffusion and mixing in old stars. VI. The lithium content of M30. Astron. Astrophys. 589, A61 (doi:10.1051/0004-6361/201527948) [Google Scholar]
  • 137.Korn A, Grundahl F, Richard O, Barklem P, Mashonkina L, Collet R, Piskunov N, Gustafsson B. 2006. New abundances for old stars–atomic diffusion at work in NGC 6397. The Messenger 125, 6–10. [Google Scholar]
  • 138.Salaris M. 2016. Age dating of old field stars: challenges from the uncertain efficiency of atomic diffusion. Astron. Nachr. 337, 805–809. (doi:10.1002/asna.201612375) [Google Scholar]
  • 139.Richer J, Michaud G, Turcotte S. 2000. The evolution of AMFM stars, abundance anomalies, and turbulent transport. Astrophys. J. 529, 338–356. (doi:10.1086/308274) [Google Scholar]
  • 140.Deal M, Richard O, Vauclair S. 2016. Hydrodynamical instabilities induced by atomic diffusion in A stars and their consequences. Astron. Astrophys. 589, A140 (doi:10.1051/0004-6361/201628180) [Google Scholar]
  • 141.Gebran M, Vick M, Monier R, Fossati L. 2010. Chemical composition of A and F dwarfs members of the Hyades open cluster. Astron. Astrophys. 523, A71 (doi:10.1051/0004-6361/200913273) [Google Scholar]
  • 142.Spite F, Spite M. 1982. Abundance of lithium in unevolved halo stars and old disk stars—interpretation and consequences. Astron. Astrophys. 115, 357–366. [Google Scholar]
  • 143.Salaris M, Weiss A. 2001. Atomic diffusion in metal-poor stars. II. Predictions for the Spite plateau. Astron. Astrophys. 376, 955–965. (doi:10.1051/0004-6361:20010982) [Google Scholar]
  • 144.Richard O, Michaud G, Richer J. 2005. Implications of WMAP observations on Li abundance and stellar evolution models. Astrophys. J. 619, 538–548. (doi:10.1086/426470) [Google Scholar]
  • 145.Quievy D, Charbonneau P, Michaud G, Richer J. 2009. Abundances anomalies and meridional circulation in horizontal branch stars. Astron. Astrophys. 500, 1163–1171. (doi:10.1051/0004-6361/200811262) [Google Scholar]
  • 146.Swenson FJ. 1995. Lithium in halo dwarfs: the undoing of diffusion by mass loss. Astrophys. J. Lett. 438, L87–L90. (doi:10.1086/187722) [Google Scholar]
  • 147.Vauclair S, Charbonnel C. 1995. Influence of a stellar wind on the lithium depletion in halo stars: a new step towards the lithium primordial abundance. Astron. Astrophys. 295, 715–724. [Google Scholar]
  • 148.Vick M, Michaud G, Richer J, Richard O. 2010. AmFm and lithium gap stars. Stellar evolution models with mass loss. Astron. Astrophys. 521, A62 (doi:10.1051/0004-6361/201014307) [Google Scholar]
  • 149.Brown JM, Garaud P, Stellmach S. 2013. Chemical transport and spontaneous layer formation in fingering convection in astrophysics. Astrophys. J. 768, 34 (doi:10.1088/0004-637X/768/1/34) [Google Scholar]
  • 150.Matrozis E, Stancliffe RJ. 2016. Radiative levitation in carbon-enhanced metal-poor stars with s-process enrichment. Astron. Astrophys. 592, A29 (doi:10.1051/0004-6361/201628540) [Google Scholar]
  • 151.Richard O, Michaud G, Richer J. 2002. Models of metal-poor stars with gravitational settling and radiative accelerations. III. Metallicity dependence. Astrophys. J. 580, 1100–1117. (doi:10.1086/343733) [Google Scholar]
  • 152.Gruyters P, Nordlander T, Korn AJ. 2014. Atomic diffusion and mixing in old stars. V. A deeper look into the globular cluster NGC 6752. Astron. Astrophys. 567, A72 (doi:10.1051/0004-6361/201423590) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 153.Önehag A, Gustafsson B, Korn A. 2014. Abundances and possible diffusion of elements in M67 stars. Astron. Astrophys. 562, A102 (doi:10.1051/0004-6361/201322663) [Google Scholar]
  • 154.VandenBerg DA, Bergbusch PA, Dotter A, Ferguson JW, Michaud G, Richer J, Proffitt CR. 2012. Models for metal-poor stars with enhanced abundances of C, N, O, Ne, Na, Mg, Si, S, Ca, and Ti, in turn, at constant Helium and Iron abundances. Astrophys. J. 755, 15 (doi:10.1088/0004-637X/755/1/15) [Google Scholar]
  • 155.Morel P, Thévenin F. 2002. Atomic diffusion in star models of type earlier than G. Astron. Astrophys. 390, 611–620. (doi:10.1051/0004-6361:20020768) [Google Scholar]
  • 156.Garcia-Berro E, Hernanz M, Mochkovitch R, Isern J. 1988. Theoretical white-dwarf luminosity functions for two phase diagrams of the carbon-oxygen dense plasma. Astron. Astrophys. 193, 141–147. [Google Scholar]
  • 157.Salaris M, Domínguez I, García-Berro E, Hernanz M, Isern J, Mochkovitch R. 1997. The cooling of CO white dwarfs: influence of the internal chemical distribution. Astrophys. J. 486, 413–419. (doi:10.1086/304483) [Google Scholar]
  • 158.Segretain L, Chabrier G. 1993. Crystallization of binary ionic mixtures in dense stellar plasmas. Astron. Astrophys. 271, L13–L16. [Google Scholar]
  • 159.Mochkovitch R. 1983. Freezing of a carbon-oxygen white dwarf. Astron. Astrophys. 122, 212–218. [Google Scholar]
  • 160.Isern J, Mochkovitch R, García-Berro E, Hernanz M. 1997. The physics of crystallizing white dwarfs. Astrophys. J. 485, 308–312. (doi:10.1086/304425) [Google Scholar]
  • 161.Horowitz CJ, Schneider AS, Berry DK. 2010. Crystallization of carbon-oxygen mixtures in white dwarf stars. Phys. Rev. Lett. 104, 231101 (doi:10.1103/PhysRevLett.104.231101) [DOI] [PubMed] [Google Scholar]
  • 162.Segretain L. 1996. Three-body crystallization diagrams and the cooling of white dwarfs. Astron. Astrophys. 310, 485–488. [Google Scholar]
  • 163.Hunter I, Lennon DJ, Dufton PL, Trundle C, Simón-Díaz S, Smartt SJ, Ryans RSI, Evans CJ. 2008. The VLT-FLAMES survey of massive stars: atmospheric parameters and rotational velocity distributions for B-type stars in the Magellanic Clouds. Astron. Astrophys. 479, 541–555. (doi:10.1051/0004-6361:20078511) [Google Scholar]
  • 164.Penny LR, Gies DR. 2009. A FUSE survey of the rotation rates of very massive stars in the small and large Magellanic Clouds. Astrophys. J. 700, 844–858. (doi:10.1088/0004-637X/700/1/844) [Google Scholar]
  • 165.Bragança GA, Daflon S, Cunha K, Bensby T, Oey MS, Walth G. 2012. Projected rotational velocities and stellar characterization of 350 B Stars in the nearby galactic disk. Astron. J. 144, 130 (doi:10.1088/0004-6256/144/5/130) [Google Scholar]
  • 166.Huangu W, Gies DR, McSwain MV. 2010. A stellar rotation census of B stars: from ZAMS to TAMS. Astrophys. J. 722, 605–619. (doi:10.1088/0004-637X/722/1/605) [Google Scholar]
  • 167.Royer F, Zorec J, Gómez AE. 2007. Rotational velocities of A-type stars. III. Velocity distributions. Astron. Astrophys. 463, 671–682. (doi:10.1051/0004-6361:20065224) [Google Scholar]
  • 168.Stauffer JB, Hartmann LW. 1986. The rotational velocities of low-mass stars. Publ. Astron. Soc. Pac. 98, 1233–1251. (doi:10.1086/131926) [Google Scholar]
  • 169.Carlberg JK, Majewski SR, Patterson RJ, Bizyaev D, Smith VV, Cunha K. 2011. The frequency of rapid rotation among K giant stars. Astrophys. J. 732, 39 (doi:10.1088/0004-637X/732/1/39) [Google Scholar]
  • 170.Herrero A, Kudritzki RP, Vilchez JM, Kunze D, Butler K, Haser S. 1992. Intrinsic parameters of galactic luminous OB stars. Astron. Astrophys. 261, 209–234. [Google Scholar]
  • 171.Hunter I, et al. 2009. The VLT-FLAMES survey of massive stars: constraints on stellar evolution from the chemical compositions of rapidly rotating Galactic and Magellanic Cloud B-type stars. Astron. Astrophys. 496, 841–853. (doi:10.1051/0004-6361/200809925) [Google Scholar]
  • 172.Martins F, Simon-Diaz S, Barba RH, Gamen RC, Ekstroem S.2016. A study of the effect of rotational mixing on massive stars evolution: surface abundances of Galactic O7-8 giant stars. (http://arxiv.org/abs/1611.05223. )
  • 173.Monaghan FF, Roxburgh IW. 1965. The structure of rapidly rotating polytropes. Mon. Not. R. Astron. Soc. 131, 13 (doi:10.1093/mnras/131.1.13) [Google Scholar]
  • 174.Ostriker JP, Mark JWK. 1968. Rapidly rotating stars. I. The self-consistent-field method. Astrophys. J. 151, 1075–1088. (doi:10.1086/149506) [Google Scholar]
  • 175.Deupree RG. 1990. Stellar evolution with arbitrary rotation laws. I. Mathematical techniques and test cases. Astrophys. J. 357, 175–187. (doi:10.1086/168903) [Google Scholar]
  • 176.Deupree RG. 2001. Stellar evolution with arbitrary rotation laws. IV. Survey of zero-age main-sequence models. Astrophys. J. 552, 268–277. (doi:10.1086/320464) [Google Scholar]
  • 177.Espinosa Lara F, Rieutord M. 2013. Self-consistent 2D models of fast-rotating early-type stars. Astron. Astrophys. 552, A35 (doi:10.1051/0004-6361/201220844) [Google Scholar]
  • 178.Zahn JP. 1992. Circulation and turbulence in rotating stars. Astron. Astrophys. 265, 115–132. [Google Scholar]
  • 179.Meynet G, Maeder A. 1997. Stellar evolution with rotation. I. The computational method and the inhibiting effect of the μ-gradient. Astron. Astrophys. 321, 465–476. [Google Scholar]
  • 180.Maeder A. 2009. Physics, formation and evolution of rotating stars. Berlin, Germany: Springer-Verlag; (doi:10.1007/978-3-540-76949-1) [Google Scholar]
  • 181.Zeng YR. 2002. A more powerful evolution model for rotating stars. Astron. Astrophys. 394, 965–969. (doi:10.1051/0004-6361:20021233) [Google Scholar]
  • 182.Kippenhahn R, Thomas HC. 1970. A simple method for the solution of the stellar structure equations including rotation and tidal forces. In IAU Colloq. 4: Stellar Rotation (ed. A Slettebak), p. 20. Philadelphia, PA: Gordon and Breach Science Publishers.
  • 183.Endal AS, Sofia S. 1976. The evolution of rotating stars. I–method and exploratory calculations for a 7-solar-mass star. Astrophys. J. 210, 184–198. (doi:10.1086/154817) [Google Scholar]
  • 184.Talon S,. 2008. Transport processes in stars: diffusion, rotation, magnetic fields and internal waves. EAS Publications Ser. 32, 81–130. (doi:10.1051/eas:0832003) [Google Scholar]
  • 185.Palacios A,. 2013. Influence of rotation on stellar evolution. EAS Publications Ser. 62, 227–287. (doi:10.1051/eas/1362007) [Google Scholar]
  • 186.Brüggen M, Hillebrandt W. 2001. Mixing through shear instabilities. Mon. Not. R. Astron. Soc. 320, 73–82. (doi:10.1046/j.1365-8711.2001.03951.x) [Google Scholar]
  • 187.Palacios A, Charbonnel C, Talon S, Siess L. 2006. Rotational mixing in low-mass stars, II. Self-consistent models of Pop II RGB stars. Astron. Astrophys. 453, 261–278. (doi:10.1051/0004-6361:20053065) [Google Scholar]
  • 188.Decressin T, Mathis S, Palacios A, Siess L, Talon S, Charbonnel C, Zahn J-P. 2009. Diagnoses to unravel secular hydrodynamical processes in rotating main sequence stars. Astron. Astrophys. 495, 271–286. (doi:10.1051/0004-6361:200810665) [Google Scholar]
  • 189.Marques JP, Goupil MJ. 2013. The influence of initial conditions on stellar rotation history. In Studying stellar rotation and convection (eds M Goupil, K Belkacem, C Neiner, F Lignières, JJ Green). Lecture Notes in Physics, vol. 865, p. 75. Berlin, Germany: Springer (doi:10.1007/978-3-642-33380-4_4).
  • 190.Eggenberger P, Meynet G, Maeder A, Hirschi R, Charbonnel C, Talon S, Ekström S. 2008. The Geneva stellar evolution code. Astrophys. Space Sci. 316, 43–54. (doi:10.1007/s10509-007-9511-y) [Google Scholar]
  • 191.Chieffi A, Limongi M. 2013. Pre-supernova evolution of rotating solar metallicity stars in the mass range 13-120 M solar masses and their explosive yields. Astrophys. J. 764, 21 (doi:10.1088/0004-637X/764/1/21) [Google Scholar]
  • 192.Meynet G, Maeder A. 2005. Rotation and mixing in massive stars: principles and uncertainties. In The nature and evolution of disks around hot stars (eds R Ignace, KG Gayley). Astronomical Society of the Pacific Conference Series, vol. 337, p. 15. San Francisco, CA: Astronomical Society of the Pacific.
  • 193.Chaboyer B, Zahn JP. 1992. Effect of horizontal turbulent diffusion on transport by meridional circulation. Astron. Astrophys. 253, 173–177. [Google Scholar]
  • 194.Maeder A, Zahn JP. 1998. Stellar evolution with rotation. III. Meridional circulation with MU—gradients and non-stationarity. Astron. Astrophys. 334, 1000–1006. [Google Scholar]
  • 195.Brun AS, Palacios A. 2009. Numerical simulations of a rotating red giant star. I. Three-dimensional models of turbulent convection and associated mean flows. Astrophys. J. 702, 1078–1097. (doi:10.1088/0004-637X/702/2/1078) [Google Scholar]
  • 196.Ekström S, et al. 2012. Grids of stellar models with rotation. I. Models from 0.8 to 120 solar masses at solar metallicity (Z=0.014). Astron. Astrophys. 537, A146 (doi:10.1051/0004-6361/201117751) [Google Scholar]
  • 197.Maeder A. 1997. Stellar evolution with rotation. II. A new approach for shear mixing. Astron. Astrophys. 321, 134–144. [Google Scholar]
  • 198.Talon S, Zahn JP. 1997. Anisotropic diffusion and shear instabilities. Astron. Astrophys. 317, 749–751. [Google Scholar]
  • 199.Maeder A. 2003. Stellar rotation: evidence for a large horizontal turbulence and its effects on evolution. Astron. Astrophys. 399, 263–269. (doi:10.1051/0004-6361:20021731) [Google Scholar]
  • 200.Mathis S, Palacios A, Zahn JP. 2004. On shear-induced turbulence in rotating stars. Astron. Astrophys. 425, 243–247. (doi:10.1051/0004-6361:20040279) [Google Scholar]
  • 201.Meynet G, Ekstrom S, Maeder A, Eggenberger P, Saio H, Chomienne V, Haemmerlé L. 2013. Models of rotating massive stars: impacts of various prescriptions. In Studying stellar rotation and convection (eds M Goupil, K Belkacem, C Neiner, F Lignières, JJ Green), pp. 3–22. Lecture Notes in Physics, vol. 865. Berlin, Germany: Springer (doi:10.1007/978-3-642-33380-4_1)
  • 202.Potter AT, Tout CA, Brott I. 2012. Towards a unified model of stellar rotation—II. Model-dependent characteristics of stellar populations. Mon. Not. R. Astron. Soc. 423, 1221–1233. (doi:10.1111/j.1365-2966.2012.20952.x) [Google Scholar]
  • 203.Pinsonneault MH, Kawaler SD, Sofia S, Demarque P. 1989. Evolutionary models of the rotating sun. Astrophys. J. 338, 424–452. (doi:10.1086/167210) [Google Scholar]
  • 204.Brott I, de Mink SE, Cantiello M, Langer N, de Koter A, Evans CJ, Hunter I, Trundle C, Vink JS. 2011. Rotating massive main-sequence stars. I. Grids of evolutionary models and isochrones. Astron. Astrophys. 530, A115 (doi:10.1051/0004-6361/201016113) [Google Scholar]
  • 205.Choi J, Dotter A, Conroy C, Cantiello M, Paxton B, Johnson BD. 2016. Mesa isochrones and stellar tracks (MIST). I. Solar-scaled models. Astrophys. J. 823, 102 (doi:10.3847/0004-637X/823/2/102) [Google Scholar]
  • 206.Zahn JP, Talon S, Matias J. 1997. Angular momentum transport by internal waves in the solar interior. Astron. Astrophys. 322, 320–328. [Google Scholar]
  • 207.Talon S, Kumar P, Zahn JP. 2002. Angular momentum extraction by gravity waves in the Sun. Astrophys. J. Lett. 574, L175–L178. (doi:10.1086/342526) [Google Scholar]
  • 208.Talon S, Charbonnel C. 2005. Hydrodynamical stellar models including rotation, internal gravity waves, and atomic diffusion. I. Formalism and tests on Pop I dwarfs. Astron. Astrophys. 440, 981–994. (doi:10.1051/0004-6361:20053020) [Google Scholar]
  • 209.Goldreich P, Murray N, Kumar P. 1994. Excitation of solar p-modes. Astrophys. J. 424, 466–479. (doi:10.1086/173904) [Google Scholar]
  • 210.Charbonnel C, Talon S. 2005. Influence of gravity waves on the internal rotation and Li abundance of solar-type stars. Science 309, 2189–2191. (doi:10.1126/science.1116849) [DOI] [PubMed] [Google Scholar]
  • 211.Beck PG, et al. 2012. Fast core rotation in red-giant stars as revealed by gravity-dominated mixed modes. Nature 481, 55–57. (doi:10.1038/nature10612) [DOI] [PubMed] [Google Scholar]
  • 212.Deheuvels S, et al. 2012. Seismic evidence for a rapidly rotating core in a lower-giant-branch star observed with Kepler. Astrophys. J. 756, 19 (doi:10.1088/0004-637X/756/1/19) [Google Scholar]
  • 213.Rogers TM, Lin DNC, McElwaine JN, Lau HHB. 2013. Internal gravity waves in massive stars: angular momentum transport. Astrophys. J. 772, 21 (doi:10.1088/0004-637X/772/1/21) [Google Scholar]
  • 214.Alvan L, Brun AS, Mathis S. 2014. Theoretical seismology in 3D: nonlinear simulations of internal gravity waves in solar-like stars. Astron. Astrophys. 565, A42 (doi:10.1051/0004-6361/201323253) [Google Scholar]
  • 215.Spruit HC. 2002. Dynamo action by differential rotation in a stably stratified stellar interior. Astron. Astrophys. 381, 923–932. (doi:10.1051/0004-6361:20011465) [Google Scholar]
  • 216.Zahn JP, Brun AS, Mathis S. 2007. On magnetic instabilities and dynamo action in stellar radiation zones. Astron. Astrophys. 474, 145–154. (doi:10.1051/0004-6361:20077653) [Google Scholar]
  • 217.Petrovic J, Langer N, Yoon SC, Heger A. 2005. Which massive stars are gamma-ray burst progenitors? Astron. Astrophys. 435, 247–259. (doi:10.1051/0004-6361:20042545) [Google Scholar]
  • 218.Heger A, Woosley SE, Spruit HC. 2005. Presupernova evolution of differentially rotating massive stars including magnetic fields. Astrophys. J. 626, 350–363. (doi:10.1086/429868) [Google Scholar]
  • 219.Maeder A, Meynet G. 2005. Stellar evolution with rotation and magnetic fields. III. The interplay of circulation and dynamo. Astron. Astrophys. 440, 1041–1049. (doi:10.1051/0004-6361:20053261) [Google Scholar]
  • 220.Bouvier J, Forestini M, Allain S. 1997. The angular momentum evolution of low-mass stars. Astron. Astrophys. 326, 1023–1043. [Google Scholar]
  • 221.Matt SP, MacGregor KB, Pinsonneault MH, Greene TP. 2012. Magnetic braking formulation for Sun-like stars: dependence on dipole field strength and rotation rate. Astrophys. J. Lett. 754, L26 (doi:10.1088/2041-8205/754/2/L26) [Google Scholar]
  • 222.Amard L, Palacios A, Charbonnel C, Gallet F, Bouvier J. 2016. Rotating models of young solar-type stars. Exploring braking laws and angular momentum transport processes. Astron. Astrophys. 587, A105 (doi:10.1051/0004-6361/201527349) [Google Scholar]
  • 223.Kawaler SD. 1988. Angular momentum loss in low-mass stars. Astrophys. J. 333, 236–247. (doi:10.1086/166740) [Google Scholar]
  • 224.Chaboyer B, Demarque P, Pinsonneault MH. 1995. Stellar models with microscopic diffusion and rotational mixing. 1. Application to the Sun. Astrophys. J. 441, 865–875. (doi:10.1086/175408) [Google Scholar]
  • 225.Boesgaard AM, Tripicco MJ. 1986. Lithium in the Hyades cluster. Astrophys. J. Lett. 302, L49–L53. (doi:10.1086/184635) [Google Scholar]
  • 226.Thorburn JA, Hobbs LM, Deliyannis CP, Pinsonneault MH. 1993. Lithium in the Hyades. I. New observations. Astrophys. J. 415, 150–173. (doi:10.1086/173152) [Google Scholar]
  • 227.Talon S, Charbonnel C. 2003. Angular momentum transport by internal gravity waves. I. Pop I main sequence stars. Astron. Astrophys. 405, 1025–1032. (doi:10.1051/0004-6361:20030672) [Google Scholar]
  • 228.Michaud G, Charbonneau P. 1991. The lithium abundance in stars. Space Sci. Rev. 57, 1–58. (doi:10.1007/BF00195950) [Google Scholar]
  • 229.Talon S, Charbonnel C. 1998. The Li dip: a probe of angular momentum transport in low mass stars. Astron. Astrophys. 335, 959–968. [Google Scholar]
  • 230.Maeder A, Meynet G, Lagarde N, Charbonnel C. 2013. The thermohaline, Richardson, Rayleigh-Taylor, Solberg-Høiland, and GSF criteria in rotating stars. Astron. Astrophys. 553, A1 (doi:10.1051/0004-6361/201220936) [Google Scholar]
  • 231.Viallet M, Meakin C, Arnett D, Mocák M. 2013. Turbulent convection in stellar interiors. III. Mean-field analysis and stratification effects. Astrophys. J. 769, 1 (doi:10.1088/0004-637X/769/1/1) [Google Scholar]
  • 232.Arnett WD, Meakin C, Viallet M, Campbell SW, Lattanzio JC, Mocák M. 2015. Beyond mixing-length theory: a step toward 321D. Astrophys. J. 809, 30 (doi:10.1088/0004-637X/809/1/30) [Google Scholar]
  • 233.Bossini D, et al. 2015. Uncertainties on near-core mixing in red-clump stars: effects on the period spacing and on the luminosity of the AGB bump. Mon. Not. R. Astron. Soc. 453, 2290–2301. (doi:10.1093/mnras/stv1738) [Google Scholar]
  • 234.Constantino T, Campbell SW, Christensen-Dalsgaard J, Lattanzio JC, Stello D. 2015. The treatment of mixing in core helium burning models. I. Implications for asteroseismology. Mon. Not. R. Astron. Soc. 452, 123–145. (doi:10.1093/mnras/stv1264) [Google Scholar]
  • 235.Eggenberger P, Lagarde N, Miglio A, Montalbán J, Ekström S, Georgy C, Salmon S, Meynet G, Maeder A. 2016. Models of rotating stars constrained by asteroseismic measurements of red giants. Astron. Nachr. 337, 832–836. (doi:10.1002/asna.201612381) [Google Scholar]
  • 236.Eggenberger P, et al. 2016. Constraining the efficiency of angular momentum transport with asteroseismology of red giants: the effect of stellar mass. (http://arxiv.org/abs/1612.04258. )

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This article has no additional data.


Articles from Royal Society Open Science are provided here courtesy of The Royal Society

RESOURCES