Abstract
The classical path approximation is reviewed in a manner which permits definitive statements concerning its region of validity.
Keywords: Classical path methods, line broadening theory
1. Introduction
The classical path approximation which is used in most line broadening theories, as well as many other atomic and molecular problems, is not always clearly understood because it is based on rather heuristic arguments. Previous derivations have relied more on physical insight than mathematical analysis hence such features as the neglect of “back reaction” and the assumption that the density matrix is “diagonal in wave packet states” are not given to concise mathematical statements.
In this paper, statistical methods are used in deriving a precise mathematical expression for the “thermal average.” The approximations used will be given as mathematical statements which are easily understood both physically and mathematically. It is hoped that this approach will be of some pedagogical value in understanding the foundations of the usual classical path methods.
In the following paper [1]1, henceforth referred to as II, this approach will be extended to show how the usual binary collision theories are obtained, and a systematic method for improving these theories is outlined.
2. Construction of a Mathematical Model
The line broadening problem for gases (or plasmas) is basically a study of the distribution of spectral radiation emitted or absorbed by the gas. The emission (or absorption) is usually produced by a small number of particles, and the spectral lines are shifted and broadened by the interaction between these radiators and the remainder of the particles in the gas. Since the number of radiators is usually a very small fraction of the total number of particles in the gas, the average distance between radiators will be large enough that any direct interactions between them are negligible. We will therefore regard the gas as being divided into a large number of cells in such a way that each cell contains one radiator and a large number of perturbing particles. We will assume that the particles at the border between adjacent cells are effectively at infinity with respect to the radiators; that is, the interaction between a radiator and a perturber at the edge of a cell is negligibly small. The distribution of particles at the border between cells will then be uniform (i.e., not altered by the presence of the radiators) and we may neglect all interactions between cells provided that we maintain this uniform distribution near the edges of the cells.
We have now represented the gas by a collection of noninteracting cells. Each cell may be described by a Hamiltonian of the form
| (1) |
where Ha is an unperturbed Hamiltonian for the radiator, Hp is an unperturbed Hamiltonian for the remaining particles in the cell (henceforth called the perturbers), and V represents the interaction between the radiator and these perturbers.
The probability per unit time that a cell, initially in the state ∣m>, will make a spontaneous dipole transition to another state, ∣n>, is [2]
| (2) |
where ∣m> and ∣n> are H eigenstates belonging to the eigenvalues Em and En, ωmn is the frequency difference and d is the dipole moment of the cell. Since we are interested only in line spectra, we may exclude the continuum radiation produced by the perturbers (bremsstrahlung, etc.) by excluding the dipole moment of the perturbers. That is, d will be the dipole moment operator for the radiator, thus ignoring all transitions which do not involve a change in the dipole moment of the radiator. The power radiated when a cell makes a dipole transition from ∣m> to ∣n> is just and the contribution this makes to the total power spectrum emitted by the gas is or
| (3) |
if we use delta functions as natural lines. If one wishes to use very narrow Lorentzian natural lines [3], it is necessary only to use a complex frequency variable in place of ω; this is of little practical importance although it is a mathematical convenience for some problems. In P(ω), negative values of ω (or ωmn) represent absorptive transitions.
The total power spectrum emitted by the gas is obtained by adding the contributions, P(ω), from each of the cells. Since all cells have the same spectrum of accessible states (H eigenstates), this will turn out to be a weighted sum of eq (3) over all possible initial and final states. If the total number of cells (total number of radiators in the gas) is NR and if the probability of finding a cell (one radiator plus its perturbers) in the initial state ∣m> is ρm, then the number of cells initially in the state ∣m> is NRρm. Since Pmn, given by eq (2), is the probability that a cell initially in the state ∣m> will make a dipole transition to the state ∣n>, any given transition ∣m> → ∣n> will occur in NRρmPmn of the cells in the gas. The power radiated by this group of cells is and their contribution to the total power spectrum is We obtain the total power spectrum for the gas by summing this expression over all possible ∣m> → ∣n> transitions. The total power radiated per unit frequency interval is thus given by
| (4) |
This expression is equivalent to summing eq (3) over all ∣m> → ∣n> transitions, weighting each term by NRρm, the number of cells initially in the state ∣m>.
In most line broadening problems one is interested only in a small range of frequencies which pertains to a single broadened line. In such a case the factor ω4 in eq (4) does not change appreciably, and most of the frequency dependence of P(ω) is contained in the sum over δ(ω − ωmn). We thus define a line shape function, I(ω), by
| (5) |
| (6) |
The evaluation of I(ω) is the central problem of line broadening theories. Since eq (6) is not multiplied by NR, we see that it is necessary to consider only a single cell (with appropriate weighting of the initial states) when evaluating I(ω). For this reason, most line broadening theories start out with a mathematical model which contains a single radiator immersed in a gas of perturbing particles. We have included this section merely to justify the use of such a model, and in all following sections we will refer to this model rather than the gas as a whole.
3. The Weak Coupling Approximation
Using the results of the previous section, it can easily be shown [4, 5, 6] that the line shape for spectral radiation is given by
| (7) |
where C(t) is a correlation function which has the property C*(t) = C(−t). For dipole radiation, C(t) is an autocorrelation function of the dipole operator d for the radiator. This function may be given by
| (8) |
where T(t) and ρ are the time development operator [7] and density matrix for the system (a radiator and its perturbers) and the trace is over all states of the system.
The autocorrelation function C(t) may be given various physical interpretations. Some of these interpretations are discussed in detail in section 7.
Following Fano [8, 9], we regard the system as being composed of two weakly coupling subsystems; subsystem “a” is the radiator (e.g., an atom, molecule, etc.) while subsystem “p” is the gas of perturbers. The Hamiltonian, H, for the system has been given in eq (1) by (Ha + Hp+ V) where Ha and Hp are Hamiltonians describing the radiator and perturber subsystems and V represents a coupling interaction between these subsystems [10].
We assume that the radiator and the gas of perturbers are statistically independent; that is, the state of the perturbers does not depend explicitly on the state of the radiator, and vice versa. This is the neglect of “back reaction” in the density matrix. A mathematical statement of this approximation has been given by Fano [9] in terms of the density matrix:
| (9) |
where ρ(a) and ρ(p) depend only on radiator and perturber variables respectively. As a first approximation we assume that each subsystem is in equilibrium:
| (10) |
For a canonical ensemble, ρ∝exp {− H/kT}: this requires that exp {− V/kT} be replaced by unity leaving ρ ∝ exp {− (Ha + Hp)/kT} which has the form required by eqs (9) and (10). The interpretation of this approximation is a matter of some importance and it will be discussed in detail in section 8.
The assumption of statistical independence, eq (9), permits one to write the correlation function, eq (8), in the form
| (11) |
where the subscripts on the trace operations denote a trace over states of the indicated subsystem [9]. The bracket 〈…〉 represents an ensemble average over the perturber subsystem. This average takes the place of the usual “thermal average.”
4. The Perturber Ensemble
We have assumed that the radiator and the perturber subsystems are statistically independent. The density operator, ρ(p), for the perturber subsystem will therefore be the same as the density operator for an isolated gas of N particles in a volume (the volume of a cell) maintained at a constant temperature T. Given the restrictions (N, , T), there are many possible configurations of positions, x = (x1, x2 …, xN), and velocities, v = (v1, v2, …, vN), for the particles which are consistent with these macroscopic constraints. Speaking quantum mechanically, we say that there are many different state functions, φ(x, t), which could describe such an isolated gas. We may index each of these possibilities with a superscript (i), and we imagine a collection of perturber gases, called an ensemble [11]. Each φ(i)(x, t) must be a solution of the Schrödinger equation,
| (12) |
which is normalized to unity:
| (13) |
Due to its mathematical nature, the concept of an ensemble is not readily given to simple physical interpretations. However, in the following section it will be shown that, by neglecting the influence of the radiator on the state of the perturbers which surround it (i.e., “back reaction” effects in the wave functions), a physical interpretation of the ensemble is possible. In that case we may regard each of the cells discussed in section 2 as being a member of the ensemble (provided that there are enough cells); that is, each φ(i) will then represent one of the possible states for the perturbers in a cell and, for a given state of the radiator, the superscript i may then be used to index the cells. It must be emphasized that in this section we are not representing the wave function for the perturbers in the cells by φ(i). At this point we are simply constructing an approximate density operator for these perturbers which is based on an isolated gas. In the following section we will use the φ(i) as basis functions in constructing wave functions for the perturbers in the cells, however, the treatment of the wave functions in the following section is distinct from the discussion of the statistics in this section.
For an isolated gas, energy would be a constant of the motion hence each φ(i) will be an eigenfunction of Hp [13]. If we define a complete orthonormal set of Hp eigenfunctions {fk(x)} by
| (14) |
| (15) |
then, from eqs (12) and (13), it is obvious that
| (16) |
Using the notation
| (17) |
Equation (16) may also be written in the form
| (18) |
The probability of finding the gas of perturbers in a particular state φ(i) will be denoted by q(i), and the q(i) will be normalized so. that
| (19) |
The density operator, ρ(p) is then defined [9] by a sum over all possible states φ(i), weighting each by its probability of occurrence q(i):
| (20) |
For the macroscopic constraints N, and T, the statistical weights are given by
| (21) |
where are the Hp eigenvalues. Since the states φ(i) are Hp eigenfunctions, they are also eigenfunctions of the operator exp(−Hp/kT)/Trp{exp(−Hp/kT)} having the eigenvalues q(i) as given by eq (21). Using this property and a closure relation for the φ(i) we may combine eqs (20) and (21) to obtain a more familiar form for ρ(p).
| (22) |
It is obvious, from this equation, that the perturber density matrix is diagonal between eigenstates of Hp. Since the states φ(i)(x, t) are eigenfunctions of Hp, the density matrix will be diagonal between these states. In the following section it will be assumed that φ(i)(x, t) may, under certain conditions, be viewed as a product of wave packets for each of the perturbers. We will then be able to state that the density matrix is “diagonal in wave packet states.”
5. Approximate Wave Functions
In this section, approximations will be made concerning the form of the wave functions for the system (radiator and perturbers). We will derive the wave functions used by Baranger [4], retaining the superscript (i) notation of the previous section in order to clarify the influence of the perturbers in these wave functions.
The wave functions for the system must be solutions of the Schrodinger equation
| (23) |
where x is a 3N-vector (x1, x2, …, xN), which denotes the positions of the N perturbers, and R denotes the internal coordinates of the radiator (e.g., for an atom, R denotes the positions of the orbital electrons).
We wish to expand the wave functions Ψ(R, x, t) in terms of some complete set of basis functions for the system. Since the functions fk(x), introduced in the previous section, form a complete set for the perturber subsystem and the Ha eigenfunctions, which we shall call gj(R), form a complete set for the radiator subsystem, we may expand Ψ(R, x, t) in the following manner:
| (24) |
where the ajk(t) are to be determined by using eq (23) and some boundary condition on Ψ.
Using eq (16) we may rewrite eq (24) in the form
| (25) |
where the coefficients
| (26) |
must be determined by requiring that Ψ satisfy the Schrödinger equation, eq (23), as well as some boundary condition.
At some initial time, t = 0, we may require that the perturber gas be represented by one of the wave functions φ(i):
| (27) |
That is, at some instant in time, the coupling interaction V will be small and the perturbers will behave like an isolated gas at that instant. Equation (27) constitutes the boundary condition for Ψ. Substituting eq (25) into the Schrodinger equation, eq (23), and using the orthogonality of the φ(k) we obtain a set of coupled equations for the coefficients χ(k):
| (28) |
where
| (29) |
| (30) |
We will now assume that the state of the perturbers is not affected by the interaction V; that is,
| (31) |
With this approximation the states φ(i) may be regarded as approximate V eigenfunctions,
| (32) |
as well as Hp eigenfunctions. This relation provides a mathematical statement of the neglect of “back reaction” in the wave functions. Since the state of the perturbers is now unaffected by the presence of the atom, the coupling term in eq (28) vanishes and, using the boundary condition given in eq (27), we readily obtain the approximate wave function
| (33) |
where χ(i) satisfies the equation
| (34) |
Comparing eqs (12), (29), (33), and (34) with eqs (16), (17), (21), and (22) of reference [4], we see that the approximate wave function Ψ(i) is identical with the approximate wave function introduced by Baranger. The superscript (i) now indicates the dependence of the total wave function, Ψ(i), on the various possible perturber configurations in a cell (as mentioned in the previous section). Although this dependence is implied by Baranger, it is not illustrated by such a parameter.
It is obvious that a functionΨ(i)(R, x, t) having the form given in eq (33) cannot rigorously satisfy the Schrödinger, equation, eq (23), because the latter is, in general, not separable equation. It would be possible to find solutions of the form χ(i)(R, x, t)φ(i)(x, t), but the “radiator wave function,” χ(i), would then depend explicitly on the perturber coordinates, x, as well as R and t. By neglecting “back reaction,” according to eq (32), we have obtained an approximate wave function of the form χ(i)φ(i) where χ(i) does not depend explicitly on perturber coordinates. The superscript (i) which appears on this “radiator wave function” indicates that χ(i) does depend implicitly on the state of the perturbers through the effective potential V(i)(t); that is, we get different functions V(i) for each of the possible states φ(i).
A general procedure for approximating the solution of the Schnidinger equation (23) by a function Ψ(i) having the form stated in eq (33) would be an iterative one. The first approximation uses φ(i) and χ(i) as given by eqs (12) and (34). If we chose to, we could obtain the next approximation to φ(i) by substituting Ψ(i)= φ(i)χ(i) into the Schrödinger equation (23), using eq (34) to remove some of the χ(i) terms. Multiplying by χ(i)* and integrating over dR then gives a differential equation for φ(i) which is similar in form to eq (34). That is, in addition to the Hp which appears in eq (12) we would have a potential energy function which is averaged over R. The φ(i) solution of this new equation is presumably an improvement over the solution of eq (12) since the former contains some back reaction effects; however, the improved φ(i) would no longer be an Hp eigenfunction. Since it is desirable that φ(i) be an Hp eigenfunction and thus diagonalize ρ(p), we will use the first approximation to φ(i) (i.e., the solution of eq (12)). If back reaction effects are important, it is not sufficient to use this first approximation. If a higher order approximation is used, the density matrix, ρ(p), will not be diagonal, as noted by Griem [14]. [In sec. 8 it will be shown that when these back reaction effects become important, the classical path approximation is breaking down anyway.] In this case, it would be more convenient to use the Hp eigenfunctions as a basis set in evaluating the trace over perturber coordinates (i.e., the thermal average as defined by eq (11)). The perturber density matrix would then be diagonal, but it would be necessary to make a fully quantum mechanical calculation of the time development operator (similar to that discussed by Baranger [4, 5]). Such a procedure would not be a classical path calculation and we will not consider this case any further.
In keeping with the usual classical path treatment, we next assume that each wave function φ(i)(x, t) may be expressed as a product of wave packet states for each of the perturbers:
| (35) |
The wave packets, are assumed to be sharply localized during the times of interest in line broadening. Although one does not usually think of energy eigenstates as being wave packets, it is nonetheless possible for a wave packet state to be an approximate eigenfunction of both Hp and V (as required by eq (32)) having eigenvalues and V(i)(t) which arc classical functions of perturber coordinates. The validity of this approximation is discussed in section (8).
We now note that the influence of the perturbers which is implicit in V(i)(t) may be specified by giving the locations, of the peaks of the packets as well as their group velocities, We thus complete the classical path picture by regarding the wave packets as classical particles described by their positions
Since the wave packets are assumed to be sharply localized, we may replace V(i)(t) by a classical potential function. That is,
| (36) |
where V(i)(R, x(i), v(i), t) is a classical potential function of the 3N-vectors x(i) and v(i), the operator R, and the time t.
6. The Thermal Average
To obtain an expression for the “thermal average” in eq (11), we must first obtain an expression for the time development operator, T(t), for the total system. Since we are approximating the wave function Ψ(R, x, t) by Ψ(i)(R, x, t), we will need the corresponding time development operator T(i)(t) which is defined by
| (37) |
From eqs (33) and (34), we see that this operator may be written in the form
| (38) |
where is the solution of
| (39) |
with the boundary conditions
Since d (see eq (11)) is the dipole operator for the radiator, it will commute with the perturber Hamil-tonian Hp and we may write
| (40) |
Substituting this identity into the “thermal average” in eq (11), we obtain
| (41) |
where ∣k> denotes an arbitrary basis vector for the perturber subsystem. Since has no explicit dependence on perturber coordinates (i.e., it depends on perturbers implicitly through the coordinates x(i), v(i) of the wave packets φ(i), we have
| (42) |
Substituting this identity into eq (41), we see that we need only the diagonal matrix elements of ρ(p) in the states ∣k>. Using the matrix elements of ρ(p) as given by eq (20) and noting the normalization condition of eq (16), we obtain
| (43) |
To complete our evaluation of the “thermal average,” we must express this weighted average in a form which is amenable to practical calculations.
Since q(i) is a function of defined by eq (21), we may express q(i) as a function of x(i) and v(i) by using
where Vp denotes the interaction between perturbers. If the radiator is an ion in a plasma, we may add a Coulomb interaction, V0(x(i)), to Hp [10]. This expression is consistent with the wave packet interpretation of φ(i)(x, t).
In the classical limit, the sum over (i) is replaced by an integral over x(i) and v(i). We may thus write q(i) in the form
| (45) |
where Q is the classical partition function
| (46) |
We may now write the “thermal average” in the form
| (47) |
where the time development operator is obtained by solving eq (39) with the classical potential V(i)(R, x(i), v(i), t). This expression will be reduced to the more familiar averages over collision variables (velocity, impact parameter, etc.) in II.
7. The Autocorrelation Function
7.1. The Influence of the Density Matrix and the Wave Functions
Before discussing the validity of the approximations concerning the density matrix and the wave functions (sec. 8), it is useful to first consider the roles they play in determining the line shape. Since the line shape has been given as a Fourier transform of the autocorrelation function, eq (7), it is convenient to study this function as well.
The autocorrelation function was defined in eq (8) by C(t) = Tr{d·d(t)ρ} Where d(t) = T†(t)dT(t) and ρ is an equilibrium density matrix for a radiator and its perturbers. Since both ρ and T(t) are functions of H (see eq (1)), they will commute at all times (i.e., ρ is stationary) and we may write
| (48) |
for any time t0. This relation is characteristic of a stationary random process and it indicates that when we speak of an initial time, we are referring to any arbitrarily chosen instant, not some time infinitely far in the past. In writing C(t) = Tr{d · d (t)ρ} we have arbitrarily chosen t = 0 as our initial time. Autocorrelation functions [15] such as C(t) fall to zero as t increases; this represents the loss of correlation between the dipole moment at the time t = 0 and its value at some later time.
We may regard the trace, in the defining relation for C(t), as a sum over all possible initial configurations of perturbers and initial states of the radiator; each term in this sum is weighted by the matrix element of ρ which gives the probability of finding the corresponding initial state. We may also interpret this trace as a sum over cells, with each initial state corresponding to a given radiator-perturber configuration as discussed in section 2. For each initial state there will be a dipole moment, d, and at some time t later this moment will have changed to d(t). Once the initial state has been specified, the temporal change in d is determined solely by the time development operator T(t), and is not influenced by the statistics (density matrix) in any way.
We may now think of the line shape as being obtained by taking the Fourier transform of the d · d(t) contribution from each initial state, and then summing these Fourier transforms, weighting each term by the appropriate density matrix element (the probability of occurrence of a given configuration in a cell). It is thus apparent that the density matrix can influence the line shape only by its weighting of each d · d(t) contribution; the wave functions influence I(ω) only by their effect on the calculation of each d · d(t) term. The influence of the density matrix is quite distinct from the influence of the wave functions, and similarly, approximations relevant to the statistics (density matrix) are completely independent of the approximations relevant to the dynamics (time development operator or wave functions).
7.2. Classical Correspondence
a. Classical Theories
In discussing any quantum mechanical theory it is frequently beneficial to have a classical correspondence available as an aid to understanding the physical properties underlying the mathematical description. In a classical path description of line broadening, the perturbers are regarded as classical particles, and there is little difficulty in understanding the physical properties of this subsystem. The radiator is treated quantum mechanically however, and it is useful to develop its classical analog to clarify the influence of collisions on the spectral radiation.
In classical line broadening theories one considers a classical oscillating dipole which emits a continuous wave train (referred to as the light train). The light train is described by the classical dipole moment d(t), and the line shape is given by a Fourier transform of the dipole autocorrelation function C(t) = < dd(t) > just as in eq (7); in this case < … > denotes a classical average over collisions. In the Lorentz interruption theory (see p. 571 of ref. [16]), it is assumed that a collision terminates the unperturbed light train d(t) = d exp (− iω0t) where ω0 is the unperturbed frequency. The autocorrelation function is therefore given by, C(t) = d2 ∫ P(t0)U(t < t0) exp (− iω0t)dt0, where P(t0) is the probability that a collision will occur after a time t0, and U(t < t0) is a step function which equals 1 before the collision occurs (i.e., when t < t0) and equals zero after the collision. Although the line shape is not always given in this form (cf. eq (1.2) of ref. [16] noting that P(t0) = νc exp (− νct0)), the results are the same. In another classical theory Weisskopf, and later Lindholm, assumed that collisions can change the phase of the light train [16, 17]. The line shape is again given by the Fourier transform of the light train autocorrelation function (see also eq (1.13a) of ref. [16] or eq (11.13) of ref. [17]) hence a phase shift decreases the phase correlation thereby broadening the Fourier spectrum (line shape). In these phase shift theories, d(t) has the form exp [− iω0t − iη(t)] where η(t) represents the phase shifts produced by collisions. The autocorrelation function is C(t) = d2 exp (− iω0t) 〈exp −iη(t)]〉 where < … > denotes an average over phase shifts (thermal average). The Weisskopf theory assumed that small phase shifts (less than unity) have no effect on C(t), whereas large phase shifts (greater than unity) destroy the phase correlation of the light train and cause C(t) to fall to zero. This effect is only apparent after one performs an average over a probability distribution of phase shifts. To show this, we consider a collision which occurs in the time interval (t, t + Δt) and produces a phase shift η0. If the duration of this collision τ is very short, τ < Δt, then we have C(t + Δt) = C(t) 〈exp (− iη0)〉 (the two factors are independent and are averaged separately as in the impact approximation; see sec. (11.3.2) of ref. [17]), where 〈exp (−iη0)〉 = ∫ exp (−iη0)P(η0)dη0 and P(η0) denotes the probability of finding a phase shift of magnitude η0. If P(η0) does not vary rapidly, then the rapid oscillation of exp (− iη0) for η0 > 1 will produce
(the integral from − ∞ to − 1 is also much less than 1) whereas the integral from − 1 to + 1 is on the order of unity. Thus, for η0 > 1, we have C(t + Δt) ≃ 0 and for η0 < 1, C(i + Δt) ≃ C(t). While the average over η0 is given more accurately by later phase shift theories (e.g., Lindholm’s theory), the qualitative results for the line shape stated by Weisskopf for η0 > 1 are basically unchanged. It is in this sense that we will regard a large phase shift as an effective termination of the light train.
b. Quantum Mechanical Theory
In the quantum mechanical problem the radiators in a gas emit a series of spectral lines which are broadened due to the perturbation V. We are usually interested in just one of these broadened lines corresponding to some given unperturbed frequency ω0. The photons which contribute to this particular line are emitted in radiative transitions between some particular initial and final energy levels of the radiator (which may be perturbed). In this case, the light train is the wave function for a photon emitted in such a transition; the amplitude of the light train gives the probability of finding a photon emitted after some time t. Baranger has shown that the function C(t) defined in eq (8) may be interpreted as the autocorrelation function for the light train amplitude (see p. 498 of ref. [4]). Thus the line shape is obtained from the Fourier transform of the light train autocorrelation function just as in the classical theories.
The effects of a collision may be roughly divided into two categories: adiabatic and nonadiabatic. We will discuss these separately although a single collision may have both effects. An adiabatic collision will simply shift the energy levels thereby producing a phase shift (which may be time dependent) in the time development operator T(t). Such collisions will reduce the phase correlation between d(0) and d(t) = T†(t)dT(t) just as the collisions in the classical phase shift theories. A nonadiabatic collision will induce a radiationless transition out of the initial state. This type of transition usually “terminates” the probability that the radiator will emit a photon with a frequency near ω0. However, if there is another state whose energy is very close to that of the initial state, nonadiabatic collisions may simply produce transitions back and forth between these two states without completely destroying the probability of emission near ω0. The latter situation is usually referred to as the problem of overlapping lines (although it is sometimes called a “phase-memory” effect) [18]. In a classical oscillator having more than one mode of oscillation, a nonadiabatic collision simply changes the oscillation from one mode to another. The Lorentz interruption theory implicitly assumes that the mode whose frequency is ω0 is the only mode of interest and a (nonadiabatic) collision terminates this mode. However, if there were another mode with a frequency very close to ω0, nonadiabatic collisions could switch the oscillator back and forth between these two modes without destroying the phase coherence of the light train (overlapping line case). For example, an oscillator in the mode ω0 will have a phase ω0t at some time t (apart from an arbitrary initial phase). If this oscillator is switched to the mode ω1 at some time t and, after a time t0, is switched back to ω0, the resulting phase shift in the natural oscillation is η ≃ [ω0(t + t0) − (ω0t + ω1t0)] = (ω0 − ω1)t0. Estimating t0 by the mean free time (l/νc) and noting that the collision frequency νc may be identified with the halfwidth Δω1/2, (in a Lorentz picture), we have η ≃ (ω0 − ω1)/Δω1/2. If (ω0 − ω1) > Δω1/2 (isolated lines), we have η > 1 and we may say that the light train is terminated by nonadiabatic collisions (as the Lorentz theory assumes). If (ω0 − ω1) < Δω1/2 (overlapping lines), we may say that the light train has retained a “memory of its initial phase.” Thus we see that nonadiabatic collisions are analogous to those considered by the Lorentz interruption theory only in the case of isolated lines. In the case of overlapping lines, nonadiabatic collisions are not treated by any classical theories although the classical analog is obvious. We also note that a collision which produces both adiabatic and nonadiabatic effects cannot be conveniently treated by classical theories either.
To make our discussion of the quantum mechanical C(t) more quantitative we will consider the case of a quantitative three-state radiator having a ground state with energy E0 and two excited states with energies E1 and E2. We will also assume that the ground state is not perturbed by the interaction V(t) and we use V00 = V01 = V02 = 0 (as well as Vij=Vji). A generalization to more complicated cases may be achieved by means of an obvious extension of the following arguments. The time development operator for the radiator (see eq (39)) will be written in an interaction representation defined by
| (49) |
where
| (50) |
and is the time ordering operator [19]. Notice that V0j = Vj0 = 0 implies that and for j ≠ 0; this simplification is the reason for neglecting ground state interactions.
If we are interested only in the line whose unperturbed frequency is we do not need all of the terms in the trace over states of the radiator (see eq (11)). We need only consider the terms
| (51) |
where exp (− Ej/kT). The natural oscillation exp (− iω10t) in the first term may be regarded as an unperturbed light train which causes the Fourier transform of this term (eq (7)) to be large in the vicinity of ω10. Thus, when calculating the line ω10 we need only consider those terms in eq (11) whose natural oscillation is close to ω10. The second term in eq (51) describes radiation near ω20; this term will be important (when calculating the ω10 line) only if the lines ω10 and ω20 overlap (i.e., if their half widths are on the order of (ω10 − ω20)). It will be shown later in this section that the phase of the off-diagonal terms such as varies according to the line spacing ω12; thus when the average time between collisions (which is on the order of l/Δω12) is small compared with 1/ω12, these off-diagonal terms will become important and phase memory effects (overlapping lines) will have to be considered. If ω10 and ω20 are isolated lines (i.e., their halfwidths are much smaller than ω12), the second term in eq (51) (as well as the off-diagonal T′ matrix elements) may be neglected. For some lines in the microwave region it may happen that the halfwidth of ω10 is on the order of ω10; in this case it would be necessary to add the “negative resonance” terms, exp (− iω01t), to eq (51).
If we factor the natural oscillation out of C(t), we may write where is a decreasing function of t [15]. The line shape (see eq (7)) is proportional to ∫ exp where Δω = (ω − ω10). Since exp (itΔω) oscillates rapidly for t > 1/Δω, the intensity at the point Δω is determined primarily by the part of for which
| (52) |
For this reason, 1/Δω is frequently referred to as the time of interest in evaluating the intensity at the point Δω.
If the duration of a collision is much longer than the time of interest, the perturber may be regarded as stationary during this time. This quasi-static approximation is frequently employed in the description of ions in a plasma [4, 6]. Such static perturbations simply add a constant to the oscillation frequencies ω10, ω20, etc. The influence of dynamic perturbations, for which the collision time is less than or the order of the time of interest, is obtained by calculating and We are interested in the classical correspondence for the quantum mechanical C(t) primarily as an aid in establishing a criterion for the so-called strong collisions (this will be done in sec. 7.3). When a single collision is capable of reducing C(t) to zero, it is referred to as a strong collision. To study this effect we need only consider the influence of a single collision on C(t). We will consider some particular type of collision (e.g., given values of impact parameter, velocity, etc.) which is described by the interaction potential V(t). We will assume that this is the first collision to occur after some arbitrarily chosen starting time t = 0 (if a collision does not reduce C(t) to zero, it is “weak,” and to find a strong collision criterion we may neglect the effects of all previous “weak” collisions). The collision will begin at some time t0 ⩾ 0, and we define a collision time τ such that V(t) = 0 unless t0 ⩽ t ⩽ t0 + τ. From eq (50) we see that T′ (t) = 1 when t < t0; when t > (t0 + τ), we change the variable of integration to t″ = (t′ − t0) and obtain
| (53) |
where S is the S-matrix for the collision,
| (54) |
and
| (55) |
The contribution to C(t) from this collision will be called C(V(t), t); this function will have the same form as C(t) in eq (51) except that there will be no average < … > over collisions. To obtain C(t) we would simply average C(V(t), t) over the collision parameters for the first collision (and all successive collisions if the first collision is not strong).
c. Isolated Lines
If the lines ω10 and ω20 are isolated, C(V(t), t) is given by exp (− iω10t). In this series (no ground state interaction) and is unimportant (for isolated lines only). As mentioned earlier, to show that is unimportant it is necessary to average over t0. This has been done for the line wings in eq (19) of ref. [20] and the second term in this equation, which contains the off-diagonal elements of the time development operator, is negligible in the case of isolated lines (see also the following section on overlapping lines). Using this fact we have
| (56) |
Since the complex function can always be written in the classical form exp [− iη(t) − ϵ(t)], the classical correspondence for C(V(t), t) becomes obvious. Collisions for which ϵ = 0 are equivalent to the collisions in the classical phase shift theories and those for which ϵ ≳ 1 are equivalent to the nonadiabatic collisions in the Lorentz theory. To illustrate the functional form of η and ϵ, we will evaluate for t > (t0 + τ), to lowest order in V(t). We will use V11 = V22 = 0 since V has no diagonal matrix elements in most problems (i.e., V usually has nonvanishing matrix elements only between states with different angular momenta). Using eqs (53), (54), and (55) we obtain
| (57) |
where ΔE21 = (E2 − E1) and denotes the average value of V21(t) during the collision. For fast collisions we have
| (58) |
For slower collisions we replace τ by ∞ when integrating the exponential in eq (57). We then replace ΔE21 by (ΔE21 + iδ), perform the integral, and then take the limit δ → 0. The justification for this procedure is the “radiation damping” which produces the natural line width (see sec. 13.A of ref. [3]). In this manner we obtain
| (59) |
These results may be summarized by
| (60) |
where
| (61) |
and
| (62) |
Notice that if the perturbation is not strong enough to mix adjacent levels, then ϵ ⪡ 1 and we have only a phase shift. That is, all collisions are adiabatic if If adjacent levels are mixed, we will have phase shifts produced by slow (adiabatic) collisions and inelastic transitions produced by fast (nonadiabatic) collisions.
For a series of collisions which do not overlap in time (or for which this overlap is negligible) the classical form is preserved and the net result is a phase shift η and a damping ϵ which are obtained by adding up the phase shifts and damping constants from each of the collisions. When two collisions overlap, quantum mechanical mixing effects occur and the classical form is destroyed.
d. Overlapping Lines
If the lines ω10 and ω20 are not isolated we must consider all terms in eq (51):
| (63) |
The term gives the probability that a radiator initially in the state E1 will remain in that state after a time t; the term gives the probability that a radiator initially in the state E2 will make a radiationless transition to E1 during the time t; together these two terms describe the probability, at the time t, that a radiator will emit a photon with a frequency near ω10. The remaining terms in eq (63) describe the probability of emission near ω20 (we must consider these terms when the lines ω10 and ω20 overlap).
When t < t0, the radiator is unperturbed hence and in this case, When the collision occurs, we get a probability for switching between the modes ω10 and ω20 as discussed earlier; that is, and increase in amplitude while and decrease. To show this effect, we will calculate the T′ matrix elements for t > (t0 + τ).
The condition for overlapping lines is since we may identify Δω1/2 with νc where (1/νc) is the mean free time between collisions and since τ is usually much less than 1/νc (at least for strong collisions), we must have Noting the results of eqs (57) and (58), we see that when the S matrix elements may easily be evaluated to all orders with the following results:
| (64) |
Using eqs (55), (63), and (64) we may give an explicit expression for C(V(t), t) in terms of To simplify the following discussion we will use C(V(t), t) = C1(V(t), t) + C2(V(t), t), where
| (65) |
and C2(V(t), t) contains the two ρ2 terms in eq (63). (C2 may be obtained by interchanging the indices 2 and 1 in C1.) We will discuss only C1 since the interpretation of C2 is obviously identical.
C1(V(t), t) describes the behavior of a radiator which was initially in the state E1. Before the collision
| (66) |
and after the collision, t > t0 + τ C1 is given by eq (65). cos gives the probability that the radiator will be in the state E1 after the collision, and sin2 gives the probability that the collision will induce a radiationless transition to the state E2. If the collision will definitely cause a transition to E2 and we would have
| (67) |
after the collision. Equation (67) describes a light train for the line ω20 with a phase shift ω21 t0 just as in the classical oscillator discussed earlier (the factor d10/d20 represents the arbitrary initial phase difference between the two modes as well as an amplitude difference if one should exist). This result is identical with the result for a two mode classical oscillator because, when equals some odd multiple of (π/2), there is definitely a change of mode; for other values of the quantum mechanical expression, eq (65), shows only a probability for this mode switching effect. Notice that the mode switching or phase-memory effects are described by the off-diagonal T′ matrix elements. These off-diagonal elements were not needed in the description of isolated lines because the average phase shift ω21t0 is greater than unity (recall that the average value of t0 is the mean free time (l/νc) which is estimated by l/Δω1/2) and these elements vanish after the t0 average.
In the case of overlapping lines, C(V(t), t) does not have the simple classical form which was obtained for isolated lines. In this case we would average C(V(t), t) over collision parameters by defining a quantity and a function P(ϵ) which gives the probability that the collision will produce a whose magnitude is ϵ. The average over C(V(t), t) contains terms like
If P(ϵ) does not vary rapidly then we may say that collisions for which ϵ > 1 effectively terminate the wave train (this argument is analogous to the treatment of large phase shifts in the classical theories).
7.3. Strong Collisions
In discussing the approximations relevant to the density matrix and the wave functions, our general procedure will be to show that each approximation breaks down only during a “strong collision.” It is thus convenient at this point to clarify our meaning of strong collisions and to discuss briefly the dynamics of certain types of collisions.
In the previous section it was shown that each collision produces a phase shift and changes the amplitude of the light train. In this manner, a series of collisions will eventually destroy the correlation between d(t) and d(0) and reduce C(t) to zero. It may happen that a single collision is capable of reducing C(t) to zero; such an event is defined to be a strong collision. From the results of the previous section we see that a strong collision is one which produces a large phase shift, η > 1, or a large decrease in amplitude, ϵ > 1. Such collisions terminate the light train and therefore satisfy the hypotheses of the Lorentz interruption theory when ϵ > 1, or Weisskopfs phase shift theory when η > 1. The general program of classical path theories is to describe all but the strong collisions assuming that these will be treated by some other method such as a Lorentz-Weisskopf approximation [21].
From the results of the previous section, we see that a strong collision will occur when
| (68) |
unless the level spacing is greater than V (cf eq. (61) using If we must replace the V in eq (68) by in this case, Veff is the actual perturbation of the radiator since we have used V11 = 0. It is in this sense that we use eq (68) as a working definition of strong collisions. It should be noted that our definition of a strong collision as one which terminates the light train is basically equivalent to the strong collisions in the impact theory [21] which are said to occur when the terms in the S-matrix expansion (see eq (54)) are on the order of unity or larger.
Most classical potential functions are inversely proportional to some power of the perturber position is the radiator-perturber separation at the time of closest approach and v(t) is the instantaneous perturber velocity. The maximum value of the potential where ∣r(t)∣ is smallest, namely when r(0) = ρ0. The half maximum occurs when (which may be taken as a definition of the collision time τ). If v(t) changes very little during a collision we may use v(τ) ≃ v(0) = v0 or ∣r(τ)∣ ≃ ∣ρ0 + τv0∣ ≃ 2ρ0. Since ρ0 · v0 = 0, the collision time is on the order of
| (69) |
where ν0 is the velocity at the time of closest approach. It can be shown by a rather lengthy analysis of hyperbolic trajectories, that this estimate is valid even when the radiator is an ion which accelerates a charged perturber.
If the radiator is electrically neutral, we may estimate ν0 by the thermal velocity If both radiator and perturber are charged particles, we may estimate ν0 by
| (70) |
where Zr and Zp denote the charge of the radiator and perturber respectively.
8. Validity Criteria
8.1. General
An expression for the “thermal average” has been derived, eq (47), using a classical path treatment. This expression, or one like it, is the starting point for most of the modern semiclassical line broadening theories and it may be regarded as the end result of the classical path approximation. It is therefore worthwhile to examine the validity criteria for this expression.
It has been shown that the classical path approximation, as it is used in line broadening theories, actually embodies several types of approximations. As we have stated them, they may be roughly divided into three classes; (1) the representation of the gas by a collection of noninteracting cells, (2) simplifications of the density matrix, and (3) a classical treatment of the perturber wave functions.
In order to justify the mathematical model which is used by most line broadening theories, we have assumed that the average spacing between radiators is large enough that all direct and indirect interactions between radiators are negligible. (By indirect interactions we mean that one radiator may influence a perturber which is interacting with another radiator.) It is probably possible to justify the mathematical model with more relaxed assumptions, however, since our assumptions are satisfied for virtually all problems of interest, this may not be necessary. For resonance broadening in neutral gases, this approximation may have to be reconsidered.
It should be noted that the radiator Hamiltonian Ha should describe translational motion of the radiators as well as their internal structure (i.e., bound states). In practice, however, one usually neglects this translational motion and calculates only the discrete spectrum. This procedure neglects Doppler shifts which also broaden a given line. The Doppler broadening may be reinstated, in an approximate manner, by means of convolution integrals [6]. This folding procedure should be valid if the radiators trajectory, during the time of interest, is not appreciably altered by collisions with other particles. For most plasma line broadening problems this criterion is well satisfied however, for some foreign gas broadening problems one may have to treat Doppler effects more carefully.
8.2. Density Matrix
The approximations relevant to the density matrix are for the purpose of achieving a product form, eq (8), which permits us to factor the trace operations, as in eq (11), and thereby perform the average over perturber states without affecting the sum over states of the radiator. In this weak coupling approximation, the subsystems are statistically independent and this is what is meant by the neglect of back reaction as it applies to the statistics (density matrix). This approximation has been made by replacing the term exp {− V/kT} in the Boltzmann factor by unity; that is, we have assumed that all V matrix elements (or at least those which pertain to the spectral line of interest) will satisfy V ⪡ kT. It should be noted that in obtaining this inequality we have tacitly assumed that the radiators and perturbers are in thermal equilibrium at a temperature T. For many line broadening experiments, however, the radiators and perturbers have different temperatures. Nonetheless, if the perturbers are in thermal equilibrium at a temperature Tp, the weak coupling approximation will still be valid for V ⪡ kTp and the perturber statistics will be described by a Boltzmann factor as in eq (47). In such a case, the statistical weights for the atomic states (matrix elements of ρ(a) would be determined from the rate equations which describe the population and depopulation of atomic levels. We will henceforth take V ⪡ kT, where T is the perturber temperature, as the validity criterion for the weak coupling approximation. It is obvious that this inequality will be violated for some interactions, however, it will be shown that the errors made by this approximation are important only in the far wings of a line where If the perturbation V ≳ kT is produced by a static ion (one which does not move appreciably during the time of interest, see eq (52)) in a plasma, the Stark splitting is on the order of kT, and it is immediately obvious that this will influence only the far wings, of a line.
When the perturbation V > kT is produced during a collision with a dynamic perturber (we will consider electrons as representatives of dynamic perturbers), the radiator-perturber separation must be on the order of (the thermal wavelength, or average de Broglie wavelength, for the perturber) or smaller [22]. Using V = kT, and ν0 − νav in eq (68), we see that, for neutral radiators, V ≳ kT can occur only during a strong collision (for most neutral radiators the energy spacing between adjacent states is less than kT and it is not necessary to consider Veff as discussed in sec. 7.3). If a charged perturber collides with an oppositely charged radiating ion, it will be accelerated by the Coulomb attraction V0, hence v0 > vav. Using V = kT, and v0 = vav(v0/vav) in eq (68), we find that is on the order of (vav/v0), which is less than 1. Thus, in this case, V ≿ kT need not correspond to a strong collision. In a collision involving a radiating ion and a charged perturber of like sign, the perturber will be decelerated, and it is unlikely that it will come close enough to the radiator to produce V ≿ kT. To show this, we note that the smallest value of ρ0 given by eq (70) is or in order that ρ0 be on the order of it is necessary that T be larger than 108 deg K.
Regarding C(t) as a weighted sum of d · d(t) terms as discussed in section 7.1, it is clear that the approximate density matrix ρ(a)ρ(p) will incorrectly weight only those d · d(t) terms which correspond to a radiator initially undergoing a V ≿ kT collision (i.e., the density matrix weights each term according to the state at t = 0). This does not preclude the possibility that V ≿ kT collisions may occur at later times, t > 0, in the remaining, correctly weighted, d · d(t) terms. Such events will indeed occur, however, they are determined by the dynamics of the system and this has no connection with the statistical weighting (density matrix) of the corresponding initial states. If we define a collision frequency νkT for the V ≿ kT collisions then the probability that such a collision will occur within a time τ of t = 0 is νkTτ. The number of incorrectly weighted d · d(t) terms will therefore be νkTτNT, where NT is the total number of terms. The number of correctly weighted terms will be (1 − νkTτ)NT, and the error resulting from the use of the weak coupling density matrix will be on the order of νkTτ/(1 − νkTτ). Since the duration, τ, of a V ≿ kT collision will be much smaller than the time between such collisions, we may replace (1 − νkTτ) by 1 and the error estimate becomes νkTτ. The error will be largest when all collisions produce V ≿ kT; in this case we may estimate the collision frequency by Δω1/2, the halfwidth of the observed line. The largest value of τ = ρ0/v0 for a V ≿ kT collision occurs for and v0 = vav (decelerated perturbers, v0 < vav are not likely to produce V ≿ kT). The largest value of τ is hence the largest error is on the order of Since is much less than kT for virtually all lines of interest, this error is negligible. A more detailed analysis of the incorrectly weighted terms shows that this error estimate breaks down when Δω ⪢ Δω1/2. In this case the time of interest is less than the average time between collisions (recall that Δω1/2 ≅ νkT for the worst case), and some of the correctly weighted d · d(t) terms will not suffer a collision during the time of interest. These “unperturbed” terms will have little influence on the line shape in the region Δω > Δω1/2 (their dominant influence is in the line center when Δω < Δω1/2). We must therefore compare the number of incorrectly weighted terms with (νkT/Δω)(1 —νkTτ)Nt (the number of correctly weighted terms which have undergone a collision during the time of interest, 1/Δω). In this case the error estimate becomes which is small if It must be emphasized that these error estimates are only an upper bound for the true error which, in most cases, is much less than or For neutral radiators V ≿ kT occurs only during a strong collision. Since a strong collision completely disrupts the light train, the weighted sum of all d · d(t) terms which correspond to a strong collision at t = 0 will have fallen to zero by the time these collisions are completed. That is, the halfwidth for the sum of all incorrectly weighted d · d(t) terms will be on the order of or smaller. The corresponding Fourier transform will thus be a constant for Since the incorrectly weighted terms add only a constant to I(ω) for the approximate density matrix will not affect the shape of I(ω) for neutral radiators. Since most V ≿ kT collisions are strong collisions even when the radiator is an ion, the above argument may be loosely applied to ion lines as well. We may therefore give the condition for validity of the weak coupling density matrix as
| (71) |
This condition was also obtained by Baranger [5, 23] within the framework of a quantum mechanical impact theory. We note, in passing, that the region of validity for the impact approximation is given τ0Δω ⪡ 1 where τ0 is some representative collision time (see sec. 5.1 of ref. [4]). Since we may state that when the impact approximation is valid the weak coupling density matrix may be used; the converse of this statement is not necessarily true.
8.3. Wave Functions
The dynamics of the system are needed only to calculate d(t) from d(0) for each given initial state; this requires a knowledge of T(t) but not Ψ(R, x, t). That is, since we are working with a trace, eq (8), it is not necessary to know the wave functions Ψ(R, x, t) for the system (although it is necessary to define a set of basis functions). Semiclassical approximations have been made in the wave functions simply to permit the use of a classical potential function, V(i)(R, x(i), v(i), t), in the time development operator. These semiclassical approximations, which are the hallmark of the classical path approximation, permit a simple physical interpretation of the dynamics of the system. More powerful quantum mechanical treatments are available [3,4]; however, in many cases, the results of these theories differ only slightly from the classical path treatment and the physical insight afforded by the latter is frequently advantageous.
The wave packet assumption was used to obtain the relation
| (72) |
where V(i) is a classical function of the classical position and velocity coordinates x(i) and v(i). V(R, x) can be expressed as a sum of interactions V(R, xj) between the radiator and each perturber (recall that x = (x1, x2, … xN)), thus we need only require that
| (73) |
when V(R, xj) ≠ 0 in order to obtain eq (72). Messiah [24] has shown that eq (73) is obtained if the perturber is represented by a classical wave packet [25, 26] and if the potential field seen by the perturber does not vary appreciably (as compared with the perturbers kinetic energy) over the width of the wave packet.
The conditions for validity of eq (73) are discussed in detail by Messiah and we shall only outline this discussion as it applies to our problem. We must show; (1) that it is possible to construct a wave packet which does not overlap with the wave functions for the other particles, (2) that this wave packet does not spread appreciably during a collision (i.e., when V(R, xj) is nonzero), and (3) that the potential seen by the perturber does not vary appreciably over the width of a wave packet. The potential seen by the perturbers may be obtained by iterating the Schrödinger equations for χ(i) and φ(i) to the next higher order as discussed in section 5. This potential will be the matrix elements of V(R, x) between atomic states or, in some cases, it will be Veff as discussed in section 7.3, (or some other type of optical potential) [27]. The momentum of a perturber moving in the potential V(R, x) is given by where E denotes the perturber energy. From this expression we see that Δp = mΔV/p, where Δp and ΔV denote the variation in p and V over the width of a wave packet. The condition that ΔV ⪡ p2/2m is thus equivalent to
| (74) |
which is the usual condition for validity of WKB calculations (see eq (7.13) of ref. [28] noting that λ(dp/dx) may be replaced by Δp). We will assume for the moment that this inequality can be satisfied, and it will be shown that the other validity conditions are easily met.
We wish to represent those perturbers which are not interacting with the radiator by Gaussian wave-packets of width Δxg and momentum spread Δpg ⪡ pav at some instant of time. For this wave packet construction we must require that Δxg be much less than the average particle spacing n−1/3 (n is the perturber density) so that the wave packets do not overlap. Using (for a Gaussian wavepacket), we see that the condition Δpg ⪡ pav also requires We must therefore be able to satisfy.
| (75) |
For virtually all problems of interest we have hence it will be possible to construct Gaussian wave packets for the overwhelming majority of perturbers at some instant of time.
Since static perturbers (e.g., ions in a plasma) are regarded as stationary during the time of interest, it is only necessary that they be represented by wave packets at some instant of time. As discussed above, this will be possible for the overwhelming majority of perturbers if The treatment of static perturbers will be discussed further in II and for the remainder of this section we will consider only dynamic perturbers.
During a collision a wave packet will generally increase both its width, Δx, and its momentum spread, Δp. If Δp ⪡ p (or ΔV < p2/2m, which will be verified shortly), the increase in momentum spread will be negligible, and for a collision of duration τ = ρ0/v0, the wave packet will increase its width to a maximum of Δxτ = (Δx + τΔp/m). To prevent overlap with the radiators wave function during the collision Δxτ should satisfy (Δxτ + R0) < ρ0, where R0 denotes the effective “extent” of the perturber wave function. Using τ = ρ0/v0 and Δp ⪡ p, we see that (τΔp/m) ⪡ ρ0 hence we need onlv require (Δx − R0) < ρ0.
For a collision involving a neutral radiator the perturber momentum may be taken to be pav, hence we may use Δx ≃ Δxg and Δp ≃ Δpg (which will then satisfy Δp ⪡ p). Since Δx may be within an order of magnitude of (and still satisfy ), we may obtain an order of magnitude estimate of the overlap condition, (Δx + R0) < ρ0, by considering If the wave functions will overlap only when and for neutral radiators, this produces a strong collision (see sec. 8.2). If a more detailed analysis of the perturbation potential is required; as an example we will consider the Stark broadening interaction, for which Since for temperatures of 106 K or less (for electron perturbers), we see that the overlap condition, R0 > ρ0, is violated only during a strong collision. Thus, for neutral radiators, the wave functions will overlap only during a strong collision. In the case where a radiating ion accelerates a charged perturber the deBroglie wavelength of the perturber is decreased, If v0 is not much larger than vav, there is essentially no difference between this case and the neutral radiator discussed above. If v0 ⪢ vav the situation is somewhat different. In this case the perturber will come very close to the radiator and it is necessary to construct the initial wave packet with a width Δx much smaller than Since we construct the wave packet so that which, using implies that Δp < p0. Such a wave packet will not hold together very long, however, it will hold together for a time on the order of τ = ρ0/v0 which is all that is required by eq (73). In this case we will have and the wave functions will overlap when ρ0 ≲ R0. The perturbation potential will be
we do not need to consider a Veff as discussed in section 7.3 because when ρ0 ≲ R0, V is comparable to the ionization energy for ions [29]. If v0 ⪢ vav then, from eq (70), we have v0 ≃ (2Ze2/mρ0)1/2 and
where is the Bohr radius (we have chosen m as the electron mass; for an ion perturber would be even larger). Since R0 is larger than a0, we see that the overlap of wave functions again produces a strong collision.
It is interesting to note that, for Stark broadening in plasmas we have where is the angular momentum of the perturber. Since R0 > a0, we will have strong collisions when l ≲ 1. Other interaction potentials may be treated in a similar manner with the same general results. In any case, the wave functions will not overlap if using and we find that the radiator and perturber will be distinct (i.e., their wave functions do not overlap) when
| (76) |
It is also interesting to note that the perturbers will look like classical particles as they pass the radiator only if Δx ⪡ x and Δp ⪡ p; using we again obtain the inequality or l > 1. We note further that Δl = ±1 in a dipole transition, thus Δl < l and it is in this sense that the angular momentum of a perturber is conserved during a collision, as it must be if we are to use classical mechanics to describe the perturbers.
We have now shown that the validity criteria for the use of classical wavepackets are all easily satisfied if Δp ⪡ p, or equivalently, ΔV ⪡ p2/2m. That is, the essential validity criterion for the classical wave-packets is the same as the condition for validity of a WKB calculation. This was also noted by Baranger [4] who found that the results of a quantum mechanical impact theory reduce to the classical path results when a WKB treatment of the wave functions is employed.
To show that the condition ΔV < p2/2m is satisfied we note that V increases monotonically as the radiator-perturber separation is decreased, thus ΔV ⩽ V and it is sufficient to show that V < p2/2m. Again V may be replaced by Veff if V is not strong enough to mix adjacent states. Using τ = ρ0/v0, and we obtain
| (77) |
Noting eqs (68) and (76), it is clear that this inequality can be violated only during a strong collision (recall that l ≲ 1 produces a strong collision).
Equation (73) requires that a perturber be represented by a wave packet when it collides with the radiator. If we let denote a Gaussian wave packet for the jth particle then the wave function φ(i)(xj, t) = Tj(t)φ(i)(xj, 0), where Tj is the time development operator for this particle, will be a Gaussian packet at some time tj if we construct φ(i)(xj, 0) such that
In this manner we can arrange the wave function φ(i)(x, t) = Πjφ(i)(xj, t) in such a way that all particles which collide with the radiator during the time of interest are represented by Gaussian wavepackets at the time of their collision. In general these particles will not be represented by wavepackets before or after their collisions, but this does not matter as far as eq (73) is concerned.
We have now shown that it is possible to represent the perturbers by. classical wavepackets as required by eq (73) unless a strong collision occurs (in which case we do not use wavepackets). It is interesting to note that it may be possible to treat some strong collisions by classical path methods. That is, eq (77) could be satisfied when if l is sufficiently large. If one wished to do this, it would be necessary to reexamine the validity criteria in more detail. For the purposes of this paper, however, it is sufficient to state that the use of classical wavepackets is justified for all collisions which are not strong collisions.
8.4. An Approximate Classical Potential
Although we have justified the use of a classical potential as given by eq (73), we have not yet specified its functional form. If back reaction effects in φ(i) are negligible, the potential V(i) will result from a perturber passing by the radiator on a straight line trajectory (or a hyperbola if both radiator and perturber are charged particles). If back reaction effects are important, the trajectory will be a rather complicated function of velocity, acceleration, etc. A wavepacket formed by WKB calculations (the iterative procedure discussed in sec. 5) may be able to account for the true classical trajectory, however, the perturber wave function φ(i) would no longer be an Hp eigenfunction. While this may not matter for some scattering problems, it makes a great deal of difference in line broadening.
In section 6 we used the fact that φ(i) is an Hp eigenfunction to obtain the classical energy expression in eq (44). If the perturber wave function were not an energy eigenfunction, the perturber density matrix would not be “diagonal in wave packet states” and the average over perturber states, eq (41), would be intractable in a wave packet representation. We must therefore show: (1) that φ(i) may be an Hp, eigenfunction (i.e., back reaction effects in φ(i) are negligible), and (2) that the energy eigenvalues may be represented by as stated in eq (44).
Since energy eigenvalues are not time dependent, we need only show that represents the Hp eigenvalue at some instant of time. This problem is essentially the same as eq (73) except that all perturbers must be represented by classical wave packets at some instant. Since we have a large number of perturbers in our system (a cell), there will always be some wave functions which overlap; however, for virtually all problems of interest we have and it is possible to construct wave packets for the overwhelming majority of perturbers at some instant of time (cf. eq (75)). It is in this sense that eq (44) is satisfied.
To show that back reaction effects in φ(i) are negligible, we consider the perturbation V(i) (x(i), v(i), t) which results when a perturber moves past a neutral radiator on a straight line trajectory (the trajectory will be a hyperbola if both perturber and radiator are charged particles). The true perturber trajectory will deviate from a straight line (or hyperbola) because of accelerations (back reaction) resulting from the potential V seen by the perturbers (this potential may be Veff as discussed in sec. 7.3). The approximate trajectory will be valid if this deviation does not appreciably alter the potential V(i) seen by the radiator during the collision (i.e., during the time τ ≃ ρ0/v0, or the time of interest 1/Δω, if this should be shorter). V(i) is influenced by the perturber trajectory through its dependence on r (t), the position of the perturber at the time t. If the deviation, Δr(t), produced by the perturbation is small enough that r(t) + Δr(t) ≃ r(t), the approximate trajectory will provide a good estimate of the true perturbation potential (whether the radiator is an ion or a neutral).
For a time on the order of τ, the magnitude of the deviation may be estimated by Δr = ∣F ∣ τ2/2m = ∣∇V∣τ2/m where F = −∇V is the back reaction force which causes the perturber to deviate from the approximate trajectory. Since V is inversely proportional to some power of r(t) we have ∣∇V∣ ≃ V/r(t). Using r(t) ⩾ ρ0 we have
| (78) |
where τ= ρ0/V0, and l > 1. This result shows that (Δr/r) ⪡ 1 unless a strong collision occurs.
The above argument shows that the approximate trajectory is valid (unless a strong collision occurs) for elastic collisions. If an inelastic collision should occur, the radiator and perturber will exchange an amount of energy ΔE. This will produce a sudden change in the perturber velocity, ΔE = mvΔv, which will give rise to a deviation Δr ⩽ τΔv in the trajectory. Again using τ0 = ρ0/v0, and l > 1, we obtain
| (79) |
In section 7.2 it was shown that the probability for inelastic transitions is essentially zero unless (for a more detailed treatment see ref. [30]). Thus the approximate trajectory is valid for both elastic and inelastic collisions.
It is interesting to note, that the change in the perturbers kinetic energy during the collision is a small fraction of its total kinetic energy.
9. Summary and Comments
It has been found that if the approximations which are used in the classical path treatment of line broadening break down, then a strong collision, denned by eq (68), is occuring. It should be noted that the converse of this statement is not necessarily true: that is, it may be possible to extend the classical path treatment of some problems into the region where In this case the validity conditions must be considered in more detail.
In the application of classical path methods, strong collisions are usually treated by interruption approximations based on the Lorentz-Weisskopf theory; the influence of all other collisions is evaluated by means of the time development operator as discussed in section 6.
In section 8.4 it was shown that back reaction effects on the perturber trajectory are negligible whenever a classical path calculation is justified. That is, the validity criteria for the use of classical wave packets break down at the same time as (or in some cases before) the validity conditions for the use of unperturbed trajectories.
It was noted that our definition of strong collisions, eq (68), is essentially the same as the result obtained by setting the lead terms in an S-matrix expansion to unity. However, using this S-matrix expansion until the lead terms are on the order of unity may be a very poor approximation of the S-matrix; indeed, most errors attributed to strong collisions stem from this series approximation. To avoid possible confusion, we wish to emphasize that this approximation concerning the S-matrix is not part of the classical path approximation and any errors resulting from it have no bearing on the validity of classical path methods. It is therefore quite possible to improve upon these so-called strong collision problems within the framework of the classical path approach. The series approximation has been improved for Lyman alpha lines [31], by summing terms in the series and, for hydrogen lines in general [32], by an approximate treatment of the exponential definition of the S-matrix.
The region of validity for the weak coupling density matrix has been given as We wish to emphasize once again that the approximations relevant to this density matrix are not influenced by the semi-classical treatment of the wave functions which is a part of the classical path approximation. The weak coupling density matrix may be (and has been [3, 4]) used in more rigorous quantum mechanical theories.
Footnotes
Figures in brackets indicate the literature references at the end of this paper.
10. References and Notes
- [1].Smith E. W., Vidal C. R., and Cooper J., J. Res. Nat. Bur. Stand. (U.S.), 73A (4), 405–420 (1969). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Schiff L. L., Quantum Mechanics, p. 261 (McGraw-Hill Book Co., Inc., New York: 1955). [Google Scholar]
- [3].Smith E. W., and Hooper C. F. Jr., Phys. Rev. 157, 126 (1967). [Google Scholar]
- [4].Baranger M., Atomic and Molecular Processes, chap. 13, edited by Bates D. (Academic Press Inc., New York, 1962). [Google Scholar]
- [5].Baranger M., Rand Corporation Report No. (RM-2118-AEC) ASTIA Document No. (AD 150692, February 1958. [Google Scholar]
- [6].Griem H. R., Plasma Spectroscopy, chap. 4 (McGraw-Hill Book Co., Inc., New York, 1964). [Google Scholar]
- [7].T(t) governs the time development of the system from any arbitrary time τ to the time t + τ; this operator may also be written T(t + τ, τ).
- [8].Fano U., Phys. Rev. 131, 259 (1963). [Google Scholar]
- [9].Fano U., Rev Mod. Phys. 29, 74 (1957). [Google Scholar]
- [10].If the radiator is an ion in a plasma, the Coulomb interaction V0, between this ion and the charged perturbers may be added to Hp because it does not depend on the internal structure of the radiator.
- [11].There are many different ways of treating ensembles; for a more detailed discussion of the approach which we use, the reader is referred to references [9] and [12]. In reference [12], the emphasis is placed on microcanonical ensembles, for which energy is one of the fixed macroscopic constraints; the results for a canonical ensemble, which we use, are stated in section (3 4) of that reference. Reference [9] gives a very good discussion of ensembles in general.
- [12].Mori H., Oppenheim I., and Ross J., Studies in Statistical Mechanics, pp. 240–255, edited by DeBoer J. and Uhlenbeck G. E. (North-Holland Publishing Co., Amsterdam, 1962). [Google Scholar]
- [13].It is not actually necessary that the φ(i) be energy eigen-functions. It is necessary only that each φ(i) be an eigen-function of some complete set of commuting operators (i.e., the φ(i) are pure states). Since there may be more than one complete set of operators, there may also be more than one set of pure states. For our purposes, it is convenient to use some set of operators which contains the Hamiltonian, Hp. Results obtained with other sets will be identical with ours, but are probably more complicated.
- [14].Griem H. R., Phys. Rev 165, 258 (1967), footnote 17. [Google Scholar]
- [15].An extensive study of relaxation functions of the autocorrelation type has been made by; Kubo R., J. Math. Phys. 4, 175 (1963), and Lectures in Theoretical Physics, edited by Downs W. E. and Down J. (Interscience Publisher, Inc., New York, 1959), Vol 1, p. 120. Applications to spectral line broadening in gases are discussed by Kubo in Fluctuation, Relaxation and Resonance in Magnetic Systems, edited by D. ter Haar (Plenum Press, New York, 1961), p. 49, and in sec. 2.c of Ref. 3. [Google Scholar]
- [16].Margenau H., and Lewis M., Rev. Mod. Phys. 31, 269 (1959). [Google Scholar]
- [17].Cooper J., Rep. Progr. Phys. (London: Institute of Physics and Physical Society) 29, 35 (1966), sec. (11.3). [Google Scholar]
- [18].Burshtein A. I., and Naberukhin Yu. I., Soviet Physics JETP (English trans.) 25, 799 (1967). [Google Scholar]
- [19].Schweber S., An Introduction to Relativistic Quantum Field Theory, pp 330–334 (Harper and Row Publishers Inc., New York, 1962). [Google Scholar]
- [20].Cooper J., Phys. Rev. Letters 17, 991 (1966). [Google Scholar]
- [21].Griem H. R., Kolb A. C, and Shen K. Y., Phys. Rev. 116, 4 (1959). [Google Scholar]
- [22].For various types of interactions one obtains different estimates of the impact parameter which gives rise to V ≿ kT; however the thermal wavelength estimate is within an order of magnitude for most problems of interest.
- [23].Baranger M., Phys. Rev. 112, 855 (1958). [Google Scholar]
- [24].Messiah A., Quantum Mechanics, Vol. 1, p. 66 (John Wiley & Sons, Inc., New York, 1961). [Google Scholar]
- [25].op. cit. ch. VI.
- [26].Dirac P. A. M., Principles of Quantum Mechanics, pp. 121–125 (Claredon Press, Oxford, 1958). [Google Scholar]
- [27].Mott N. F., and Massey H. S. W., The Theory of Atomic Collisions p. 332 and p. 350 (Claredon Press, Oxford, 1965). [Google Scholar]
- [28].Merzbacher E., Quantum Mechanics (John Wiley & Sons, Inc., New York, 1961). [Google Scholar]
- [29].For hydrogenic ions the level spacing is on the order of e2Z2/a0n3 and the ionization energy is (e2Z2/2n2a0).
- [30].Seaton M. J., Proc. Phys. Soc. 79, 1105 (1962). [Google Scholar]
- [31].Griem H. R., Phys. Rev. 140, A1140 (1965). [Google Scholar]
- [32].Shen K. Y., and Cooper J., Ap. J. 155, 37 (1969). [Google Scholar]
