Voicing produced by a constant velocity lung source

M S Howe; R S McGowan

doi:10.1121/1.4794385

. 2013 Apr;133(4):2340–2349. doi: 10.1121/1.4794385

Voicing produced by a constant velocity lung source

M S Howe ^1,^a), R S McGowan ²

PMCID: PMC3631246 PMID: 23556600

Abstract

An investigation is made of the influence of subglottal boundary conditions on the prediction of voiced sounds. It is generally assumed in mathematical models of voicing that vibrations of the vocal folds are maintained by a constant subglottal mean pressure p_I, whereas voicing is actually initiated by contraction of the chest cavity until the subglottal pressure becomes large enough to separate the vocal folds. The problem is reformulated to determine voicing characteristics in terms of a prescribed volumetric flow rate Q_o of air from the lungs—the evolution of the resulting time-dependent subglottal mean pressure $\bar{p}_(t)$ is then governed by glottal mechanics, the aeroacoustics of the vocal tract, and the influence of continued contraction of the lungs. The new problem is analyzed in detail for an idealized mechanical vocal system that permits precise specification of all boundary conditions. Predictions of the glottal volume velocity pulse shape are found to be in good general agreement with the traditional constant-p_I theory when p_I is set equal to the time averaged value of $\bar{p}_(t)$ . But, in all cases examined the constant-p_I approximation yields values of the mean flow rates Q_o and sound pressure levels that are smaller by as much as 10%.

INTRODUCTION

Numerical simulations of voiced speech usually determine the glottal flow in terms of a prescribed constant or slowly varying subglottal mean pressure (for recent examples see: Zhang et al., 2002; Zhao et al., 2002; Rosa et al., 2003; de Vries et al., 2003; Tao et al., 2007; Luo et al., 2008; Link et al., 2009; Zheng et al., 2011). The magnitude of the corresponding mean volume flow rate Q_o of air from the lungs is then deduced from the results of the simulation. In reality, however, the lung cavity contracts at a more or less fixed rate Q_o, and the subglottal driving pressure is time dependent and determined by flow continuity and the excess Q(t) − Q_o of the glottal volume velocity Q(t) over Q_o (t denotes time).

The constant pressure hypothesis was used in previous work by McGowan and Howe (2012) on Level I source-tract interactions [for which the motion of the vocal folds is prescribed; Titze (2008)], the assumption being that the pressure of air flowing from the lungs was approximately constant at the glottis, at least prior to any back-reaction from the subglottal system. In this paper we show that this assumption is unnecessary, and that an arguably more natural and rigorous boundary condition is the specification of the flow rate Q_o at the lungs without regard to the nature of the resulting pressure behind the glottis. This approach is more suited to modeling the practical situation where voicing is initiated by contraction of the chest cavity until the subglottal pressure becomes large enough to force apart the vocal folds—the subsequent evolution of the subglottal pressure and its mean value are then determined by the equations of motion, subject to the influence of continued contraction of the lung cavity.

We describe in this paper how this procedure can be incorporated into the reduced complexity equation for the glottal volume velocity Q. This is the “Fant equation” introduced in Fant's (1960) pioneering analysis of the problem. It determines Q in terms of forcing by the subglottal and supraglottal pressures and by the hydrodynamic pressures produced by jet formation at the glottis and jet interactions with vocal tract structures. The latter includes the false folds, whose contribution has not hitherto appeared explicitly in the Fant equation. McGowan and Howe (2010) argued from a simplified model that the false folds have a negligible impact on the glottal pulse amplitude, which is in agreement with the numerical predictions of Zhang et al. (2002) and Zheng et al. (2011).

On the other hand, several studies have revealed that structures within the supraglottal tract can substantially modify the voice source, either by changing the inertia of the glottal flow, or the spectrum of the supraglottal pressure. Titze (1994) and Titze and Story (1997) have pointed out that proper geometrical adjustment of the lower supraglottal tract (the epilarynx tube, the piriform sinuses, the pharynx) plays a crucial role in voice quality control. A relatively narrow epilarynx tube, for example, appears to promote interactions between higher formants and the glottal flow and produces rippling of the volume velocity pulse profile (Titze and Story, 1997). At the very least one might expect the false folds to produce an acoustic mass that could skew the glottal pulse beyond that calculated in their absence. Indeed, analytical modeling of source-tract interaction (McGowan and Howe, 2012) has confirmed the existence, established previously by Rothenberg (1981), Ananthapadmanabha and Fant (1982), and Fant (1986), of a rightward skewing of the glottal pulse for glottis frequencies smaller than either the first subglottal formant or supraglottal formant. The skewing is attributed to an effective mass-loading of the tracts at those frequencies.

A “level I” analysis is discussed in this paper of the modified Fant equation for the idealized mechanical model of the vocal system illustrated in Fig. 1. It would actually have been more satisfactory to extend the treatment given by McGowan and Howe (2012) which uses experimental data from measurements of the acoustic impedance on either side of the glottis. But, because the desired modification that properly accounts for steady lung contraction is contentious, it was decided to proceed first with a model that is simple enough to permit an exact comparison to be made of predictions of Q(t) and the subglottal pressure for a prescribed lung contraction rate Q_o with corresponding predictions derived from the constant-pressure-driven Fant equation. In view of the level I limitation, however, a proper account is taken of the influence on Q(t) of excitation by lung contraction at a prescribed rate Q_o and of jet interactions with the glottis and the false folds, but possibly related effects on the mechanics of vocal fold vibration are ignored.

Idealized configuration of the vocal tract used to illustrate voicing produced by steady contraction of the lung cavity at volume velocity *Q_o*. The upper tract span *ℓ_s* is in the x₃ direction, out of the plane of the paper.

The modified Fant equation is derived in Sec. 2 for the vocal tract model of Fig. 1. The level I numerical procedure for solving the equation is discussed in Sec. 3. Illustrative numerical results are analyzed (Sec. 4), and a comparison is made with predictions of the constant-pressure-driven Fant equation. The influence of the false folds on the skewing of the glottal pulse is also briefly discussed.

FORMULATION

Model configuration

The voice source is usually of sufficiently low frequency to permit the supraglottal tract to be modeled as a plane-wave guide. We shall therefore consider the idealized mechanical vocal system of Fig. 1, consisting of a nominally hard-walled upper tract of length L and cross-sectional area A terminated at its upper extreme by a “mouth,” and at its lower end by the glottis and false vocal folds. The subglottal tract will be treated also as a rigid, uniform duct of cross-section A that enters at distance L_s from the glottis the “lung complex” modeled by a plenum of cross-section A_L ≫ A and length H. The hard wall boundary condition will be relaxed in Sec. 2D in order to take into account damping induced by small amplitude vibrations of the walls. The glottis is taken to have the simplified form of a narrow duct of rectangular cross-section A_g(t) and streamwise length $ℓ_{g} ≪ \sqrt{A}$ , which opens and closes at a nominal frequency f_o. Take the origin of coordinates x = (x₁, x₂, x₃) at the geometric center of the opening of the glottis into the upper tract, with x₁ directed along its axis. The upper tract has a rectangular cross-section of span l_s ≫ l_g in the x₃ direction (out of the plane of the paper in Fig. 1) and width w (parallel to x₂). It will be assumed that l_s = 1.5 cm and w = 2 cm, as in the recent numerical simulations of Zheng et al. (2011). The idealized symmetric approximation of Fig. 1 incorporates simplified false vocal folds, also based on the model of Zheng et al., which was derived from a high resolution laryngeal CT scan of a normal, 30 yr old male. The false folds have a span of l_s and an overall length of 1.5 cm. The area A_f of the rectangular channel between the false folds is ∼0.34A.

Voicing is initiated by contraction of the lung cavity causing a rise in subglottal pressure. In the model of Fig. 1, steady contraction of the cavity is achieved by a piston-like motion of the lower end [at x₁ = −(L_s + H) in Fig. 1] with constant volume velocity Q_o. This results in a very low Mach number flow into the upper tract that is interrupted by the periodic opening and closing of the glottis. The air emerges into the upper tract as a succession of “puffs” of volume velocity Q(t), the latter being equivalent to the effective monopole source strength of the sound radiated into the supraglottal tract.

Equation for the glottal flow

Fant's (1960) treatment of voicing was based on an equation for the glottal flow, which may be regarded as locally incompressible, and where the inertia of the fluid column in motion through the glottis is balanced against an aggregate of forces consisting of a constant subglottal overpressure p_I, back-pressure associated with turbulence losses and interactions with the upper tract, and viscous forces at the walls. Subsequent analyses involving increasingly sophisticated applications of this basic equation have been discussed extensively in the literature and reviewed by McGowan and Howe (2012). Howe and McGowan (2011) derived the general form of Fant's equation from the equations of aerodynamic sound. For the model of Fig. 1 it takes the form

ρ_{o} \bar{ℓ} \frac{d Q}{d t} + ρ_{o} \int_{V} (\nabla Y \cdot ω \land v) (y, t) d^{3} y = A (p_{-} - p_{+}),

(1)

where ρ_o is the mean air density, v ≡ v(y,t) is the velocity at y and time t in the fluid volume V,ω = curl v is the vorticity, and p_± ≡ p_±(t) are, respectively, the pressures just downstream of the false folds and upstream of the glottis.

The auxiliary function Y(y,t) is a solution of Laplace's equation $\nabla^{2} Y = 0$ that defines the unique velocity potential of a hypothetical incompressible flow from y₁ < 0 to y₁ > 0 through the glottis and false folds. It is normalized to have unit speed in the positive y₁ direction within the uniform upper and lower tracts (where $\partial Y / \partial y_{1} = 1$ ), with normal velocity $\partial Y / \partial y_{n} = 0$ on the instantaneous configuration of the glottis and vocal tract walls (see Howe, 2002, for a detailed discussion]. The length $\bar{ℓ} \equiv \bar{ℓ} (t)$ is the effective column length of fluid involved in unsteady motion through the glottis, given by

\bar{ℓ} (t) = \int (\frac{\partial Y}{\partial y_{1}} (y, t) - 1) d y_{1},

(2)

where the integration path passes in the positive direction through the glottis between sufficiently distant points within the lower and upper tracts at which $\partial Y / \partial y_{1} = 1$ .

Contributions to Eq. 1 from additional surface integrals involving Y(y,t) have been neglected. These represent surface viscous forces within the glottal region and the influence of normal motions of the glottis wall and neighboring tissue, which were shown by Howe and McGowan (2012) to be responsible for small modifications of the glottal pulse profile; they have been ignored for the purpose of the present discussion.

The vortex force

The integral

F (t) = ρ_{0} \int_{V} (\nabla Y \cdot ω \land v) (y, t) d^{3} y

(3)

represents the drag force F(t) exerted on the glottis and false folds by vortex structures (“turbulence”) in the flow (Howe, 2002, Sec. 4.4.2). This force acts to modulate the airstream through the glottis. It is equal and opposite to the vortex-surface interaction force on the fluid in the glottal region, and its evaluation requires a detailed knowledge of the complex flow near the glottis. This is not normally available, but the dominant contribution will obviously be supplied by the mean characteristics of the glottis jet, and for the configuration of Fig. 1 we can put F = F_g + F_f, where F_g, F_f, respectively, denote the force components produced respectively by jet interactions with the glottis and the false folds.

The glottis component F_g can be evaluated approximately by use of a simple quasi-static, “free-streamline” model of the jet (Howe and McGowan, 2011; McGowan and Howe, 2012). The jet in the vicinity of the idealized glottis is modeled as in Fig. 2, where the vorticity is confined to thin vortex sheet shear layers at the outer edge S_J of the jet. Then $ω \land v = (1 / 2) U_{σ}^{2} δ (s_{⊥}) n$ , where U_σ(t) is the jet velocity just inside the shear layer (constant along the free streamline), s_⊥ is the distance measured in the direction of the outward unit normal n from the jet, and the vorticity is convected at half the free streamline velocity. Figure 2 illustrates schematically the family of “streamlines” near the glottis of the velocity potential Y(y,t). The main contribution to the integral is from the section of the jet close to the glottis, where these streamlines cut across the edge of the jet and spatial variations of U_σ can be neglected, so that

\frac{F_{g}}{ρ_{o}} \equiv \int_{V (t)} \frac{\partial Y}{\partial y} \cdot ω \land v d^{3} y \approx \frac{U_{σ}^{2} (t)}{2} \oint_{S_{J}} \frac{\partial Y}{\partial y} \cdot n d S = \frac{(A - σ A_{g}) U_{σ}^{2}}{2},

(4)

where σ = σ(t) is the jet contraction ratio, and the surface integral is just equal to the net flux (A − σA_g) through the jet boundary of the hypothetical flow defined by the velocity potential Y(y,t).

Local streamline pattern of the hypothetical flow through the glottis defined by the velocity potential Y (y,t) intersecting the vortex sheet boundary of the idealized jet.

This result for F_g does not depend on the precise functional form of Y(y,t). However, some knowledge of the behavior of Y(y,t) in the vicinity of the false folds is necessary to calculate the component F_f in terms of the vorticity convecting between the folds. In principle Y(y,t) can be found by numerical integration of Laplace's equation. But more insight is obtained by use of the following approximation (Rayleigh, 1945, Sec. 308; Howe, 2002, p. 80)

Y (y, t) ≃ \int_{0}^{y_{1}} \frac{A}{S (ξ, t)} d ξ,

(5)

where S(ξ,t) is the cross-sectional area at any point y₁ = ξ within the entire vocal tract at time t. In our case S(y₁,t) is time dependent only within the glottis (−ℓ_g < y₁ < 0), where S(y₁,t) = A_g(t).

Equation 5 is strictly valid when the cross-sectional area varies “slowly” with position in the duct. But it nonetheless provides a good and physically correct approximation when used to evaluate the force integral [Eq. 3]. Formula (5) determines only the component $\partial Y / \partial y_{1}$ of $\nabla Y$ . The equation of continuity can be used to obtain an improved approximation to $\nabla Y$ , and for the present two-dimensional duct geometry one finds (Rayleigh, 1945, Sec. 308)

\frac{\partial Y}{\partial y_{1}} = \frac{A}{S (y_{1}, t)}, \frac{\partial Y}{\partial y_{2}} = - y_{2} \frac{\partial}{\partial y_{1}} (\frac{A}{S (y_{1}, t)}) .

(6)

The jet emerging from the glottis expands laterally and can be expected to “wet” the surfaces of the false folds. The lateral velocities are small, however, compared with the axial mean speed, and to evaluate F_f it will be assumed that the vorticity convects parallel to the duct axis through the section of cross-sectional area A_f between the false folds at the mean speed Q_o/A_f. Thus

F_{f} (t) ≃ - ρ_{o} \int y_{2} {(ω \land v)}_{2} (y, t) (\frac{A}{A_{f}} - 1) (δ (y_{1} - ℓ_{f}) - δ (y_{1} - ℓ_{t})) d^{3} y,

(7)

where the main contributions to F_f arise from vortex-surface interactions at the leading and trailing edges, respectively, x₁ = ℓ_f,ℓ_t, of the idealized false folds, where $\nabla Y$ is singular.

However, vorticity interacting with a “trailing edge” x₁ = ℓ_t (labeled B in Fig. 1) would in practice induce the shedding of new vorticity into a wake. When this shedding is ignored the force calculated from Eq. 7 has two components, from vortex elements passing the leading and trailing edges. The overall force, however, also involves a contribution from the shed vorticity. A similar problem arises in the calculation of the force produced by a turbulent “gust” in mean flow past an airfoil. In that case it is known that in a first approximation the force component attributed to the wake is equal and opposite to that generated by the gust at the trailing edge (Howe, 1976, 2002, Sec. 6.3). This happens because shedding acts to smooth the trailing edge flow, removing potential flow singularities that would otherwise occur. Therefore the leading order effect of the wakes of the false folds can be determined, without knowledge of details of the shed vorticity, merely by deleting the singularity δ(y₁ − ℓ_t) from the integrand of Eq. 7 and ignoring the contribution to the remaining integral from the wake vorticity. Of course, the trailing edge of a false fold is actually tapered and much smoother than the sharp corner B of Fig. 1 (see Fig. 1 of Zheng et al., 2011). But separation of the mean flow must still occur in this region, which will again reduce its contribution to F_f. The wake was neglected in the analogous false folds problem discussed by McGowan and Howe (2010), where surface wetting by the mean jet was ignored.

If lateral expansion of the jet is ignored over the relatively short distance ∼ℓ_f between the glottis and the false folds, the integration in Eq. 7 can be performed by means of the vortex sheet approximation used above for Eq. 4, to obtain

\frac{F_{f} (t)}{ρ_{o}} ≃ - σ (\frac{A}{A_{f}} - 1) [\frac{A_{g} (t) U_{σ}^{2} (t)}{2}],

(8)

where the square brackets [] indicate that the enclosed quantity is to be evaluated at the “retarded” time at which the vorticity passing the leading edge x₁ = ℓ_f at time t emerged from the glottis.

Combining this result with F_g of Eq. 4, we find

\frac{F (t)}{ρ_{o}} \equiv \int_{V} (\nabla Y \cdot ω \land v) (y, t) d^{3} y ≃ \frac{(A - σ A_{g} (t)) U_{σ}^{2} (t)}{2} - σ (\frac{A}{A_{f}} - 1) [\frac{A_{g} (t) U_{σ}^{2} (t)}{2}] .

(9)

The contribution from the false folds is smaller than that from the glottis by a factor of order A_g/A_f. It is also of opposite sign, and represents the effect of a “suction” force at the leading edge of the false folds acting on the mean flow in the +x₁ direction, whereas the glottis component F_g opposes the motion through the glottis. This reduces the overall flow resistance through the glottis produced by vortex-surface interactions, in agreement with numerical simulations reported by Zhao et al. (2002), and should therefore reduce the subglottal pressure required to maintain a given mean volume velocity Q_o.

The pressures p₊ and p₋

The pressure fluctuations p₊ in Eq. 3 are determined by the volume source of strength Q(t) at the glottis of the flow into the upper tract. Standard acoustic analysis for the supraglottal duct of Fig. 1 (cf. Murray and Howe, 2012) provides the Fourier representation

p_{+} = - \frac{i ρ_{o} c_{o}}{2 π A} \int \int_{- \infty}^{\infty} Q (τ) \frac{\sin (k_{o} \bar{L}) e^{- i ω (t - τ)}}{\cos (k_{o} \bar{L})} d ω d τ,

(10)

where k_o = ω/c_o, c_o is the speed of sound, and the length $\bar{L}$ is equal to the interior duct length L suitably augmented to account for the end-correction of the open mouth.

The pressure p₋ near the glottis in the lower tract is governed by the net volumetric rate of inflow, comprising the steady inflow Q_o due to lung contraction and the outflow Q(t) through the glottis. By making the usual assumptions of continuity of pressure and volume velocity at the junction x₁ = −L_s, we find

p_{-} = \frac{i ρ_{o} c_{o}}{2 π A} \int \int_{- \infty}^{\infty} (Q_{o} - Q (τ)) \frac{[A \cos (k_{o} L_{s}) \cos (k_{o} H) - A_{L} \sin (k_{o} L_{s}) \sin (k_{o} H)] e^{- i ω (t - τ)}}{[A \sin (k_{o} L_{s}) \cos (k_{o} H) + A_{L} \cos (k_{o} L_{s}) \sin (k_{o} H)]} d ω d τ .

(11)

In both of these formulas causality requires the path of integration with respect to ω to pass above all singularities (simple poles) of the integrands.

These expressions are strictly applicable in the absence of damping. Thus, the undamped poles for the upper tract pressure p₊ [Eq. 10] occur at the resonance frequencies $ω = ω_{n} = (n - 1 / 2) π c_{o} / \bar{L} (- \infty < n < \infty) .$ Damping in the upper tract is produced by thermo-viscous wall-losses, open-end radiation (consistent with the idealized model of Fig. 1), and flexural motions of the tract walls. These effects cause the poles to be shifted into the lower complex plane, perturbing in general both their real and imaginary parts (Stevens, 1998). In a first approximation we can write

ω = \pm ω_{n} - i ϵ_{n}, n \geq 1, ω_{n} = (n - \frac{1}{2}) \frac{π c_{o}}{\bar{L}}, ϵ_{n} > 0.

(12)

Evaluation by residues of the integral [Eq. 10] then gives

p_{+} = ρ_{o} c_{o}^{2} \sum_{n = 1}^{\infty} Z_{n} (t),

(13)

where

Z_{n} = \frac{2}{A L} \int_{- \infty}^{t} Q (τ) \cos ω_{n} (t - τ) e^{- ϵ_{n} (t - τ)} d τ, n \geq 1.

(14)

The integrand of Eq. 11 for the subglottal pressure p₋ contains a simple pole at ω = 0, which yields the contribution ${\bar{p}}_{-}$ , say, given by

{\bar{p}}_{-} = ρ_{o} c_{o}^{2} W_{0} (t), W_{0} (t) = \frac{1}{V} \int_{- \infty}^{t} (Q_{o} - Q (τ)) d τ,

(15)

where V = AL_s + A_LH is the mean volume of the lung cavity and the subglottal tract.

When A_L ≫ A and ω ≠ 0

\frac{A \cos (k_{o} L_{s}) \cos (k_{o} H) - A_{L} \sin (k_{o} L_{s}) \sin (k_{o} H)}{A \sin (k_{o} L_{s}) \cos (k_{o} H) + A_{L} \cos (k_{o} L_{s}) \sin (k_{o} H)} ≃ - \frac{\sin (k_{o} L_{s})}{\cos (k_{o} L_{s})} .

The undamped subglottal resonance frequencies therefore correspond approximately to the quarter-wave modes of a duct of length L_s open at one end. When the influence of damping is included the perturbed resonance frequencies are taken in the form

ω = \pm ω_{n}^{s} - i ϵ_{n}^{s}, n \geq 1, ω_{n}^{s} = (n - \frac{1}{2}) \frac{π c_{o}}{L_{s}}, ϵ_{n}^{s} > 0,

(16)

so that the net subglottal pressure p₋, including the zero frequency pole contribution ${\bar{p}}_{-}$ , becomes

p_{-} = ρ_{o} c_{o}^{2} (W_{0} (t) + \sum_{n = 1}^{\infty} W_{n} (t)),

(17)

where

W_{n} (t) = \frac{2}{A L_{s}} \int_{- \infty}^{t} (Q_{o} - Q (τ)) \cos ω_{n}^{s} (t - τ) e^{- ϵ_{n}^{s} (t - τ)} d τ, n \geq 1.

(18)

The Fant equation

The results of Eqs. 9, 13, 18 and the relation Q = σA_gU_σ now permit the Fant Eq. 1 for the idealized vocal system of Fig. 1 to be cast in the form

\frac{\bar{ℓ}}{A} \frac{d Q}{d t} + (1 - σ \frac{A_{g}}{A}) \frac{Q^{2}}{2 σ^{2} A_{g}^{2} (t)} - (\frac{A}{A_{f}} - 1) [\frac{Q^{2} (t)}{2 σ A A_{g} (t)}] = c_{o}^{2} (W_{0} (t) + \sum_{n = 1}^{\infty} {W_{n} (t) - Z_{n} (t)}),

(19)

where the glottal column length $\bar{ℓ}$ is given by

\bar{ℓ} (t) = ℓ_{g} (\frac{A}{A_{g} (t)} - 1) + (ℓ_{t} - ℓ_{f}) (\frac{A}{A_{f}} - 1) .

(20)

In general the time dependent variation of the glottis cross-section A_g(t) is known, or determined by an equation that is to be solved simultaneously with Eq. 19. The solution Q(t) of Eq. 19 vanishes identically in the absence of lung contraction, i.e., unless Q_o ≠ 0 in the definition [Eq. 15] of W₀.

Interpretation of $\bar{p}_= ρ_{o} c_{o}^{2} W_{0} (t)$

Integration of the continuity equation $\partial ρ / \partial t + ρ_{o} div v = 0$ over the entire volume V of the subglottal region, and use of the linear, adiabatic formula $δ p = δ p / c_{o}^{2}$ , relating changes in pressure δp and the density δρ, reveals that

\frac{V}{ρ_{o} c_{o}^{2}} \frac{d \bar{p}}{d t} = Q_{o} - Q (t),

(21)

where $\bar{p} (t)$ is the space-averaged subglottal pressure. Therefore

\bar{p} \equiv {\bar{p}}_{-} = ρ_{o} c_{o}^{2} W_{0} (t) .

(22)

It follows fairly obviously from Eq. 21 that during periodic voicing characterized by a limit cycle solution of the Fant Eq. 19, the mean volume velocity $〈 Q (t) 〉 \equiv Q_{o}$ , where

〈 Q (t) 〉 = f_{o} \oint Q (t) d t,

the integration being over a complete period 1/f_o of the glottal motion.

Equations 21, 22 also permit the Fant Eq. 19 to be re-cast in the form

\frac{d^{2} {\bar{p}}_{_}}{d t^{2}} + ω_{H}^{2} \bar{p}_= ω_{H}^{2} ρ_{o} {(1 - σ \frac{A_{g} (t)}{A}) \frac{Q^{2} (t)}{2 σ^{2} A_{g}^{2} (t)} - (\frac{A}{A_{f}} - 1) [\frac{Q^{2} (t)}{2 σ A A_{g} (t)}] + c_{o}^{2} \sum_{n = 1}^{\infty} {W_{n} (t) - Z_{n} (t)}},

(23)

where $ω_{H} = \sqrt{c_{o}^{2} A / V \bar{ℓ}}$ is the instantaneous value of the resonance frequency of the “Helmholtz resonator” formed by the entire subglottal region of volume V with a small opening at the glottis (Rayleigh, 1945). Equation 23 determines the space-averaged subglottal pressure $\bar{p}_(t)$ produced by lung contraction, nonlinear jet forces, and acoustic modes within and outside the cavity.

The constant-pressure-driven Fant equation

Most discussions of the Fant equation assume that the motion is driven by a constant subglottal pressure p_I, rather than the contraction rate Q_o of the lungs. To derive this approximation the space-averaged subglottal pressure term W₀(t) in Eq. 19 is replaced by its time-averaged value $〈 W_{0} (t) 〉 = 〈 \bar{p}_(t) 〉 / ρ_{o} \equiv p_{I} / ρ_{o}$ . Similarly, we must set Q_o = 0 in the definition [Eq. 18] of W_n. This yields the constant-pressure-driven Fant equation

\frac{\bar{ℓ}}{A} \frac{d Q}{d t} + (1 - σ \frac{A_{g} (t)}{A}) \frac{Q^{2}}{2 σ^{2} A_{g}^{2} (t)} - (\frac{A}{A_{f}} - 1) [\frac{Q^{2} (t)}{2 σ A A_{g} (t)}] = \frac{p_{I}}{ρ_{o}} + c_{o}^{2} \sum_{n = 1}^{\infty} {W_{n} (t) - Z_{n} (t)} .

(24)

The value of the volumetric inflow velocity Q_o no longer provides a boundary condition for this equation and the ultimate voice source; its value must be determined from the solution of Eq. 24 and the formula $Q_{o} = f_{o} \oint Q (t) d t .$

Other special cases

The Fant Eq. 19 can be modified further to treat idealized representations of two alternative geometrical configurations that are frequently discussed in the literature.

(1)
Exposed glottis: The upper tract downstream of the false folds is removed. Then p₊ = 0 to a very good approximation, and the required Fant equation is obtained from Eq. 19 by discarding $\sum_{n = 1}^{\infty} Z_{n} .$
(2)
Non-reflective upper tract: $c_{o}^{2} \sum_{n = 1}^{\infty} Z_{n}$ in Eq. 19 is replaced by c_oQ(t)/A. This is equivalent to modeling the upper tract as a semi-infinite duct.

THE LEVEL I APPROXIMATION

Prescribed glottal motion

The numerical analysis of the Fant Eqs. 19, 24 will be based on Titze's (2008) level I approach, whereby the character of the solution generated by contraction of the lungs is examined for prescribed cyclic variations of the glottis cross-section A_g(t). This is done using the formula $A_{g} (t) = A {a_{0} + (1 / 2) a_{1} (1 - \cos (2 π f_{o} t))}$ , where f_o Hz is the fundamental voicing frequency and a₀, a₁ are suitable coefficients. The coefficient a₀ must be assigned a small positive value (≪ a₁) to avoid instability in the numerical solution. The maximum open area is then ∼a₁A.

In practice, however, the glottis closes for a finite time t_c, say, during each cycle. To model this, the following more general formula will be used

\frac{A_{g} (t)}{A} = {\begin{matrix} a_{0,} & for 0 \leq f_{o} t - [f_{o} t] < f_{o} t_{c} \\ a_{0} + \frac{a_{1}}{2} (1 - \cos (2 π \frac{{f_{o} (t - t_{c}) - [f_{o} t]}}{1 - f_{o} t_{c}})), & for f_{o} t_{c} < f_{o} t - [f_{o} t] < 1, \end{matrix}

(25)

where [f_ot] denotes the largest integer not exceeding f_ot. The open duty factor of the glottis is then equal to 1 − f_ot_c.

The jet contraction ratio

Calculation for an orifice of uniform rectangular cross section indicates that the glottis-jet contraction ratio σ varies abruptly between 0.61 and 1.1 during opening and closing of the glottis (Howe and McGowan, 2010). This is consistent with the experimental findings of Park and Mongeau (2007), although the variation must actually be a smooth function of the time. Discontinuities associated with rapid jumps in σ will therefore be avoided by using in the Fant equation the following smoothed representation of the variation:

σ = 0.61 + \frac{0.49}{(a_{0} + a_{1})} \frac{A_{g} (t)}{A} .

(26)

A similar smoothing formula was used by Zanartu et al. (2007) to model vocal fold mechanics.

Numerical procedure

The Fant equation is solved by fourth order Runga-Kutta integration applied to Eq. 19 and to the corresponding system of equations satisfied by W₀(t), W_n(t), and Z_n(t), n ≥ 1, viz.,

\frac{d W_{0}}{d t} = \frac{Q_{o} - Q (t)}{V},

(27)

\begin{matrix} \frac{d W_{n}}{d t} = \frac{2 (Q_{o} - Q (t))}{A {\bar{L}}_{s}} - ϵ_{n}^{s} W_{n} - ω_{n}^{s} W_{n}' \\ \frac{d W_{n}'}{d t} = ω_{n}^{s} W_{n} - ϵ_{n}^{s} W_{n}' \end{matrix}} n = 1, 2, \dots,

(28)

\begin{matrix} \begin{matrix} \frac{d Z_{n}}{d t} = \frac{2 Q (t)}{A L} - ϵ_{n} Z_{n} - ω_{n} Z_{n}^{'} \\ \frac{d Z_{n}^{'}}{d t} = ω_{n} Z_{n} - ϵ_{n} Z_{n}^{'} \end{matrix} \end{matrix}} n = 1, 2, ...,

(29)

where

\begin{matrix} W_{n}' = \frac{2}{A {\bar{L}}_{s}} \int_{- \infty}^{t} {Q_{o} - Q (τ)} \sin ω_{n}^{s} (t - τ) e^{- ϵ_{n}^{s} (t - τ)} d τ \\ Z_{n}^{'} = \frac{2}{A \bar{L}} \int_{- \infty}^{t} Q (τ) \sin ω_{n} (t - τ) e^{- ϵ_{n} (t - τ)} d τ, \end{matrix}} n = 1, 2, \dots

(30)

(cf. Howe and McGowan, 2011).

The modal coefficients W_n(t), Z_n(t) govern the subglottal and supraglottal cavity mode contributions to the pressures $p_{\mp}$ , and are small when $2 π f_{o} ≪ ω_{n}^{s}$ , ω_n. This means that it is usually permissible to truncate the infinite systems [Eqs. 28, 29] at n ∼ 3, i.e., by taking account of the first three formants of the lower and upper tracts. The integration is started at t = 0 subject to the initial conditions Q = 0, $W_{0} = W_{n} = W_{n}^{'} = Z_{n} = Z_{n}^{'} = 0 (n \geq 1)$ .

Periodic solutions of Eq. 19 must satisfy the condition

f_{o} \oint Q (t) d t = Q_{o},

which provides a convenient check on convergence to a limit cycle solution.

NUMERICAL RESULTS

Model vocal tract parameters

Sample solutions of the Fant Eq. 19 and of the constant-pressure-driven approximation [Eq. 24] are now discussed. Typical parameter values for the simplified model of Fig. 1 are given in Table Table I., based on those used in the recent investigation of Zheng et al. (2011).

Table I.

Parameter values for the ideal vocal system of Fig. 1.

Parameter	Value
Glottal length ℓ_g	3 mm
Glottis and duct span ℓ_s	15 mm
Glottis coefficient a₀	0.00005
Glottis coefficient a₁	0.1
Duct width w	20 mm
False fold ℓ_f	5 mm
False fold ℓ_t	20 mm
False fold A_f/A	0.34
Volume of subglottal cavity V	4000 ml
Glottis frequency (nominal) f_o	120 Hz
Density of air ρ_o	1.23 kg/m³
Speed of sound c_o	350 m/s

Open in a new tab

The first subglottal formant $ω_{1}^{s} / 2 π \equiv F_{s} 1 \sim 620 Hz$ (Ishizaka et al., 1976). Therefore for the purpose of illustration the three subglottal formants F_s1, F_s2, F_s3 and their corresponding half-power bandwidths Δf are taken to be defined as in Table Table II. (Ishizaka et al., 1976; Zanartu et al., 2007; Howe and McGowan, 2012).

Table II.

Subglottal frequencies.

	F_s1	F_s2	F_s3
Frequency, Hz	620	1860	3100
Bandwidth Δf, Hz	200	150	100

Open in a new tab

These values are consistent with the simple rectilinear model of the subglottal tract of Fig. 1 and the frequency formula [Eq. 16]. The half-power bandwidths supply the dissipation rates $ϵ_{n}^{s}$ by means of the formula

ϵ_{n}^{s} = π {(Δ f)}_{n} .

(31)

Numerical results will be discussed for a notional neutral vowel phoneme corresponding to supraglottal formants F1, F2, F3 consistent with the rectilinear duct model of Fig. 1. They are defined along with their bandwidths (estimated from Olive et al., 1993) in Table Table III..

Table III.

Supraglottal formants.

	F1	F2	F3
Frequency, Hz	500	1500	2500
Bandwidth Δf, Hz	50	75	100

Open in a new tab

Influence of glottal frequency

The glottis volume velocity Q(t) is determined by the Fant Eq. 19 entirely in terms of the prescribed value of the lung cavity contraction rate Q_o. The details of the glottal pulse depend on the acoustic properties of both the subglottal region (Table Table II.) and the articulatory state of the upper vocal tract. Figure 3 depicts a survey of possible neutral vowel waveforms, each for fixed values of Q_o but for a range of increasing values of the glottis frequency f_o. In all cases f_ot_c = 0.3, so that the glottis duty factor is 0.7.

Illustrating the variation with *f_o* of limit cycle glottal pulse profiles Q(t)/*Q_o* predicted by the Fant Eq. 19 for the conditions of Table Table I., when *Q_o* = 200 cm³/s, *f_ot_c* = 0.3. The subglottal resonances are defined as in Table Table II., and the calculations are performed for the neutral vowel defined by the upper tract formants of Table Table III..

The contraction rate is fixed at Q_o = 200 cm³/s and f_o is increased in stages from 120 to 700 Hz. This interval encompasses the subglottal resonance at F_s1 = 620 Hz and the first supraglottal formant F1 = 500 Hz. Each of the frames [Figs. 3(a)–3(j)] depicts three limit cycles of the glottal pulse Q(t)/Q_o plotted against f_ot. In case (a) f_o = 120 Hz is well below the two resonance frequencies, and the smooth volume velocity pulse is slightly skewed to the latter half of the glottis open phase. This skewing is due almost entirely to the back-reaction on the glottis flow of the acoustic modes in the upper and lower tracts. The back-reactions are weak because f_o ≪ F_s1, F1, but the skewing is absent only when the contributions to Eq. 19 from modes in both tracts are ignored (cf. the discussion below of Fig. 4). The contribution to skewing from the augmentation of the slug length $\bar{ℓ}$ produced by the narrowed passage between the false folds is much smaller (Rothenberg, 1981; Ananthapadmanabha and Fant, 1982; Fant, 1986; Titze, 1994; Titze and Story, 1997), and does not become noticeable until A_f/A is smaller than about 0.1 (Fig. 6).

Illustrating the limit cycle variations (—) of A_g(t)/*A, Q*(t)/*Q_o*, ${\bar{p}}_{-} (t)$ Pa for *f_o* = 120 Hz, *f_ot_c* = 0.3, *Q_o* = 200 cm³/s. The mean subglottal pressure *p_I* = 478.41 Pa. The broken-line curve (- - - -) is the non-skewed profile of Q(t)/*Q_o* predicted when back reactions from acoustic modes in the upper and lower tracts are ignored. The dotted curve (• • •) is the volume velocity predicted by the constant-pressure-driven Fant Eq. 24 for *p_I* = 478.41 Pa, for which *Q_o* = 180.46 cm³/s.

Volume velocity profile skewing produced by narrowing of the gap between the false folds. The figure compares limit cycle volume velocity profiles for *Q_o* = 320 cm³/s, *f_o* = 120 Hz, a₀ = 0.06, *t_c* = 0 in the two cases *A_f*/A = 0.1 (—) and *A_f*/A = 1 (- - - -).

The appearance of a “knuckle” at the front of the glottal pulse when f_o = 170 Hz [labeled A in Fig. 3b] indicates the increasing influence of the first formant F1. The knuckle advances toward the peak of the volume velocity profile with increasing frequency [Figs. 3c, 3d]; the distinct double peak in Fig. 3c occurs at the subharmonic f_o = 250 Hz of F1. In Fig. 3d the secondary peak is maintained by subglottal interactions at the subharmonic 310 Hz of F_s1. At higher frequencies the double peak disappears, and Fig. 3e reveals a new disturbance B associated with the first formant F1 advancing toward the velocity peak. At f_o = 480 Hz the formation of a secondary ripple C is evident. The combination leads to the triple peak profile of Fig. 3g at the resonance condition f_o = F1. The central peak in this profile is produced by interaction with F2; when the contribution from the second formant is omitted from Eq. 19 the resonant profile assumes a characteristic double peak form, with a deep central minimum that is typical of resonance forcing (Lighthill, 1978; Howe and McGowan, 2011). At higher frequencies the volume velocity profile becomes skewed to the first half of the glottal cycle. Resonance forcing at the subglottal frequency f_o = F_s1 [Fig. 3i] does not exhibit a double peak behavior, because of the heavy damping of the subglottal modes.

Figures 4 5 display the combined limit cycle variations of A_g(t)/A, Q(t)/Q_o, and the subglottal space-averaged pressure $\bar{p}_(t)$ Pa for special cases of the neutral vowel. Figure 4 is for case (a) of Fig. 3, i.e., for f_o = 120 Hz, Q_o = 200 cm³/s and f_ot_c = 0.3, typical of quiet speech. The back-reaction of the upper and lower tract resonant modes produce skewing of the volume velocity profile to the latter half of the open phase of the glottis. The back-reaction is relatively mild because of the large disparity between the glottal frequency f_o and F_s1 and the first formant F1. In the absence of these modes there is no skewing of the corresponding Q(t)/Q_o-profile (- - - -), obtained by discarding W_n, Z_n, n ≥ 1 in Fant Eq. 19 and the systems [Eqs. 28, 29]. The mean subglottal pressure $〈 \bar{p}_(t) 〉 \equiv p_{I} = 478.41$ Pa, as indicated by the broken line in the upper part of Fig. 4, and $\bar{p}_(t)$ varies over a narrow range of about ±15 Pa about this mean.

Limit cycle variations (—) of A_g(t)/*A, Q*(t)/*Q_o*, $\bar{p}_(t)$ Pa for *f_o* = 250 Hz, *f_ot_c* = 0.3, *Q_o* = 273.78 cm³/s. The mean subglottal pressure *p_I* = 1000 Pa. The dotted curve (• • •) is the volume velocity predicted by theconstant-pressure-driven Fant Eq. 24 for *p_I* = 1000 Pa, for which *Q_o* = 258.09 cm³/s.

The profiles in Fig. 5 are for f_o = 250 Hz, f_ot_c = 0.3 and Q_o = 273.78 cm³/s, the latter having been adjusted to yield a mean subglottal pressure p_I = 1000 Pa—a value frequently used in voicing studies. The glottis frequency f_o is a subharmonic of the formant F1 = 500 Hz, and the variation of the volume velocity Q(t)/Q_o is strongly influenced by interaction with this resonant mode. The subglottal mean pressure $\bar{p}_(t)$ exhibits a near saw-tooth cyclic waveform over ±10 Pa about the mean.

Solution of the constant-pressure-driven Fant equation

Inspection of Figs. 4 5 reveals that the departures of the space-averaged subglottal pressure $\bar{p}_(t)$ from its mean value p_I do not exceed about ±3%. It might therefore be surmised that corresponding predictions of the constant-pressure-driven Fant Eq. 24 should, in such cases, be similar or very close to those of the “exact” Eq. 19.

To examine this hypothesis Q(t) has been calculated from Eq. 24 for the conditions of Figs. 4 5 by setting the pressure p_I in the equation respectively equal to the mean values $〈 \bar{p}_(t) 〉 = 478.41$ and 1000 Pa calculated from the Fant Eq. 19. The numerical solution of Eq. 24 is then used to evaluate the corresponding mean volume velocity $Q_{o}^{'} = f_{o} \oint Q (t) d t$ , say, and thence the ratio $Q (t) / Q_{o}^{'}$ , which is plotted as the dotted curves (• • •) in Figs. 4 5. This procedure is seen to yield an excellent approximation to the fractional volume velocity waveform Q(t)/Q_o determined by Eq. 19.

However, in all cases the predicted mean volume velocity $Q_{o}^{'} < Q_{o}$ , as indicated in Table Table IV.. To obtain equality of the mean volumetric flow rates (and therefore of the predicted speech sound pressure levels) it is necessary to increase the magnitude of the constant driving pressure p_I in Eq. 24 to the respective values labeled $p_{I}^{'}$ in the table.

Table IV.

Compared predictions of Eqs. 19, 24.

f_o (Hz)	Q_o (cm³/s)	$Q_{o}^{'}$ (cm³/s)	p_I (Pa)	$p_{I}^{'}$ (Pa)
120	200.00	180.46	478.41	549.10
250	273.78	258.09	1000.00	1089.47

Open in a new tab

Influence of the false folds on profile skewing

According to Rothenberg (1981), Ananthapadmanabha and Fant (1982), Fant (1986), Titze (1994), and Titze and Story (1997), constriction of the glottis flow by a narrowing of the lower end of the supraglottal tract is one factor that causes skewing of the volume velocity wave pulse. Narrowing can occur in the model of Fig. 1 by reducing the area A_f between the false folds, which has the effect of increasing the glottal column length $\bar{ℓ}$ [see Eq. 20], and therefore of slowing the initial rate of rise of the velocity pulse. This is illustrated in Fig. 6, which displays the waveforms for the conditions: f_o = 120 Hz, Q_o = 320 cm³/s, and t_c = 0, when a₀ = 0.06 [corresponding to a maximum glottis area A_g(t) = 18 mm²] in the two cases A_f/A = 0.1 (—) and A_f/A = 1 (- - - -).

CONCLUSION

Voiced speech arises from vibrations of the vocal folds produced by air forced to flow through the glottis by contraction of the lung cavity. However, mathematical representations of this mechanism have largely been formulated in terms of a prescribed subglottal pressure p_I applied to the folds, the amplitude of the pressure being fixed to accord with experiment. But experiment has also determined the characteristic rate Q_o of volumetric airflow produced by lung contraction which flows through the glottis during voicing. The actual value of the subglottal pressure $\bar{p}_(t)$ determined by Q_o is not constant, even when the lungs contract at a constant rate. The relation between these parameters has been investigated in terms of the Fant equation for an idealized mechanical vocal system that is simple enough to permit precise specification of all boundary conditions.

The approximate, constant-pressure-driven Fant equation in which p_I is set equal to $〈 \bar{p}_(t) 〉$ yields predictions of Q(t)/Q_o that are generally in excellent agreement with those obtained from the exact equation. However, in all cases examined it is found that the absolute level of the mean flow rate Q_o calculated from the approximate equation can be up to 10% smaller than for the exact equation. This implies also that there would be corresponding discrepancies in the predicted sound pressure levels. The differences are admittedly small, and can be removed by suitably increasing by a few percent the driving pressure p_I in the approximate equation, but the conclusion suggests that it would be worthwhile to extend the present investigation to a more realistic model of the vocal system. Such calculations should be done at Titze's (2008) “level II,” by including a separate equation of motion for the vocal fold vibrations, and should also incorporate a geometrically precise representation of vocal tract area variations, perhaps by use of concatenated cylindrical elements (Lighthill, 1978).

ACKNOWLEDGMENT

This work was supported by a subaward of Grant No. 1R01 DC009229 from the National Institute on Deafness and other Communication Disorders to the University of California, Los Angeles.

References

Ananthapadmanabha, T. V., and Fant, G. (1982). “ Calculation of the true glottal flow and its components,” Speech Comm. 1, 167–184. 10.1016/0167-6393(82)90015-2 [DOI] [Google Scholar]
de Vries, M. P., Hamburg, M. C., Schutte, H. K., Verkerke, G. J., and Veldman, A. E. P. (2003). “ Numerical simulation of self-sustained oscillation of a voice-producing element based on Navier-Stokes equations and the finite element method,” J. Acoust. Soc. Am. 113, 2077–2083. 10.1121/1.1560163 [DOI] [PubMed] [Google Scholar]
Fant, G. (1960). Acoustic Theory of Speech Production (Mouton, The Hague: ), Sec. A2. [Google Scholar]
Fant, G. (1986). “ Glottal flow: Models and interaction,” J. Phonetics 14, 393–399. [Google Scholar]
Howe, M. S. (1976). “ The influence of vortex shedding on the generation of sound by convected turbulence,” J. Fluid Mech. 76, 711–740. 10.1017/S0022112076000864 [DOI] [Google Scholar]
Howe, M. S. (2002). Theory of Vortex Sound (Cambridge University Press, Cambridge: ), Secs. 4.4.2 and 6.3, p. 80 and Chap. 3. [Google Scholar]
Howe, M. S., and McGowan, R. S. (2010). “ On the single-mass model of the vocal folds,” Fluid Dyn. Res. 42, 015001. 10.1088/0169-5983/42/1/015001 [DOI] [PMC free article] [PubMed] [Google Scholar]
Howe, M. S., and McGowan, R. S. (2011). “ Production of sound by unsteady throttling of flow into a resonant cavity, with application to voiced speech,” J. Fluid Mech. 672, 428–450. 10.1017/S0022112010006117 [DOI] [PMC free article] [PubMed] [Google Scholar]
Howe, M. S., and McGowan, R. S. (2012). “ On the role of glottis-interior sources in the production of voiced sound,” J. Acoust. Soc. Am. 131, 1391–1400. 10.1121/1.3672655 [DOI] [PMC free article] [PubMed] [Google Scholar]
Ishizaka, K., Matsudaira, M., and Kaneko, T. (1976). “ Input acoustic-impedance measurement of the subglottal system,” J. Acoust. Soc. Am. 60, 190–197. 10.1121/1.381064 [DOI] [PubMed] [Google Scholar]
Lighthill, J. (1978). Waves in Fluids (Cambridge University Press, Cambridge: ), p. 119. [Google Scholar]
Link, G., Kaltenbacher, M., Breuer, M., and Doellinger, M. (2009). “ A 2D finite-element scheme for fluid-solid-acoustic interactions and its application to human phonation,” Comput. Methods Appl. Mech. Eng. 198, 3321–3334. 10.1016/j.cma.2009.06.009 [DOI] [Google Scholar]
Luo, H., Mittal, R., Bielamowize, S., Walsh, R., and Hahn, J. (2008). “ An immersed-boundary method for flow-structure interaction in biological systems with applications to phonation,” J. Comput. Phys. 227, 9303–9332. 10.1016/j.jcp.2008.05.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
McGowan, R. S., and Howe, M. S. (2010). “ Influence of the ventricular folds on a voice source with specified vocal fold motion,” J. Acoust. Soc. Am. 127, 1519–1527. 10.1121/1.3299200 [DOI] [PMC free article] [PubMed] [Google Scholar]
McGowan, R. S., and Howe, M. S. (2012). “ Source-tract interaction with prescribed vocal fold motion,” J. Acoust. Soc. Am. 131, 2999–3016. 10.1121/1.3685824 [DOI] [PMC free article] [PubMed] [Google Scholar]
Murray, P. R., and Howe, M. S. (2012). “ On the thermo-acoustic Fant equation,” J. Sound Vib. 331, 3345–3357. 10.1016/j.jsv.2012.03.014 [DOI] [Google Scholar]
Olive, J. P., Greenwood, A., and Coleman, J. (1993). Acoustics of American English Speech: A Dynamic Approach (Springer-Verlag, New York: ), pp. 104 and 208. [Google Scholar]
Park, J. B., and Mongeau, L. (2007). “ Instantaneous orifice discharge coefficient of a physical, driven model of the human larynx,” J. Acoust. Soc. Am. 121, 442–455. 10.1121/1.2401652 [DOI] [PubMed] [Google Scholar]
Rayleigh, Lord (1945). Theory of Sound (Dover, New York: ), Vol. 2, Sec. 308. [Google Scholar]
Rosa, M. D. O., Pereira, J. C., Grellet, M., and Alwan, A. (2003). “ A contribution to simulating a three-dimensional larynx model using the finite element method,” J. Acoust. Soc. Am. 114, 2893–2905. 10.1121/1.1619981 [DOI] [PubMed] [Google Scholar]
Rothenberg, M. (1981). “ Acoustic interaction between the glottal source and the vocal tract,” in Vocal Fold Physiology, edited by Stevens K. N. and Hirano M. (University of Tokyo Press, Tokyo: ), pp. 305–328. [Google Scholar]
Stevens, K. N. (1998). Acoustic Phonetics (MIT Press, Cambridge, MA: ), pp. 55–152. [Google Scholar]
Tao, C., Zhang, Y., Hottinger, D. G., and Jiang, J. J. (2007). “ Asymmetric airflow and vibration induced by the Coanda effect in a symmetric model of the vocal fold,” J. Acoust. Soc. Am. 112, 2270–2278. 10.1121/1.2773960 [DOI] [PubMed] [Google Scholar]
Titze, I. R. (1994). Principles of Voice Production (Prentice Hall, Upper Saddle River, NJ: ), p. 72. [Google Scholar]
Titze, I. R. (2008). “ Nonlinear source-filter coupling in phonation: Theory,” J. Acoust. Soc. Am. 123, 2733–2749. 10.1121/1.2832337 [DOI] [PMC free article] [PubMed] [Google Scholar]
Titze, I. R., and Story, B. H. (1997). “ Acoustic interactions of the voice source with the lower vocal tract,” J. Acoust. Soc. Am. 101, 2234–2243. 10.1121/1.418246 [DOI] [PubMed] [Google Scholar]
Zanartu, M., Mongeau, L., and Wodicka, G. R. (2007). “ Influence of acoustic loading on an effective single mass model of the vocal folds,” J. Acoust. Soc. Am. 121, 1119–1129. 10.1121/1.2409491 [DOI] [PubMed] [Google Scholar]
Zhang, C., Zhao, W., Frankel, S. H., and Mongeau, L. (2002). “ Computational aeroacoustics of phonation. Part II: Effects of flow parameters and ventricular folds,” J. Acoust. Soc. Am. 112, 2147–2154. 10.1121/1.1506694 [DOI] [PubMed] [Google Scholar]
Zhao, W., Zhang, C., Frankel, S. H., and Mongeau, L. (2002). “ Computational aeroacoustics of phonation. Part I: Computational methods and sound generation mechanisms,” J. Acoust. Soc. Am. 112, 2134–2146. 10.1121/1.1506693 [DOI] [PubMed] [Google Scholar]
Zheng, X., Mittal, R., Xue, Q., and Bielamowicz, S. (2011). “ Direct-numerical simulation of the glottal jet and vocal-fold dynamics in a three-dimensional laryngeal model,” J. Acoust. Soc. Am. Volume 130, 404–415. 10.1121/1.3592216 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c1] Ananthapadmanabha, T. V., and Fant, G. (1982). “ Calculation of the true glottal flow and its components,” Speech Comm. 1, 167–184. 10.1016/0167-6393(82)90015-2 [DOI] [Google Scholar]

[c2] de Vries, M. P., Hamburg, M. C., Schutte, H. K., Verkerke, G. J., and Veldman, A. E. P. (2003). “ Numerical simulation of self-sustained oscillation of a voice-producing element based on Navier-Stokes equations and the finite element method,” J. Acoust. Soc. Am. 113, 2077–2083. 10.1121/1.1560163 [DOI] [PubMed] [Google Scholar]

[c3] Fant, G. (1960). Acoustic Theory of Speech Production (Mouton, The Hague: ), Sec. A2. [Google Scholar]

[c4] Fant, G. (1986). “ Glottal flow: Models and interaction,” J. Phonetics 14, 393–399. [Google Scholar]

[c5] Howe, M. S. (1976). “ The influence of vortex shedding on the generation of sound by convected turbulence,” J. Fluid Mech. 76, 711–740. 10.1017/S0022112076000864 [DOI] [Google Scholar]

[c6] Howe, M. S. (2002). Theory of Vortex Sound (Cambridge University Press, Cambridge: ), Secs. 4.4.2 and 6.3, p. 80 and Chap. 3. [Google Scholar]

[c7] Howe, M. S., and McGowan, R. S. (2010). “ On the single-mass model of the vocal folds,” Fluid Dyn. Res. 42, 015001. 10.1088/0169-5983/42/1/015001 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c8] Howe, M. S., and McGowan, R. S. (2011). “ Production of sound by unsteady throttling of flow into a resonant cavity, with application to voiced speech,” J. Fluid Mech. 672, 428–450. 10.1017/S0022112010006117 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c9] Howe, M. S., and McGowan, R. S. (2012). “ On the role of glottis-interior sources in the production of voiced sound,” J. Acoust. Soc. Am. 131, 1391–1400. 10.1121/1.3672655 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c10] Ishizaka, K., Matsudaira, M., and Kaneko, T. (1976). “ Input acoustic-impedance measurement of the subglottal system,” J. Acoust. Soc. Am. 60, 190–197. 10.1121/1.381064 [DOI] [PubMed] [Google Scholar]

[c11] Lighthill, J. (1978). Waves in Fluids (Cambridge University Press, Cambridge: ), p. 119. [Google Scholar]

[c12] Link, G., Kaltenbacher, M., Breuer, M., and Doellinger, M. (2009). “ A 2D finite-element scheme for fluid-solid-acoustic interactions and its application to human phonation,” Comput. Methods Appl. Mech. Eng. 198, 3321–3334. 10.1016/j.cma.2009.06.009 [DOI] [Google Scholar]

[c13] Luo, H., Mittal, R., Bielamowize, S., Walsh, R., and Hahn, J. (2008). “ An immersed-boundary method for flow-structure interaction in biological systems with applications to phonation,” J. Comput. Phys. 227, 9303–9332. 10.1016/j.jcp.2008.05.001 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c14] McGowan, R. S., and Howe, M. S. (2010). “ Influence of the ventricular folds on a voice source with specified vocal fold motion,” J. Acoust. Soc. Am. 127, 1519–1527. 10.1121/1.3299200 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c15] McGowan, R. S., and Howe, M. S. (2012). “ Source-tract interaction with prescribed vocal fold motion,” J. Acoust. Soc. Am. 131, 2999–3016. 10.1121/1.3685824 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c16] Murray, P. R., and Howe, M. S. (2012). “ On the thermo-acoustic Fant equation,” J. Sound Vib. 331, 3345–3357. 10.1016/j.jsv.2012.03.014 [DOI] [Google Scholar]

[c17] Olive, J. P., Greenwood, A., and Coleman, J. (1993). Acoustics of American English Speech: A Dynamic Approach (Springer-Verlag, New York: ), pp. 104 and 208. [Google Scholar]

[c18] Park, J. B., and Mongeau, L. (2007). “ Instantaneous orifice discharge coefficient of a physical, driven model of the human larynx,” J. Acoust. Soc. Am. 121, 442–455. 10.1121/1.2401652 [DOI] [PubMed] [Google Scholar]

[c19] Rayleigh, Lord (1945). Theory of Sound (Dover, New York: ), Vol. 2, Sec. 308. [Google Scholar]

[c20] Rosa, M. D. O., Pereira, J. C., Grellet, M., and Alwan, A. (2003). “ A contribution to simulating a three-dimensional larynx model using the finite element method,” J. Acoust. Soc. Am. 114, 2893–2905. 10.1121/1.1619981 [DOI] [PubMed] [Google Scholar]

[c21] Rothenberg, M. (1981). “ Acoustic interaction between the glottal source and the vocal tract,” in Vocal Fold Physiology, edited by Stevens K. N. and Hirano M. (University of Tokyo Press, Tokyo: ), pp. 305–328. [Google Scholar]

[c22] Stevens, K. N. (1998). Acoustic Phonetics (MIT Press, Cambridge, MA: ), pp. 55–152. [Google Scholar]

[c23] Tao, C., Zhang, Y., Hottinger, D. G., and Jiang, J. J. (2007). “ Asymmetric airflow and vibration induced by the Coanda effect in a symmetric model of the vocal fold,” J. Acoust. Soc. Am. 112, 2270–2278. 10.1121/1.2773960 [DOI] [PubMed] [Google Scholar]

[c24] Titze, I. R. (1994). Principles of Voice Production (Prentice Hall, Upper Saddle River, NJ: ), p. 72. [Google Scholar]

[c25] Titze, I. R. (2008). “ Nonlinear source-filter coupling in phonation: Theory,” J. Acoust. Soc. Am. 123, 2733–2749. 10.1121/1.2832337 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c26] Titze, I. R., and Story, B. H. (1997). “ Acoustic interactions of the voice source with the lower vocal tract,” J. Acoust. Soc. Am. 101, 2234–2243. 10.1121/1.418246 [DOI] [PubMed] [Google Scholar]

[c27] Zanartu, M., Mongeau, L., and Wodicka, G. R. (2007). “ Influence of acoustic loading on an effective single mass model of the vocal folds,” J. Acoust. Soc. Am. 121, 1119–1129. 10.1121/1.2409491 [DOI] [PubMed] [Google Scholar]

[c28] Zhang, C., Zhao, W., Frankel, S. H., and Mongeau, L. (2002). “ Computational aeroacoustics of phonation. Part II: Effects of flow parameters and ventricular folds,” J. Acoust. Soc. Am. 112, 2147–2154. 10.1121/1.1506694 [DOI] [PubMed] [Google Scholar]

[c29] Zhao, W., Zhang, C., Frankel, S. H., and Mongeau, L. (2002). “ Computational aeroacoustics of phonation. Part I: Computational methods and sound generation mechanisms,” J. Acoust. Soc. Am. 112, 2134–2146. 10.1121/1.1506693 [DOI] [PubMed] [Google Scholar]

[c30] Zheng, X., Mittal, R., Xue, Q., and Bielamowicz, S. (2011). “ Direct-numerical simulation of the glottal jet and vocal-fold dynamics in a three-dimensional laryngeal model,” J. Acoust. Soc. Am. Volume 130, 404–415. 10.1121/1.3592216 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Voicing produced by a constant velocity lung source

M S Howe

R S McGowan

Abstract

INTRODUCTION

Figure 1.

FORMULATION

Model configuration

Equation for the glottal flow

The vortex force

Figure 2.

The pressures p+ and p−

The Fant equation

Interpretation of p¯_=ρoco2W0(t)

The constant-pressure-driven Fant equation

Other special cases

THE LEVEL I APPROXIMATION

Prescribed glottal motion

The jet contraction ratio

Numerical procedure

NUMERICAL RESULTS

Model vocal tract parameters

Table I.

Table II.

Table III.

Influence of glottal frequency

Figure 3.

Figure 4.

Figure 6.

Figure 5.

Solution of the constant-pressure-driven Fant equation

Table IV.

Influence of the false folds on profile skewing

CONCLUSION

ACKNOWLEDGMENT

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

The pressures p₊ and p₋

Interpretation of $\bar{p}_= ρ_{o} c_{o}^{2} W_{0} (t)$