Abstract
Cardioid microphones/hydrophones are highly directional acoustical sensors, which enjoy easy availability via numerous commercial vendors for professional use. Collocating three such cardioids in orthogonal orientation to each other, the resulting triad would be sharply directional yet physically compact, while decoupling the incident signal's time-frequency dimensions from its azimuth-elevation directional dimensions, thereby simplifying signal-processing computations. This paper studies such a cardioid triad's azimuth-elevation direction-of-arrival estimation accuracy, which is characterized here by the hybrid Cramér-Rao bound. This analysis allows the cardioidicity index (α) to be stochastically uncertain, applies to any cardioidic order (k), and is valid for any real-valued incident signal regardless of the signal's time-frequency structure.
I. INTRODUCTION
A. The high directionality of a cardioid sensor
The “cardioid” sensor's name stems from its heart-shaped gain response of , where denotes the angle measured with respect to the cardioid sensor's axis, i.e., the straight line joining the 0° and the 180° on each of the polar plots in Fig. 1. (Please also see Chap. 5 of Ref. 1.) Such a cardioid sensor's heart-shaped gain pattern is largely unidirectional with one dominant front lobe, unlike a “figure-8” sensor's gain pattern with a back lobe equal in height to its front lobe.
FIG. 1.
(Color online) The cardioid's directivity pattern at order k = 1 and various cardioidicity α.
The “cardioidicity index,” , controls the cardioid's directivity.1 Strictly speaking, the cardioidicity index depends on the wavelength of the incident signal. Hence, should the incident signal's frequency be inexactly known or time-varying, the cardioidicity index would likewise be uncertain.
The order k specifies the power to which the cosine term is raised in the gain response of the cardioid. First-order cardioids date back to at least 1957.2 Regarding second-order cardioids, please see Ref. 3. On third-order cardioids, please refer to Ref. 4. These higher-order cardioids are often realized by computing the spatial finite differences of data collected by nearby isotropic sensors.5
Cardioid microphones/hydrophones are among the most practical acoustical sensors in wide professional use. For introductions, please see the following books: Chaps. 5, 6, and 11–21 of Ref. 1; Chaps. 8.3–8.5 of Ref. 2; Chaps. 2.5 and 3.4 of Ref. 6; Chaps. 4.5.6 and 4.5.7 of Ref. 7; Chaps. 3.5–3.7, 5.1–5.5, and 5.8 of Ref. 8; and pp. 111–114, 134–135, 342, and 484 of Ref. 9.
Dating back to at least the 1930s,2,10,11 cardioid microphones are also commercially available from various companies, including
-
(i)
Models 414, C519M, and SE300B, from AKG Acoustics Company (Vienna, Austria);
-
(ii)
Models 42, 2020, 4033, 4050, from Audio-Technica Corporation (Tokyo, Japan);
-
(iii)
Model B-2 PRO from Behringer Company (Willich, North Rhine-Westphalia, Germany);
-
(iv)
Model GXL1200BP from CAD Audio Company (Solon, OH, USA);
-
(v)
Model Stealthy Cardioid from Core Sound LLC (Teaneck, NJ, USA);
-
(vi)
d:screet mini 4080 from DPA Microphones Company (Alleroed, Denmark);
-
(vii)
Model MXL 770 from Marshall Electronics Inc. (Torrance, CA, USA);
-
(viii)
Model NT4 from Røde Microphones LLC (Silverwater, New South Wales, Australia);
-
(ix)
Models Evolution 914 and 935 from Sennheiser Company (Wedemark, Lower Saxony, Germany);
-
(x)
Models BETA 98A and SM58 from Shure Inc., (Niles, IL, USA); and
-
(xi)
Model MKV Microphone from SoundField Ltd. (Silverwater, New South Wales, Australia).
B. The advantages of placing cardioid sensors in orthogonal orientation and in spatial collocation
Consider three kth-order cardioid sensors all collocating at the Cartesian origin, but oriented along the x, y, and z axes. Please see Fig. 2. Such a collocated triad's physical compactness provides the advantage of deployment versatility and easy mobility.
FIG. 2.
(Color online) A triad of directional sensors, orthogonally oriented and collocated as one compact unit at the spherical coordinates' origin.
Upon this triad, consider a unit-power signal incident from a polar angle of measured from the positive z axis, and an azimuth angle of measured from the positive x axis. The triad's response may be characterized by a 3 × 1 array manifold12 of
(1) |
At k = 1 and α = 0, the cardioid triad degenerates to the much studied tri-axial velocity-sensor.13–18
The above array manifold is independent of frequency. That is, the spatial collocation uncouples the incident signal's time-frequency dimensions from the azimuth-elevation directional dimensions. This decoupling is most consequential for signal-processing computations. Consider this sample grid of , where refer to the grid size along the time-frequency dimensions, in azimuth, in elevation, respectively. If the three coupled domains become uncoupled, the product reduces by orders-of-magnitude to only a sum of .
This idea (of collocating diversely oriented cardioids of arbitrary order and arbitrary cardioidicity index) seems to be new to the open literature on direction finding to the present authors' best knowledge. A triad of collocating/perpendicular first-order cardioids has been studied in Refs. 12 and 19 for data-independent beamforming but not for direction finding. This three-dimensionally orthogonal triad differs from the two-dimensional triplet of Refs. 20 and 21 where three first-order standard cardioids (i.e., k = 1 and α = 0.5) all lie on a flat plane, 120° apart of each other in orientation. Reference 22 collocates four first-order standard cardioids all on a flat plane, 90° apart of each other in orientation. Reference 22 also collocates six first-order standard cardioids with two pointing along the ±x axes, two along the ±y axes, and two along the ±z axes.
To ease subsequent discussion, re-express Eq. (1) in terms of the Cartesian direction cosines , , , as
(2) |
Note that .
II. THE DATA MODEL
A cardioid may be effectively formed from a uniaxial velocity-sensor and a pressure-sensor (or from three pressure-sensors) by computing the finite difference among their data. Please see Refs. 23–32 for details. However, the above implementation can be imprecise in the real world. All practical sensors are imperfect.
Hence, the cardioids' α is modeled here as stochastic and normally distributed with a mean of and a standard deviation of . This presumption helps to render the probability of to be negligible; this presumption would be reasonable for any well-built sensor for practical use. Deterministically unknown but to be estimated are θ and ϕ.
To avert extraneous distraction from the present work's focus on the uncertainty in α in the cardioids, a simple statistical model is used below. More complex scenarios can be handled in analogy to the analysis below.
Incident upon a cardioid triad is a real-valued signal of any time-frequency signal structure. Here, m denotes the discrete-time index. At the mth time-instant, the following 3 × 1 data are measured by the triad:
(3) |
Here, n(m) denotes the additive noise's 3 × 1 vector, whose elements are each modeled as real-valued Gaussian, with a mean of zero and a variance of . This n(m) is modeled also as statistically uncorrelated over time (i.e., over m) and across the three cardioids, hence, . In the above, 0J denotes a J × 1 zero vector, whereas IJ represents a J × J identity matrix.
All M number of discrete-time samples may be collected into a 3 M × 1 vector of
(4) |
The conditional data vector has a mean of
(5) |
and a covariance matrix of
(6) |
where
III. HYBRID CRAMÉR-RAO BOUND DERIVATION FOR CARDIOIDS OF ANY ORDER k
Collect all three unknown scalar parameters into a vector . From Eqs. (8.49), (8.59), and (8.60) of Ref. 33, the “hybrid Fisher information matrix” (HFIM) equals
(7) |
where refers to an I × J matrix of all zero entries.
The (i,j)th element of is given by
(8) |
where Tr denotes the trace of the matrix inside the curly brackets, and
(9) |
(10) |
(11) |
Substitute Eqs. (9)–(11) in Eq. (8)
(12) |
(13) |
(14) |
(15) |
(16) |
(17) |
Substitute Eqs. (12)–(17) in Eq. (7)
(18) |
(19) |
(20) |
(21) |
(22) |
(23) |
The hybrid Cramér-Rao bounds for a triad of kth-order cardioids equal
(24) |
(25) |
(26) |
Of special practical interest is the first-order cardioid. This specific case will be investigated in great detail in Sec. IV.
IV. HYBRID CRAMÉR-RAO BOUND DERIVATION FOR FIRST-ORDER CARDIOIDS
Most practical cardioid sensors are of the first order, i.e., k = 1. In this case, Eqs. (18)–(23) may be simplified to give the following expressions for the elements of the HFIM:
From the above, Eqs. (24)–(26) become
(27) |
(28) |
(29) |
where
These hybrid Cramér-Rao bounds are expressed implicitly in terms of inside ; hence, these bounds apply to any real-valued waveform, whether wideband or narrowband, whether time-varying or time-invariant, whether stationary or non-stationary, whether zero-mean or otherwise—thereby offering high flexibility and thus wide applicability.
V. HYBRID CRAMÉR-RAO BOUND CHARACTERISTICS
For first-order cardioids, HCRB(θ), HCRB(ϕ), , each has exactly only four degrees of freedom—, θ, and ϕ—even though the measurement data model has more degrees of freedom than four. A larger σα would increase both and , but a larger would increase only but not .
These three hybrid Cramér-Rao bounds are plotted in Fig. 3 versus and versus at .
FIG. 3.
(Color online) The first-order cardioid-triad's HCRBs at various values of and various values of .
These three hybrid Cramér-Rao bounds are also plotted in Figs. 3–6, at various and . Each figure has a 3 × 3 “matrix” of sub-figures with increasing rightward in this 3 × 3 matrix of sub-figures, and increasing downward.
FIG. 4.
(Color online) The first-order cardioid-triad's at different values of and . Refer to Eq. (27).
FIG. 5.
(Color online) The first-order cardioid-triad's HCRB(ϕ) at different values of and . Refer to Eq. (28).
FIG. 6.
(Color online) The first-order cardioid-triad's at different values of and . Refer to Eq. (29).
Qualitative observations on Figs. 4 and 5 are:
-
(a)
Each HCRB in Fig. 3 increases monotonically with a decreasing and an increasing .
-
(b)
As increases or as decreases, both HCRB(θ) and HCRB(ϕ) and will all increase for any constant (θ,ϕ). This is largely due to the increasing vertical displacement.
-
(c)
HCRB(θ) and HCRB(ϕ) vary much less with (θ,ϕ) than with or .
Figure 5 has a shape dominated by the factor in the denominator. This is physically due to little acoustical energy projected onto the x-y plane on which ϕ is defined.
-
(d)
(e) as . This is because Eq. (28) has a factor of in its denominator.
For , Eqs. (27) and (28) reduce to
(30) |
(31) |
Substituting for α, one obtains Eqs. (30) and (31), which are the Cramér-Rao bounds for a deterministic . The expressions suggest that a smaller cardioidicity index could increase (i.e., could worsen) an unbiased estimator's error variance. The worst cardioidicity index is , corresponding to the customary tri-axial velocity-sensor mentioned in Sec. I.
VI. MAXIMUM A POSTERIORI (MAP) ESTIMATION OF THE AZIMUTH-ELEVATION DIRECTION-OF-ARRIVAL
The probability density function of the observation z, given , equals
(32) |
whereas the probability density function of α is
Hence, for given the observation z, the posterior probability density function equals
The MAP estimate () of given z by definition equals
(33) |
The above minimization in Eq. (33) may be solved iteratively via matlab's built-in function of fminsearch.
Figures 7 and 8, respectively, for order k = 1 and k = 2, show that this MAP estimator and the earlier derived HCRBs indeed approach each other. Here in Figs. 7 and 8, , , , , and there exist P = 1000 Monte Carlo independent trials for each icon. The root-mean-square error (RMSE) is defined as for the polar arrival angle, and for the azimuth arrival angle, with representing the pth Monte Carlo trial's estimate.
FIG. 7.
(Color online) The MAP estimates of Eq. (33) approach HCRB(θ) and HCRB(ϕ), which are derived in Eqs. (27) and (28). Here, the cardioid order equals k = 1.
FIG. 8.
(Color online) The MAP estimates of Eq. (33) approach HCRB(θ) and HCRB(ϕ), which are derived in Eqs. (24) and (25). Here, the cardioid order equals k = 2.
Figures 7 and 8 also reveal that the HCRBs could decrease with increasing randomness in α (i.e., as σα increases). This might surprise some readers. This phenomenon arises from the cardioid triad's array manifold having a Frobenius norm that varies with α (besides varying also with k,θ,ϕ). For example, at order k = 1,
(34) |
This is plotted in Fig. 9 versus and versus . There, is revealed to vary not monotonically with changing α. Rather, as α increases from zero toward unity, would increase, then would decrease, and finally would increase again. This pattern is more pronounced for smaller .
FIG. 9.
(Color online) How varies non-monotonically with α and .
VII. CONCLUSION
This paper advances the idea of placing three cardioid microphones/hydrophones (of any cardioidic order k) in orthogonal orientation but spatial collocation for frequency-independent direction finding—to the best of our knowledge. This paper has analytically derived the corresponding hybrid Cramér-Rao bound for direction finding. This derivation allows for stochastic uncertainty in the cardioidic index α. Quite surprisingly, the hybrid Cramér-Rao bounds are found not to decrease with increasing uncertainty in α, and this is found to be explainable by the array manifold's Frobenius norm being non-monotonically dependent on α. At first order k = 1, (a) the hybrid Cramér-Rao bounds turn out to have only four degrees of freedom: the polar-azimuth direction-of-arrival and , and (b) a larger cardioidicity index could improve an unbiased estimator's error variance.
ACKNOWLEDGMENT
The authors would like to thank Dr. Petr Tichavský for useful discussions.
References
- 1. Eargle J., The Microphone Book, second ed. ( Focal Press, Burlington, MA, 2005). [Google Scholar]
- 2. Olson H. F., Acoustical Engineering ( Van Nostrand Co, Princeton, NJ, 1957). [Google Scholar]
- 3. Woszczyk W., “ A microphone technique applying to the principle of second-order gradient unidirectionality,” J. Audio Eng. Soc. 32(7/8), 507–530 (1984). [Google Scholar]
- 4. Beavers B. R. and Brown R., “ Third-order gradient microphone for speech reception,” J. Audio Eng. Soc. 18(6), 636–640 (1970). [Google Scholar]
- 5. Olenko A. Y. and Wong K. T., “ Noise statistics across the three axes of a tri-axial velocity sensor constructed of pressure sensors,” IEEE Trans. Aerosp. Electron. Syst. 51(2), 843–852 (2015). 10.1109/TAES.2014.140242 [DOI] [Google Scholar]
- 6.Huang Y. and Benesty J. (editors), Audio Signal Processing for Next-Generation Multimedia Communication Systems ( Kluwer Academic Publishers, Boston, MA, 2004). [Google Scholar]
- 7. Sherman C. H. and Butler J. L., Transducers and Arrays for Underwater Sound ( Springer Science and Business Media, New York, 2007). [Google Scholar]
- 8. Tashev I. J., Sound Capture and Processing, Practical Approaches ( Wiley, Chichester, U.K, 2013). [Google Scholar]
- 9. Bai M. R., Ih J.-G., and Benesty J., Acoustic Array Systems Theory, Implementation, and Application ( Wiley, Singapore, 2013). [Google Scholar]
- 10. Glover R. P., “ A Review of cardioid type unidirectional microphones,” J. Acoust. Soc. Am. 11(3), 296–302 (1940). 10.1121/1.1916036 [DOI] [Google Scholar]
- 11. Olson H. F., “ The quest for directional microphones at RCA,” J. Audio Eng. Soc. 28(11), 776–786 (1980). [Google Scholar]
- 12. Wong K. T., Nnonyelu C. J., and Wu Y. I., “ A triad of cardioid sensors in orthogonal orientation and spatial collocation—Its spatial-matched-filter-type beam-pattern,” IEEE Trans. Signal. Proc. 66(4), 865–906 (2018). 10.1109/TSP.2017.2773419 [DOI] [Google Scholar]
- 13. Nehorai A. and Paldi E., “ Acoustic vector-sensor array processing,” IEEE Trans. Signal. Proc. 42(10), 2481–2491 (1994). 10.1109/78.317869 [DOI] [Google Scholar]
- 14. Wong K. T. and Zoltowski M. D., “ Closed-form underwater acoustic direction-finding with arbitrarily spaced vector-hydrophones at unknown locations,” IEEE J. Ocean. Eng. 22(3), 566–575 (1997). 10.1109/48.611148 [DOI] [Google Scholar]
- 15. Tam P. K. and Wong K. T., “ Cramér-Rao bounds for direction finding by an acoustic vector-sensor under non-ideal gain-phase responses, non-collocation, or non-orthogonal orientation,” IEEE Sens. J. 9(8), 969–982 (2009). 10.1109/JSEN.2009.2025825 [DOI] [Google Scholar]
- 16. Wong K. T., “ Acoustic vector-sensor ‘blind’ beamforming and geolocation for FFH-sources,” IEEE Trans. Aerosp. Electron. Syst. 46(1), 444–449 (2010). 10.1109/TAES.2010.5417173 [DOI] [Google Scholar]
- 17. Wu Y. I. and Wong K. T., “ Acoustic near-field source localization by two passive anchor nodes,” IEEE Trans. Aerosp. Electron. Syst. 48(1), 159–169 (2012). 10.1109/TAES.2012.6129627 [DOI] [Google Scholar]
- 18. Wu Y. I., Wong K. T., Lau S.-k., Yuan X., and Tang S.-k., “ A directionally tunable but frequency-invariant beamformer on an acoustic velocity-sensor triad to enhance speech perception,” J. Acoust. Soc. Am. 131(5), 3891–3902 (2012). 10.1121/1.3701991 [DOI] [PubMed] [Google Scholar]
- 19. Nnonyelu C. J., Wu Y. I., and Wong K. T., “ Cardioid microphones/hydrophones in a collocated and orthogonal triad a steerable beamformer with no beam-pointing error,” J. Acoust. Soc. Am. 145(1), 575–588 (2019). 10.1121/1.5087697 [DOI] [PubMed] [Google Scholar]
- 20. Massa F., “ Directional energy receiving systems for use in the automatic indication of the direction of arrival of the received signal,” J. Acoust. Soc. Am. 68(6), 1912–1913 (1980). 10.1121/1.385157 [DOI] [Google Scholar]
- 21. Freiberger K. and Sontachi A., “ Similarity-based sound source localization with a coincident microphone array,” in International Conference on Digital Audio Effects, Paris, France (September 2011), pp. 185–190. [Google Scholar]
- 22. Vryzas N., Dimoulas C. A., and Papanikolaou G. V., “ Embedding sound localization and spatial audio interaction through coincident microphones arrays,” in Audio Mostly Conference on Interaction with Sound, Thessaloniki, Greece (October 7–9, 2015), article 36. [Google Scholar]
- 23. Cox H., “ Super-directivity revisited ,” in IEEE Instrumentation and Measurement Technology Conference (2004), pp. 877–880. [Google Scholar]
- 24. Cox H. and Lai H., “ Performance of line arrays of vector and higher order sensors,” in Forty-First Asilomar Conference on Signals, Systems and Computers (2007), pp. 1231–1236. [Google Scholar]
- 25. Clark J. A., “ High-order angular response beamformer for vector sensors,” J. Sound Vib. 318(3), 417–422 (2008). 10.1016/j.jsv.2008.04.030 [DOI] [Google Scholar]
- 26. Cray B. A. and Nuttal A. H., “ Directivity factors for linear arrays of velocity sensors,” J. Acoust. Soc. Am. 110(1), 324–331 (2001). 10.1121/1.1373706 [DOI] [Google Scholar]
- 27. de Bree H. E., Basten T., and Yntema D., “ A single broad banded 3D beamforming sound probe,” in German Annual Conference on Acoustics (2008). [Google Scholar]
- 28. Derkz R. M. M. and Janse K., “ Theoretical analysis of a first-order azimuth-steerable superdirective microphone array,” IEEE Trans. Audio Speech Lang. Proc. 17(1), 150–162 (2009). 10.1109/TASL.2008.2006583 [DOI] [Google Scholar]
- 29. D'Spain G. L., Luby J. C., Wilson G. R., and Gramann R. A., “ Vector sensors and vector sensor line arrays: Comments on optimal array gain and detection,” J. Acoust. Soc. Am. 120(1), 171–185 (2006). 10.1121/1.2207573 [DOI] [Google Scholar]
- 30. Gur B., “ Particle velocity gradient based acoustic mode beamforming for short linear vector sensor arrays,” J. Acoust. Soc. Am. 135(6), 3463–3473 (2014). 10.1121/1.4876180 [DOI] [PubMed] [Google Scholar]
- 31. Smith K. B. and Leijen A. V. V., “ Steering vector sensor array elements with linear cardioids and nonlinear hippioids,” J. Acoust. Soc. Am. 122(1), 370–377 (2007). 10.1121/1.2722054 [DOI] [PubMed] [Google Scholar]
- 32. Wilson O. B., Wolf S. N., and Ingenito F., “ Measurements of acoustic ambient noise in shallow water due to breaking surf,” J. Acoust. Soc. Am. 78(1), 190–195 (1985). 10.1121/1.392557 [DOI] [Google Scholar]
- 33. Van Trees H. L., Detection, Estimation and Modulation Theory, Part IV: Optimum Array Processing ( Wiley, New York, 2002). [Google Scholar]