Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Mar 16.
Published in final edited form as: Ear Hear. 2022 Mar 16;43(5):1447–1455. doi: 10.1097/AUD.0000000000001216

Both-ear Method for the Analysis of Audiometric Data

Chen Chen 1, Ning Zhang 2, Gary C Curhan 2,3,4,5, Sharon G Curhan 3,4,6, Molin Wang 1,2,3,4,6
PMCID: PMC9398921  NIHMSID: NIHMS1777242  PMID: 35302530

Abstract

Objective:

Single-ear hearing measurements, such as better-ear, worse-ear or left/right ear, are often used as outcomes in auditory research, yet measurements in the two ears of the same individual are often strongly but not perfectly correlated. We propose a both-ear method using the Generalized Estimating Equation approach for analysis of correlated binary ear data to evaluate determinants of ear-specific outcomes that includes information from both ears of the same individual.

Design:

We first theoretically evaluated bias in odds ratio (OR) estimates based on worse-ear and better-ear hearing outcomes. A simulation study was conducted to compare the finite sample performances of single-ear and both-ear methods in logistic regression models. As an illustrative example, the single-ear and both-ear methods were applied to estimate the association of Dietary Approaches to Stop Hypertension (DASH) adherence scores with hearing threshold elevation among 3,135 women, aged 48–68 years, in the Nurses’ Health Study II.

Results:

Based on statistical theories, the worse-ear and better-ear methods could bias the OR estimates. The simulation results led to the same conclusion. In addition, the simulation results showed that the both-ear method had satisfactory finite sample performance and was more efficient than the single-ear method. In the illustrative example, the confidence intervals of the estimated ORs for the association of DASH scores and hearing threshold elevation using the both-ear method were narrower, indicating greater precision, than for those obtained using the other methods.

Conclusions:

The worse-ear and better-ear methods may lead to biased estimates, and the left/right ear method typically results in less efficient estimates. In certain settings, the both-ear method using the Generalized Estimating Equation approach for analyses of audiometric data may be preferable to the single-ear methods.

INTRODUCTION

Hearing loss is the fourth largest contributor to disability in the world (World Health Organization 2018), thus identifying potential risk factors for hearing loss is a pressing global health issue (Looi et al. 2015). The gold standard for measurement of hearing sensitivity is pure tone audiometry. The pure-tone audiogram provides information in hearing sensitivities across a selected frequency range and is widely used for investigative research and in the clinical setting. Previous research has commonly defined hearing status based on the pure-tone average (PTA), which is the average of hearing threshold measurements over a pre-defined set of audiometric frequencies for an individual ear. For example, in a study estimating the hearing loss prevalence based on PTA for children in the United States (Niskar et al. 1998), hearing loss was categorized according to average sensitivities in the left and right ear, and by the better (lower PTA) and worse (higher PTA) ear.

Although these single-ear methods are valuable in obtaining estimates of hearing loss prevalence and could be useful in the prediction of communication difficulty, a clinically meaningful outcome, it remains a question whether single-ear methods represent the best practice approach for evaluating exposure-hearing associations. Previous studies have used single-ear measurements as the response variable for exploring relationships between risk factors and hearing status (Cruickshanks et al. 2003; Bainbridge et al. 2008; Zhan et al. 2009; Lee et al. 2020). For example, in a study in the National Health and Nutrition Examination Survey (Bainbridge et al. 2008) that evaluated the cross-sectional relation of diabetes and hearing loss, low/mid frequency was defined as PTA 0.5,1,2 kHz and high frequency was defined as PTA 3,4.6.8 kHz. Hearing loss of mild or greater severity was defined as a PTA>25 dB HL and hearing loss of moderate or greater severity was defined as PTA> 40 dB HL. For each combination of frequency and severity, worse ear was defined as hearing loss in at least one ear, while better ear was defined as hearing loss in both ears. Adjusted odds ratio (OR) estimates were calculated using multivariate logistic regression for the association of diabetes and hearing loss in the worse and the better ear. In a longitudinal study conducted in the Epidemiology of Hearing Loss Study that evaluated risk factors for 5-year incident hearing loss (Cruickshanks et al. 2003), incident hearing loss was defined as PTA0.5,1,2,4 kHz > 25 dB HL in the worse ear at follow-up among those without hearing loss (PTA0.5,1,2,4 kHz 25 dB HL) in both ears at baseline.

In this paper, we investigate the validity and efficiency of using single-ear hearing measurement as the response variable in the evaluation of exposure-hearing associations. We also propose a both-ear method and compare the analysis results obtained using this approach with those obtained using traditional single-ear methods (worse-ear, better-ear, and left/right ear methods).

As an illustrative example, we apply both single-ear and both-ear methods to assess the association of the Dietary Approaches to Stop Hypertension (DASH) diet adherence scores and hearing threshold elevation in the Nurses’ Health Study II (NHS II) Conservation of Hearing Study (CHEARS) (Curhan et al. 2020).

THEORETICAL CONSIDERATIONS AND STATISTICAL METHODS

The single-ear method

Let Yij denote a binary response variable; Yij =1 represents hearing loss or hearing threshold elevation for the jth ear of the ith individual. The single-ear method assumes the following logistic regression model for the single-ear hearing outcome (Cruickshanks et al. 2003; Bainbridge et al. 2008; Curhan et al. 2020).

logitEY~i|xi=logPY~i=1|1,xi1,xiKPY~i=0|1,xi1,xiK=α0+k=1Kαkxik (1)

where Y~i represents the binary hearing measurement in the worse ear of the ith individual for the worse-ear method and that in the better ear for the better-ear method, E and P denote expectation and probability, respectively, and xi = (xi1,…, xiK) represents the exposure of interest and potential confounders.

The worse-ear and better-ear methods only use the hearing measurement on one selected ear when fitting the logistic regression model. If an individual-level exposure (e.g., the DASH score) influences hearing, the hearing measurements in both ears contain information about the exposure effects. A more reasonable model, which takes advantage of both ears’ data, for charactering the exposure-hearing relationship is

logitEYij|xij=logPYij=1|1,xij1,xijKPYij=0|1,xij1,xijK=β0+k=1Kβkxijk. (2)

Here Yij, j = 1,2, is a clustered response variable with cluster size 2 (left and right ears), xij = (xij1,…,xijK) represents the exposure of interest and potential confounders for the jth ear of the ith individual. Potential effect modifiers (e.g., baseline hearing ability for each ear) can be taken into account in Model (2) by including interactions of the exposure and the ear- and/or individual- level effect modifiers. Next, we show that the estimated regression coefficients from Model (1) may be biased if Model (2) is true.

The hearing measurements in the two ears, Yi1 for the left ear and Yi2 for the right ear, can be treated as correlated Bernoulli variables. Denote P(Yi1 = 1) = pL, P(Yi2 = 1 pR, and the correlation coefficient of Yi1 and Yi1 as ρ; for notational simplicity, we have omitted subscript i in these parameters. For the worse-ear method,

PY~i=1=pL+pRρpL1pLpR1pRpLpR. (3)

Similarly, for the better-ear method,

PY~i=1=ρpL1pLpR1pR+pLpR. (4)

See the Supplementary Material Appendix A for a detailed derivation. It follows that, if pL = pR = p, Formula (3) and (4) convert to (5) and (6), respectively;

fortheworse-earmethod,PY~i=1=2ρp+ρ1p2; (5)
forthebetter-earmethod,PY~i=1=ρp+1ρp2. (6)

Formula (36) suggest that, in either the worse-ear or the better-ear method, PY~i=1,which is modeled in (1), is typically not equal to the probabilities, pL or pR, which is modeled in (2), even in the simple case of pL = pR = p. Therefore, the estimated regression coefficients in Model (1) may not converge to the coefficients in Model (2) even if the sample size goes to infinity.

The left or right ear method models PY~i=1=pL or PY~i=1=pR. This method is valid as the estimated regression coefficients converge to the coefficients in Model (2); however, it is less efficient than a method using both ears since it uses less data than the both-ear method.

Next, we evaluate the magnitude of the bias in ORs assuming there is only one binary exposure in Models (1) and (2) and, within the exposure group, pL = pR = p. Based on Model (2), OR =pe1pe/pe1pe=pe1pepe1pe, where pe=PYij=1|Xi=1andpe=PYij=1|Xi=0, representing the prevalence of the hearing outcome being 1 in the jth ear in the exposed and unexposed groups, respectively, j = 1, 2. However, using the worse-ear method, exponential of the regression coefficient in model (1) represents the following OR:

ORw=pew1pew/pew1pew=2ρpe+ρ1pe212ρpe+ρ1pe22ρpe+ρ1pe212ρpe+ρ1pe2=2ρ+ρ1ORpeORpepe+1ORpepe+1ORpe2ρ+ρ1ORpeORpepe+112ρpe+ρ1pe22ρpe+ρ1pe2, (7)

where pew=PY~i=1|Xi=1 and pew=PY~i=1|Xi=0, and we have used Formula (5) in the second equality. Note that ORW = 1 OR = 1; that is, there is no difference between OR and ORW under the null (i.e., if there is no exposure-hearing association).

For the better-ear method, exponential of the regression coefficient in model (1) corresponds to

ORB=peb1peb/peb1peb=ρpe+1ρpe21ρpe+1ρpe2ρpe+1ρpe21ρpe+1ρpe2=ρ+1ρORpeORpepe+1ORpepe+1ORpeρ+1ρORpeORpepe+11ρpe+1ρpe2ρpe+1ρpe2 (8)

where peb=PY~i=1|Xi=1 and peb=PY~i=1|Xi=0. Similarly, there is no difference between OR and ORB under the null.

Figure 1 shows numerical results for the relative differences between ORW (or ORB) and OR as functions of OR and the between-ear correlation coefficient, ρ, when pe is 0.5, based on formula (7) and (8). Using the worse-ear method as an example, this relative difference is defined as (ORWOR/OR. The relative difference for ORW ranges from −10% to 11.5% when ρ =0.2. When ρ = 0.8, the difference is close to zero. The between-ear correlation coefficient for the hearing outcome, ≥5 dB HL threshold elevation, in the CHEARS data example is approximately 0.3. The absolute value of the relative difference increases when OR moves away from 1. The relative difference of ORB is larger than that in the worse-ear method, especially when the ρ is smaller. For example, when ρ = 0.2 and OR = 2.0, the relative difference of the ORs between the better-ear and both-ear methods is approximately 12.5%; when OR = 0.5, this relative difference is close to −13.75%. Supplementary Figure 1 shows the relative differences when pe = 0.1 where ORW and ORB are less different from the OR than those when pe = 0.50.

Fig. 1.

Fig. 1.

The relative difference comparing the odds ratio using the worse-ear method (upper panel) or the better-ear method (lower panel) with the both-ear method (Pe=0.5).

The both-ear method

We can use the Generalized Estimating Equation (GEE) method (Liang & Zeger 1986) to make inferences about the parameters in Model (2) based on the correlated both-ear data. GEE is a semiparametric method since we do not need to make distributional assumptions on the response variables. The GEE estimator of β = (β1, β2,⋯,βk) in Model (2) is obtained by solving the following equations:

i=1NDi'Vi1Yiμi=0, (10)

where Yi is the vector of the outcome variable formed by Yij, μi is the expected value E(Yi|xi, and Vi is the working variance-covariance matrix, and Di = ∂μi/∂β. In the GEE method, as long as the mean model, E(Yi|xi, is specified correctly, the regression coefficients will be estimated consistently, even if the working variance-covariance matrix is different from cov(Yi). We can use the sandwich estimator to estimate the variance of the regression coefficient estimates (Fitzmaurice et al. 2011).

Notably, the single-ear method has correlated data if each participant has multiple hearing threshold measures at multiple frequencies or frequency categories, which is typical for audiometric data. When there are threshold measurements at multiple frequencies, for both the both-ear and single-ear methods, data for all the frequencies or frequency categories should be analyzed in the same model and include the exposure-frequency interaction as well as the interactions between frequency and other relevant covariates; GEE method can then be used to estimate the regression coefficients of the regression models, taking into account the within-participant correlation between hearing threshold measurements in both ears and across frequencies.

Using three frequency groups (low, mid, high) as an example, we can create two indicators, x2 and x3, for the frequencies; that is, x2 = 1 for the mid frequencies and 0 otherwise; x3 = 1 for the high frequencies and 0 otherwise. Let x1 denote a continuous exposure variable of interest. The component in the right-hand side of Model (2) for the exposure and exposure-frequency interactions can then be written as β1x1 + β2x1x2+β3x1x3. It follows that the ORs representing the exposure effect per 1 unit increase in x1 for low, mid and high frequencies are exp(β1),exp(β1 + β2),exp(β1 + β3), respectively. If these ORs for different frequencies are estimated in separate analyses, the correlations between these OR estimates would be ignored, leading to incorrect Type I error in global tests such as the test for whether there is an exposure effect in at least one frequency group. This could result in misleading scientific findings.

SIMULATION STUDY

We conducted a simulation study to compare the four methods considered. Correlated binary responses were generated following methods previously described elsewhere (Touloumis 2016). Similar to the CHEARS study, for each individual (e.g., participant), the binary outcome variable had six dimensions, PTA at low-frequency, mid-frequency and high-frequency for both ears. We assumed: (a) the correlation coefficient, ρ, between different frequencies for the same ear was ρ=0.3; (b) for the same frequencies for different ears, ρ=0.6; and (c) for different frequencies for different ears, ρ=0.2. The GEE approach with an exchangeable working correlation matrix was used in the four methods considered to estimate the regression coefficients of the logistic regression models. R 4.0.3 was used for the simulation study and the R package ‘gee’ was used for the GEE analyses. Four criteria were calculated to compare the bias and efficiency of the estimated exposure coefficients (log(OR)) among the methods:

  1. The percent (%) relative bias, defined as 100*(the mean of the estimated over simulation replicates – the true coefficient)/the true coefficient;

  2. The empirical standard deviation of the point estimates (a smaller value represents a more efficient estimate);

  3. The average of the estimated standard errors of the estimated coefficients;

  4. The 95% confidence interval coverage rate (a converge rate closer to 95% indicating a better interval estimates).

We considered the two simulation scenarios, described below.

In the first simulation scenario, we included the following terms in logistic regression model: x1, a continuous variable for the exposure, participant-specific; x2 and x3, the dummy variables for mid- and high-frequency, respectively, participant-specific; interactions x1 × x2 and x1 × x3; x4, the baseline PTA measurements, ear-specific. The true coefficients of these six terms were all set as log(1.5). The number of simulation replicates was 1000, and the sample size (i.e., the number of 6-dimention clusters) was 1000 for each simulation replicate. The simulation results are shown in Table 1 (for 20% event rate) and Supplementary Table 1 (for 50% event rate). In this simulation scenario, we also conducted an additional simulation study for the 20% event rate using the unstructured working correlation matrix in the GEE analyses; the simulation results are shown in Supplementary Table 2.

TABLE 1:

Comparison of Alternative Analytic Methods for the Analysis of Audiometric Data: Simulation Study Scenario 1a

Both-ear Methodb Worse-ear Methodc Better-ear Methodd Left-ear Methode
Relative biasf (%) Empirical standard deviationg Estimated standard errorh Converge ratei (%) Relative bias (%) Empirical standard deviation Estimated standard error Converge rate (%) Relative bias (%) Empirical standard deviation Estimated standard error Converge rate (%) Relative bias (%) Empirical standard deviation Estimated standard error Converge rate (%)
x1 −0.34 0.07 0.07 0.94 4.62 0.08 0.08 0.95 48.91 0.17 0.18 0.76 0.12 0.09 0.09 0.95
x2 −0.73 0.08 0.08 0.95 8.61 0.09 0.09 0.93 47.78 0.23 0.22 0.89 0.37 0.12 0.12 0.96
x3 −0.32 0.08 0.08 0.96 9.18 0.09 0.09 0.94 48.90 0.23 0.22 0.88 −0.11 0.12 0.12 0.94
x1×x2 0.78 0.08 0.08 0.95 14.27 0.10 0.10 0.91 25.90 0.19 0.20 0.90 0.51 0.12 0.12 0.95
x1×x2 0.40 0.08 0.08 0.96 14.48 0.10 0.10 0.91 24.27 0.19 0.19 0.92 0.83 0.12 0.12 0.95
x4 0.30 0.04 0.04 0.95 −1.51 0.05 0.05 0.95 5.16 0.08 0.09 0.92 0.22 0.05 0.05 0.96
a

Simulation Study Scenario with a continuous exposure, a 20% event rate and a sample size of 1000, based on 1000 simulation replicates; using exchangeable working covariance-covariance matrix.

b

Both-ear method: logistic regression with hearing data of both ears at three frequency categories as correlated outcome.

c

Worse-ear method: logistic regression with hearing data of the worse ear at three frequency categories as the outcome.

d

Better-ear method: logistic regression with hearing data of the better ear at three frequency categories as the outcome.

e

Left ear method: logistic regression with hearing data of the left ear at three frequency categories as the outcome; the generalized estimating equation approach was used in all the methods for estimation.

f

Relative Bias is 100 × (the mean of the estimated log (OR) over simulation replicates – the true log (OR))/the true log (OR), where OR stands for odds ratio.

g

Empirical standard deviation (SD) is the empirical SD of the log(OR) estimates from simulation replicates.

h

Estimated standard error (SE) is the average of the sandwich SE over the simulation replications.

i

Coverage Rate is the 95% confidence interval coverage rate based on the sandwich SE.

x1: person-level continuous exposure; x2: dummy variable for mid-frequency; x3: dummy variable for high-frequency; x4 continuous ear-level baseline measurement.

In the second simulation scenario, instead of the continuous exposure evaluated in the first scenario, we examined a four-level categorical exposure. For this categorical exposure, we created the indicator variables x1,1, x1,2, and x1,3. We therefore used the interaction terms: x1,1 × x2, x1,2 × x2, x1,3 × x2, x1,1 × x3, x1,2 × x3 and x1,3 × x3. We set the regression coefficients of these terms similar to the estimated regression coefficients in the both-ear analysis for the CHEARS data (Burton et al., 2006), and the sample sizes in each simulation replicate were 1000 and 5000. The corresponding results are shown in Tables 2 (for 20% event rate and sample size 1000), Supplementary Table 3 (for 50% event rate and sample size 1000), Tables 3 (for 20% event rate and sample size 5000) and Supplementary Table 4 (for 50% event rate and sample size 5000).

TABLE 2:

Comparison of Alternative Analytic Methods for the Analysis of Audiometric Data: Simulation Study Scenario 2 a

Both-ear methodb Worse-ear methodc
Relative biasd (%) Empirical standard deviatione Estimated standard errorf Converge rateg (%) Relative bias (%) Empirical standard Deviation Estimated standard error Converge rate (%)
x1,1 −3.51 0.24 0.25 0.95 −13.90 0.25 0.27 0.94
x1,2 2.51 0.24 0.25 0.95 −1.63 0.27 0.29 0.94
x1,3 5.62 0.23 0.24 0.95 −2.91 0.24 0.25 0.94
x2 1.76 0.20 0.21 0.95 3.93 0.21 0.22 0.93
x3 0.90 0.21 0.22 0.94 8.24 0.23 0.24 0.90
x1,1×x2 −2.04 0.30 0.31 0.94 −11.04 0.32 0.33 0.94
x1,2×x2 1.52 0.27 0.28 0.95 9.28 0.31 0.33 0.94
x1,3×x2 38.73 0.28 0.29 0.94 −1.09 0.30 0.31 0.95
x1,1×x3 23.29 0.30 0.32 0.94 −35.35 0.33 0.34 0.94
x1,2×x3 2.85 0.29 0.31 0.94 25.40 0.34 0.37 0.93
x1,3×x3 −7.50 0.26 0.28 0.94 127.11 0.30 0.31 0.93
x4 −6.67 0.04 0.04 0.95 −17.82 0.04 42.00 0.96
a

Simulation Study Scenario with a 4-level categorical exposure, a 20% event rate and a sample size of 1000, based on 1000 simulation replicates; using exchangeable working covariance-covariance matrix.

b

Both-ear method: logistic regression with hearing data of both ears at three frequency categories as correlated outcome.

c

Worse-ear method: logistic regression with hearing data of the worse ear at three frequency categories as the outcome.

d

Relative Bias is 100 × (the mean of the estimated log (OR) over simulation replicates – the true log (OR))/the true log (OR), where OR stands for odds ratio.

e

Empirical standard deviation (SD) is the empirical SD of the log(OR) estimates from simulation replicates.

f

Estimated standard error (SE) is the average of the sandwich SE over the simulation replications.

g

Coverage Rate is the 95% confidence interval coverage rate based on the sandwich SE.

x1,1, x1,2, and x1,3: indicators for a categorical exposure; x2: dummy variable for mid-frequency; x3: dummy variable for high-frequency; x4 continuous ear-level baseline measurement.

The better-ear and left/right-ear methods have low convergence rates and thus the results are unavailable.

TABLE 3:

Comparison of Alternative Analytic Methods for the Analysis of Audiometric Data: Simulation Study Scenario 2a

Both-ear Methodb Worse-ear Methodc Better-ear Methodd Left-ear Methode
Relative biasf (%) Empirical standard deviationg Estimated standard errorh Converge ratei (%) Relative bias (%) Empirical standard deviation Estimated standard error Converge rate (%) Relative bias (%) Empirical standard deviation Estimated standard error Converge rate (%) Relative bias (%) Empirical standard deviation Estimated standard error Converge rate (%)
x1,1 −2.06 0.11 0.11 0.95 −12.48 0.11 0.11 0.94 −143.41 0.34 0.35 0.84 −1.18 0.15 0.15 0.94
x1,2 0.36 0.11 0.11 0.94 −1.00 0.12 0.12 0.95 30.64 0.35 0.35 0.96 1.45 0.15 0.15 0.95
x1,3 −0.87 0.10 0.11 0.95 6.45 0.11 0.11 0.94 −67.34 0.32 0.32 0.91 0.13 0.15 0.14 0.96
x2 0.28 −0.09 0.09 0.94 2.86 0.09 0.10 0.93 37.71 0.26 0.26 0.71 0.68 0.13 0.13 0.96
x3 −0.06 0.09 0.09 0.94 7.69 0.10 0.10 0.75 42.97 0.25 0.25 0.24 0.11 0.13 0.13 0.96
x1,1×x2 −1.86 0.13 0.14 0.95 −10.85 0.14 0.14 0.94 −129.65 0.38 0.40 0.87 1.86 0.18 0.19 0.95
x1,2×x2 5.32 0.12 0.12 0.94 4.38 0.13 0.14 0.94 64.38 0.38 0.38 0.95 −9.85 0.17 0.18 0.95
x1,3×x2 0.76 0.12 0.13 0.94 −13.68 0.13 0.13 0.94 −118.55 0.36 0.37 0.92 4.18 0.17 0.17 0.95
x1,1×x3 10.00 0.13 0.14 0.94 −39.86 0.14 0.15 0.94 269.03 0.37 0.38 0.94 34.17 0.18 0.19 0.94
x1,2×x3 0.35 0.13 0.13 0.95 34.94 0.15 0.15 0.94 1159.58 0.39 0.38 0.96 0.57 0.18 0.18 0.95
x1,3×x3 1.48 0.12 0.12 0.94 98.149 0.13 0.14 0.93 −111.93 0.34 0.35 0.93 −9.17 0.17 0.17 0.95
x4 −0.43 0.02 0.02 0.96 −4.05 0.02 0.02 0.95 38.25 0.04 0.03 0.96 8.94 0.02 0.02 0.95
a

Simulation Study Scenario with a 4-level categorical exposure, a 20% event rate and a sample size of 5000, based on 1000 simulation replicates; using exchangeable working covariance-covariance matrix.

b

Both-ear method: logistic regression with hearing data of both ears at three frequency categories as correlated outcome.

c

Worse-ear method: logistic regression with hearing data of the worse ear at three frequency categories as the outcome.

d

Better-ear method: logistic regression with hearing data of the better ear at three frequency categories as the outcome.

e

Left ear method: logistic regression with hearing data of the left ear at three frequency categories as the outcome; the generalized estimating equation approach was used in all the methods for estimation.

f

Relative Bias is 100 × (the mean of the estimated log (OR) over simulation replicates – the true log (OR))/the true log (OR) ), where OR stands for odds ratio.

g

Empirical standard deviation (SD) is the empirical SD of the log(OR) estimates from simulation replicates.

h

Estimated standard error (SE) is the average of the sandwich SE over the simulation replications.

i

Coverage Rate is the 95% confidence interval coverage rate based on the sandwich SE.

x1,1, x1,2, and x1,3: indicators for a categorical exposure; x2: dummy variable for mid-frequency; x3: dummy variable for high-frequency; x4 continuous ear-level baseline measurement.

The both-ear method outperformed the single-ear methods in all of the simulation scenarios. In the first simulation scenario, for a 20% event rate (Table 1), the left/right ear method led to estimates with less than 1% relative biases, similar to the both-ear method. However, there was an up to 50% larger empirical standard error than that observed using the both-ear method. Both the worse-ear and better-ear method estimators were more biased and had larger empirical standard deviations of the point estimates; the absolute values of the relative biases ranged from 5% to 49% for the better-ear estimator, and from 2% to 14% for the worse-ear estimators. Similar findings were observed for the 50% event rate (Supplementary Table 1); for the worse-ear method estimator, a higher relative bias that ranged from 0.1% to 29%, was observed. Similar findings were also observed when using the unstructured working correlation matrix in the GEEs (Supplementary Table 2).

In the second set of simulation scenarios, for a 20% event rate scenario with a sample size of 1000 (Table 2), the standard logistic regressions in the better-ear and left/right ear methods did not converge in more than 20% of simulation replicates; thus, the simulation results for these two methods are not available. The both-ear method estimator had less finite sample bias and less empirical standard deviation of the point estimates than the worse-ear method. When the sample size was increased to 5000 (Table 3) we observed similar patterns as those observed in the first simulation scenario. For the 50% event rate scenarios, the patterns of the performance of the four methods were similar to those observed in the first simulation scenario (Supplementary Tables 3 and 4).

CHEARS DATA ANALYSIS

To illustrate the implementation of our proposed methods, we provide an illustrative example based on data obtained in the CHEARS prospective study of dietary patterns and hearing threshold elevation (Curhan et al. 2020). Specifically, our example explores data obtained from an investigation of the dietary adherence scores for the DASH diet and risk of 3-year hearing threshold elevation.

CHEARS examines risk factors for hearing loss among participants in several large ongoing cohorts, including the NHS II, an ongoing cohort study of 116,430 female registered nurses in the United States, aged 25–42 years at enrollment in 1989. In a sub-cohort of these NHS II participants, the CHEARS Audiology Assessment Arm (AAA), longitudinal changes in pure-tone air and bone conduction audiometric hearing thresholds were assessed. The methods for the CHEARS AAA are described elsewhere (Curhan et al. 2020, 2021). Briefly, pure-tone air conduction hearing thresholds were assessed at 19 geographically diverse testing sites, conducted by licensed audiologists using standardized protocols and equipment calibrated data to meet American National Standards Institute standards. A priori, participants who reported excellent or good hearing and no history of otologic disease were invited to participate to examine early threshold changes. In total, 3,749 participants completed baseline testing and 3,135 participants (84%) completed 3-year follow-up testing.

The outcome was a binary variable with elevated hearing threshold, defined as at least 5 dB HL elevation in PTA hearing measurements from baseline to year 3. For each participant, the outcome variable had six dimensions, PTA at low-frequency (0.5,1,2 kHz), mid-frequency (3,4 kHz) and high-frequency (6,8 kHz) for both ears. The exposure of interest was the DASH score in quartiles. We adjusted for baseline PTA measurements at low-frequency (0.5,1,2 kHz), mid-frequency (3,4 kHz) and high-frequency (6,8 kHz) for both ears, and the following potential confounders: age (continuous), race (black/white/multi/other or unknown), body mass index (<25kg/m2/25–29/30–34/35–39/40+), smoking status (never/past/current), history of tinnitus (yes/no), cumulative average energy intake (continuous), noise exposure (very loud occupational or leisure-time noise exposure >=3 hours/week during any decade; yes/no). Exposure-frequency interactions were included in all the models. A GEE with the exchangeable covariance matrix was used for all the methods. In total, there were 6270 ears in the analysis from 3135 participants. Based on the worse-ear method, there were a total of 4,627 hearing threshold elevation events across the low-, mid- and high-frequency PTA categories. There were 1,858 events based on the better-ear methods, 3,228 events based on left-ear method, and 3,257 events based on right-ear method. Supplementary Table 4 presents the baseline characteristics of the participants.

Table 4 presents the estimated ORs and confidence intervals representing the DASH-hearing threshold elevation association which were obtained based on the regression coefficient estimates of DASH and interactions between DASH and frequency level. Notably, the confidence intervals of the estimates using both-ear methods were substantially narrower than those obtained using the other methods. For example, for the mid- and high-frequency PTAs, the widths of the confidence intervals of the estimated coefficients for quartile 4 DASH score from the worse-ear method were 21% and 27% wider, respectively, than those obtained with the both-ear method. For the better-ear method, the width of the confidence intervals obtained were 43% and 68% wider than those obtained with the both-ear method. The point estimates were not substantially different between the both-ear and worse/better-ear methods. The better-ear method typically led to estimates with the widest confidence intervals.

TABLE 4:

The Association between Dietary Approaches to Stop Hypertension (DASH) Score and 3-year Audiometric Hearing Threshold Elevation in the Conservation of Hearing Study (CHEARS) Audiology Assessment Arm, 2012–2018

Dash score (quartile) Both Ear Worse Ear Better Ear Left Ear Right Ear
ORa 95% CI ORa 95% CI ORa 95% CI ORa 95% CI ORa 95% CI
Low-frequency PTA
  Quartile 1 1.00 Reference 1.00 Reference 1.00 Reference 1.00 Reference 1.00 Reference
  Quartile 2 1.06 0.86,1.29 1.08 0.87,1.34 1.04 0.72,1.50 1.09 0.85,1.41 1.03 0.80,1.32
  Quartile 3 0.97 0.79,1.20 0.98 0.79,1.23 1.00 0.69,1.44 1.02 0.79,1.33 0.94 0.73,1.21
  Quartile 4 0.86 0.69,1.06 0.88 0.70,1.11 0.84 0.57,1.23 0.92 0.70,1.21 0.82 0.63,1.07
Mid-frequency PTA
  Quartile 1 1.00 Reference 1.00 Reference 1.00 Reference 1.00 Reference 1.00 Reference
  Quartile 2 1.02 0.86,1.21 1.00 0.82,1.22 1.05 0.83,1.33 1.13 0.92,1.38 0.91 0.74,1.12
  Quartile 3 0.91 0.76,1.08 0.87 0.71,1.07 0.95 0.75,1.22 0.95 0.77,1.17 0.87 0.71,1.08
  Quartile 4 0.78 0.65,0.93 0.79 0.64,0.98 0.71 0.55,0.92 0.82 0.66,1.01 0.75 0.60,0.93
High-frequency PTA
  Quartile 1 1.00 Reference 1.00 Reference 1.00 Reference 1.00 Reference 1.00 Reference
  Quartile 2 0.99 0.84,1.16 0.99 0.80,1.22 1.02 0.82,1.27 1.05 0.86,1.28 0.94 0.77,1.15
  Quartile 3 0.91 0.77,1.07 0.85 0.69,1.05 0.99 0.80,1.24 0.86 0.70,1.05 0.98 0.80,1.20
  Quartile 4 0.88 0.74,1.04 0.86 0.69,1.07 0.91 0.72,1.15 0.79 0.64,0.97 1.01 0.82,1.24
a

Odds ratio based on multivariate logistic regression models adjusted for age (continuous), race (black/white/multi/other or unknown), body mass index(<25kg/m2/25–29/30–34/35–39/40+), smoking status (never/past/current), history of tinnitus (yes/no), cumulative average energy intake (continuous), noise exposure (very loud occupational or leisure-time noise exposure >=3 hours/week during any decade; yes/no), and the ear and frequency-level baseline pure tone average (PTA) measurements at low-frequency PTA (0.5,1,2 kHz), mid-frequency PTA (3,4 kHz) and high-frequency PTA (6,8 kHz) for both ears. Exposure-frequency interactions were also included in all the models.

DISCUSSION

In this paper, we propose a both-ear method for analysis of correlated ear data to evaluate the associations of exposures with hearing outcomes. The traditional single-ear methods, such as the worse-ear and better-ear methods, may lead to biased and inefficient association estimates. The proposed both-ear method, which uses all of the information in the data from both ears, while taking into account within-participant correlation in the analysis, leads to asymptotically unbiased association estimates.

We evaluated the existing single-ear methodologic approaches for analyzing audiometric data, the worse-ear, better-ear and left/right ear methods, and compared these approaches with the proposed both-ear method. First, based on theoretic statistical inferences, we revealed that the bias in the OR estimates based on the worse-ear or better-ear method increased when the within-participant between-ear correlation coefficient for the hearing measurements is smaller and the true effect is stronger. The results from the both-ear method can be similar to the single ear methods when the within-participant between-ear correlation coefficient is large, and the true effect is small. Second, the results of our simulation study suggest that the both-ear method outperforms the three conventional single-ear analytic methods in finite sample bias and in efficiency. The left/right-ear method has better performance with respect to bias, but has lower efficiency than the both-ear method. In the analysis results of the CHEARS data, the robust standard errors using both-ear method are smaller than those using the other methods. This is consistent with the findings in the simulation study.

The GEE approach has been used to estimate population-average effects while taking into account the dependency due to multiple measures over time in longitudinal audiometric data (Rigters et al., 2018). The generalized linear mixed model has been used to estimate subject-specific effects in audiometric data when the hearing threshold is treated as a continuous outcome (Gebregziabher et al., 2018; Kiely et. al., 2012). In this paper we use the GEE method to take into account the within-participants between-ear and between-frequency correlations. The GEE method does not require a distributional assumption and the GEE-estimators are consistent, even if the covariance of response variables is mis-specified. It is noteworthy that even though this paper is focused on logistic regression, the strategy of using both-ear data, with the GEE for making inferences, would apply to other commonly used modeling approaches, including linear regression for continuous outcomes and polytomous logistic regression for more than two-level categorical outcomes.

In the situation where a threshold value is missing, this is because a threshold could not be confidently determined by the audiologist at that frequency. In the CHEARS data, this occurred in a very small number (0.5%) of individuals. Typically, this would be for a single threshold and in only one ear and missing completely at random is a reasonable assumption. Therefore, for those participants with incomplete data we suggest including their partial data that are available in the analysis. In the situations where the missingness is likely to be missing at random, the inverse probability weighting method could be used to handle missingness in the GEE method (Robins et al., 1995).

Although the single-ear methods can be useful in predicting the clinically meaningful outcomes such as communication difficulty, for research investigations that aim to evaluate factors that may influence changes in hearing thresholds, the both-ear methods may be preferable due to the gains in accuracy and efficiency afforded by using information from both ears. The both-ear method could thereby improve the ability to detect meaningful exposure-hearing threshold associations, identify potentially modifiable risk factors for hearing loss, and detect threshold changes at earlier stage when it may be more possible to prevent progression. For example, the both-ear method would be useful in studies for which there is no a priori hypothesis that the exposure(s) of interest differentially influence threshold change in one ear versus the other, such as systemic conditions, dietary intake, physical activity, obesity, or exposure to air pollution. Moreover, in the setting of an a priori hypothesis that the exposure of interest may differentially influence threshold change in one ear versus the other, an interaction between the exposure (e.g., how often a gun is discharged for recreational or occupational reasons) and ear side (e.g., the ear side of discharging a gun), as well as exposure-frequency-ear side interaction if applicable, may be included in the model in addition to the exposure and potential confounders. In this setting, the proposed both-ear analytic approach based on GEE would still be useful for obtaining reliable and precise estimates of the associations observed for each ear.

CONCLUSION

The worse-ear and better-ear methods may lead to biased estimates, and the left/right ear method typically results in less efficient estimates. In certain settings, the both-ear method using the GEE approach for analyses of audiometric data may be preferable to the single-ear methods. Particularly when the research studies are designed to evaluate exposure-hearing threshold relationships, the proposed both-ear analytic approach could be used as the standard method for obtaining reliable and precise estimates of exposure-hearing loss associations.

Supplementary Material

Supplemental Data File (.doc, .tif, pdf, etc.)_1
Supplemental Data File (.doc, .tif, pdf, etc.)_2

Supplementary Fig. 1. The relative difference (RD) comparing the odds ratio using the worse-ear method (upper panel) or the better-ear method (lower panel) with the both-ear method (Pe=0.1).

Supplemental Data File

Financial disclosures/conflicts of interest:

This work was partially supported by the National Institute Health grants R01 DC017717, U01 CA176726 (NHS II) and U01 HL145386 (NHS II).

Footnotes

SGC serves as a consultant to Decibel Therapeutics. GCC serves as a consultant to Decibel Therapeutics, AstraZeneca, Shire, and Allena Pharmaceuticals; receives royalties from UpToDate for being an author and Section Editor; and is an employee of OM1, Inc. The other authors have no disclosures.

REFERENCES

  1. Bainbridge KE, Hoffman HJ, & Cowie CC (2008). Diabetes and hearing impairment in the United States: Audiometric evidence from the national health and nutrition examination surveys, 1999–2004. Annals of Internal Medicine, 49, 1–10. 10.7326/0003-4819-149-1-200807010-00231 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Burton A, Altman DG, Royston P, & Holder RL (2006). The design of simulation studies in medical statistics. Statistics in Medicine, 25(24), 4279–4292. 10.1002/sim/2673 [DOI] [PubMed] [Google Scholar]
  3. Cruickshanks KJ, Tweed TS, Wiley TL, Klein BEK, Klein R, Chappell R, Nondahl DM, & Dalton DS (2003). The 5-year incidence and progression of hearing loss: The epidemiology of hearing loss study. Archives of Otolaryngology-Head & Neck Surgery, 129, 1041–1046. 10.1001/archotol.129.10.1041 [DOI] [PubMed] [Google Scholar]
  4. Curhan SG, Halpin C, Wang M, Eavey RD, & Curhan GC (2020). Prospective study of dietary patterns and hearing threshold elevation. American Journal of Epidemiology, 189, 204–214. 10.1093/aje/kwz223 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Curhan SG, Halpin C, Wang M, Eavey RD, & Curhan GC (2021). Tinnitus and 3-year change in audiometric hearing thresholds. Ear and Hearing. 10.1097/AUD.0000000000000990 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Fitzmaurice G, Laird N, & Ware J (2011). Applied Longitudinal Analysis (2nd ed.). John Wiley & Sons. [Google Scholar]
  7. Gebregziabher M, Eckert M, Matthews L, Teklehaimanot AA, Dubno JR (2018). Join modeling of multivariate hearing thresholds measured longitudinally at multiple frequencies. Commun Stat Theory Methods, 47(22), 5418–5434. 10.1080/03610926.2017.1395045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Kiely KM, Gopinath B, Mitchell P, Luszcs M, Anstey KJ (2012). Cognitive, health, and sociodemographic predictors of longitudinal decline in hearing acuity among older adults. J Gerontol A Biol Sci Med Sci, 67(9), 997–1003. 10.1093/gerona/gls066 [DOI] [PubMed] [Google Scholar]
  9. Lee SA, Kang HT, Lee YJ, & Kim BG (2020). National representative analysis of unilateral hearing loss and hearing aid usage in South Korea. PLOS 10.1371/journal.pone.0232112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Liang KY, & Zeger SL (1986). Longitudinal data analysis using generalized linear models. Biometrika, 73, 13–22. 10.1093/biomet/73.1.13 [DOI] [Google Scholar]
  11. Looi L, Granten D, McGrath PF, Gross M, & Griffin GE (2015). Hearing loss: A global health issue. The Lancet, 385(9972), 943–944. 10.1016/S0140-6736(15)60208-2 [DOI] [PubMed] [Google Scholar]
  12. Niskar AS, Kieszak SM, Holmes A, Esteban E, Rubin C, & Brody DJ (1998). Prevalence of hearing loss among children 6 to 19 years of age: The third national health and nutrition examination survey. The Journal of the American Medical Association, 279, 1071–1075. 10.1001/jama.279.14.1071 [DOI] [PubMed] [Google Scholar]
  13. Robins JM, Rotnitzky A, & Zhao LP (1995). Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association, 90, 106–121. 10.1080/01621459.1995.10476494 [DOI] [Google Scholar]
  14. Rigters SC, Schroeff MP, Papageorgiou G, Baatenburg de Jong RJ, & Goedegebure A (2018). Progression of hearing loss in the aging population: repeated auditory measurements in the Rotterdam study. Audiol Neurootol, 23(5), 290–297. 10.1159/000492203 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Touloumis A (2016). Simulating correlated binary and multinomial responses under marginal model specification: The SimCorMultRes package. The R Journal, 8, 79–91. [Google Scholar]
  16. World Health Organization. (2018). Addressing the rising prevalence of hearing loss. https://apps.who.int/iris/handle/10665/260336
  17. Zhan W, Cruickshanks KJ, Klein BE, Klein R, Huang G, Pankow JS, Gangon RE, & Tweed TS (2009). Generational differences in the prevalence of hearing impairment in older adults. American Journal of Epidemiology, 171, 260–266. 10.1093/aje/kwp370 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data File (.doc, .tif, pdf, etc.)_1
Supplemental Data File (.doc, .tif, pdf, etc.)_2

Supplementary Fig. 1. The relative difference (RD) comparing the odds ratio using the worse-ear method (upper panel) or the better-ear method (lower panel) with the both-ear method (Pe=0.1).

Supplemental Data File

RESOURCES