Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Nov 20.
Published in final edited form as: Stat Med. 2022 Sep 20;41(26):5335–5348. doi: 10.1002/sim.9572

Analytical methods for correlated data arising from multicenter hearing studies

Yanghui Sheng 1, Ce Yang 2, Sharon Curhan 3,4, Gary Curhan 2,3,4,5, Molin Wang 1,2,3,4
PMCID: PMC9588694  NIHMSID: NIHMS1834186  PMID: 36125070

Abstract

In epidemiological hearing studies, estimating the association between exposures and hearing loss using audiometrically-assessed hearing measurements is challenging due to the complex correlation structure in the clustered data, with clusters formed by the two ears of the same individual and the testing site and audiologist. We propose a linear mixed-effects model to take into account the multilevel correlation structures of the data. Both theoretically and in simulation studies, we compare single-ear linear regression models commonly used in published hearing loss studies with the proposed both-ears linear mixed models properly accounting for the multi-level correlations. Our findings include (1) when there are only participant-level covariates, the worse-ear linear regression models produce unbiased but typically less efficient estimators than the both-ear and average-ear approaches; (2) when there are ear-level confounders, the worse-ear method may lead to biased estimators and the average-ear method produces unbiased but typically less efficient estimators than the both-ear method; (3) the both-ear method may gain efficiency when additionally adjusting for testing sites and audiologists. As an illustrative example, we applied the single-ear and both-ear methods to assess aspirin-hearing association in the Nurses’ Health Study II.

Keywords: audiometry, both ears, clustered data, hearing data analysis, mixed-effects model, multi-level cluster, single-ear method

1 |. INTRODUCTION

Hearing loss is a common and disabling global public health issue associated with social isolation, cognitive decline, depressive symptoms, and lower quality of life.15 To understand the etiology of hearing loss, several epidemiological studies are providing much-needed population data. For example, the Conservation of Hearing Study (CHEARS) is a multicenter prospective study seeking to identify modifiable risk factors for acquired hearing loss. In our illustrative example, the CHEARS study population derives from participants in the ongoing longitudinal cohort Nurses Health Study (NHS) II. The Audiology Assessment Arm (AAA) within the NHS II CHEARS includes 3136 participants who underwent baseline and 3-year follow-up audiometric assessments performed by one of 71 certified audiologists at one of 32 test sites.

Correctly estimating the association between exposures and hearing loss from pure-tone audiometry data, such as the CHEARS-AAA data mentioned above, can be challenging due to the complex correlation structure. For instance, in the longitudinal setting, repeated measurements from multiple time points are correlated. In multicenter studies where subjects were tested at different sites by different audiologists, there are additional layers of intra-site and intra-audiologist correlations. Moreover, measurements from the same participant in the left ear and the right ear are paired. However, current analysis methods not only ignore the potential intra-site and intra-audiologist correlations in the pure-tone audiometry data, they often use the threshold measurements from a single ear such as the worse ear or better ear as the outcome.69 In certain settings, this single-ear approach may result in biased association estimates, potentially resulting in misleading scientific conclusions. Moreover, while most published studies have tested participants’ hearing thresholds at multiple frequencies ranging from 0.25 to 8 kHz, the results are often summarized into a pure-tone average (PTA), which is calculated by averaging hearing threshold levels at a set of specified testing frequencies.1013 This common practice may also result in loss of information available in the individual frequency data.14 Overall, there is a need for reliable and efficient statistical methods that can address the complex correlation structure in hearing data and also fully utilize the measurement for each individual frequency.

There is a large literature on methods for longitudinal and correlated data.1517 The linear mixed effects models18 have become a popular tool for analyzing correlated data because of the flexibility in modeling the within-subject correlation and the availability of reliable and efficient software.19,20 See Pinheiro21 for a review on linear mixed effects models for longitudinal data. Audiologists notice that hearing loss patterns vary across individuals, partially due to different patterns in aging among different organ systems.22 Therefore, the linear mixed effects models may be used to analyze hearing data, with individual variability characterized by subject-specific random effects.22,23 Recently, concerning the commonly-made normality assumptions, Schielzeth et al24 investigated the robustness issues regarding misspecification of the underlying distributions of the residual terms in the mixed effects models and found that model estimates are usually robust to violation of the distributional assumptions. Alternative analysis methods for correlated hearing data include the generalized estimating equations (GEEs)25 that adjust for clusters as fixed effects and uncover the effect of interest from a population-average perspective.13,26 While distributional assumptions are not required, the GEEs may not work well with large number of clusters but small cluster sizes.

In this paper, we present analytical methods that make maximum use of the data for estimating the associations between exposures and the continuous pure-tone audiometric hearing threshold measurements. Following a theoretical bias and efficiency evaluation of the commonly used single-ear methods, we emphasize the importance of the generalization from linear models to linear mixed-effects models in hearing studies and explicitly quantify the resulting bias and efficiency loss when failing to do so. We present a simulation study that compares results from analyses using linear mixed-effects models that account for multi-level correlations, models that do not account for multi-level correlations, and from linear regression methods based on single-ear data and assuming independence between participants. These methods are then illustrated in an analysis of the association between aspirin use and risk of hearing threshold elevation based on the CHEARS-AAA dataset.

2 |. MIXED-EFFECTS MODELS TAKING INTO ACCOUNT MULTI-LEVEL CLUSTERING

For the convenience of presentation, we use the context of hearing threshold data in AAA of CHEARS, where hearing thresholds were measured at two time points, the baseline and three-year follow up, to present our models. We first introduce notations that will be used throughout the paper. Suppose there are N study participants, Na audiologists and Ns test sites in total. For each study participant, there are two audiometric assessments, at the baseline and the end of follow-up (3 years after the baseline assessment). The test sties at which the audiometry assessments were conducted and the audiologist(s) who conducted the assessments for participant i at baseline and follow-up are denoted as {si0, ai0} and {si1, ai1}, respectively. There are 4 levels of clusters: the ears (j{1,2}) are level 1 units, participants (i{1,2,,N}) are level 2 units, audiologists are level 3 units (ai0,ai1{1,,Na}), and test sites are level 4 units (si0,si1{1,2,,Ns}). The ears are nested within subjects. In CHEARS-AAA, some audiologists work in multiple test sites, so audiologists are not nested within test sites. In a study where different sites do not share audiologists, audiologists would be nested within test sites at each time point.

First, assume that the hearing threshold measurements are available under only one frequency. The outcome of interest Yij is the change from baseline to the end of follow-up in the hearing threshold for the ith participant in the jth ear. Let X denote the participant-level exposure of interest (for example, aspirin use), W a participant-level confounder (for example, age at baseline), and Z an ear-level confounder (for example, hearing threshold measured at baseline). Each of X, W, Z can be either a scalar or a column vector. A linear mixed-effects model for Yij, with random effects for multilevel clusters, can be written as

Base model:Yij=β0+β1Xi+β2Wi+β3Zij+bsi0(site0)+bsi1(site1)+bai0(aud0)+bai1(aud1)+bi(ind)+ϵij, (1)

for i=1,,N and j=1,2, where β1, β2, and β3 are row vectors of regression coefficients. In addition, {bsi0(site0),bsi1(site1)} are random site effects for the site where the ith participant was tested at baseline and the end of follow-up, respectively, {bai0(aud0),bai1(aud1)} are the random audiologist effects for the audiologist testing the ith participant at baseline and the end of follow-up, respectively, and bi(ind) is a random participant effect. These random effects and the residual term ϵij are mutually independent and follow zero-mean normal distributions; that is, bsi0(site0)N(0,σ(site0)2), bsi1(site1)N(0,σ(site1)2), bai0(aud0)N(0,σ(aud0)2), bai1(aud1)N(0,σ(aud1)2), bi(ind)N(0,σ(ind)2) and ϵijqN(0,σϵ2). Schielzeth et al24 investigated the robustness issues of the linear mixed-effects models and found that they are relatively insensitive to violations of the distributional assumptions. We refer to this model as the base model.

In the base model above, the outcomes of different participants tested by different audiologists at different sites are independent. We have var(YijXi,Wi,Zij)=σ(aud0)2+σ(aud1)2+σ(site0)2+σ(site1)2+σ(ind)2+σe2, denoted as V. Meanwhile, the random effect terms allow for dependence between measurements from the same participant, audiologist, or site. Takethe random effect for audiologist at baseline, bai0(aud0), as an example: it allows for dependence between measurements tested by the same audiologist at baseline. Thus, the adjusted intra-cluster correlation coefficient (ICC) between the outcomes of two participants tested by the same audiologist at only the baseline is σ(aud0)2/V. It increases to (σ(aud0)2+σ(aud1)2)/V if assuming they share the same audiologists both at baseline and at Year 3. Furthermore, if the testing sites are also the same between these two participants, at Year 3, but not at baseline, then the ICC between the outcomes of these participants is (σ(aud0)2+σ(aud1)2+σ(site1)2)/V. The within-participant between-ear ICC is (σ(aud0)2+σ(aud1)2+σ(site0)2+σ(site1)2+σ(ind)2)/V.

The exposure effects are the same across all the participants in the base model. This assumption can be relaxed by adding a possibly vector-valued random slope to the effects of exposures, allowing the exposure effects to be participant-specific. This leads to

Randomslopemodel:Yij=β0+(β1+b2i(ind))Xi+β2Wi+β3Zij+bsi0(site0)+bsi1(site1)+bai0(aud0)+bai1(aud1)+b1i(ind)+ϵij, (2)

where b2i(ind)N(0,Σslope) is the random slope for exposure at the participant level. The average effect of exposure, when averaged over the population of participants, is β1.

The hearing threshold measurements in CHEARS-AAA are for seven frequencies, 0.5, 1, 2, 3, 4, 6, and 8 kHz. To include data from all the frequencies in the same model, the outcome of interest Yijq is now the change from baseline to the end of follow-up in the hearing threshold for the ith participant in the jth ear at the qth frequency. A flexible linear mixed-effects model for Yijq, with participant-specific covariates and random effects for multilevel clusters that allow frequency-specific fixed and random effects, can be written as

Fullfrequencymodel:Yijq=β0q+β1qXi+β2qWi+β3qZij+bsi0(site0)+bsi1(site1)+bai0(aud0)+bai1(aud1)+bi(ind)+bqsi0(site0.f)+bqsi1(site1.f)+bqai0(aud0.f)+bqai1(aud1.f)+biq(ind.f)+ϵijq,

for i=1,,N, j=1,2, q=1,,Q, where {bqsi0(site0.f),bqsi1(site1.f)} are the random effects for frequency q for the sites at which the ith participant is tested at baseline and the end of follow-up, respectively, {bqai0(aud0.f),bqai1(aud1.f)} are the similarly defined random audiologist effects, and biq(ind.f) is a random participant effect for frequency q. In addition to the zero-mean normality assumptions for the random effect and residual terms in the base model, we have bqsi0(site0.f)N(0,σ(site0.f)2), bqsi1(site1.f)N(0,σ(site1.f)2), bqai0(aud0.f)N(0,σ(aud0.f)2), bqai1(aud1.f)N(0,σ(aud1.f)2), and biq(ind.f)N(0,σ(ind.f)2). In the mixed model, we can allow within-person covariance between bqsim(sitem.f) and bsim(sitem), between bqaim(audm.f) and baim(audm), for m=0,1, and between biq(ind.f) and bi(ind), denoting as cov(sitem), cov(audm) and cov(ind), respectively.

In the full frequency model above, for example, including the random effect for audiologist-frequency interaction, bqai0(aud0.f), allows for the correlations between measurements for the same frequency tested by the same audiologist at baseline to differ from the correlations between measurements for different frequencies tested by the same audiologist at baseline. Denote the adjusted total variance, var(YijqXi,Wi,Zij)=σ(aud0)2+σ(aud1)2+σ(site0)2+σ(stte1))2+σ(ind)2+σ(aud0.f)2+σ(aud1.f)2+σ(site0.f)2+σ(site1.f)2+σ(ind.f)2+cov(site0)+cov(site1)+cov(aud0)+cov(aud1)+cov(ind)+σϵ2, as Vf. The adjusted ICC between the outcomes of two participants tested by the same audiologist at only the baseline is σ(aud0)2/Vf if the outcomes are for different frequencies and (σ(aud0)2+σ(aud0.f)2)/Vf if they are for the same frequency. The within-participant between-ear ICC is (σ(aud0)2+σ(aud1)2+σ(site0)2+σ(site1)2+σ(ind)2)/Vf for two outcomes at different frequencies and (σ(aud0)2+σ(aud1)2+σ(site0)2+σ(site1)2+σ(ind)2+σ(aud1.f)2+σ(site0.f)2+σ(sitel.f)2+σ(ind.f)2+cov(site0)+cov(site1)+cov(aud0)+cov(aud1)+cov(ind))/Vf for outcomes at the same frequency.

After allowing fixed effects to vary over different frequencies, it is often reasonable to assume that the random effects for participants, sites and audiologists do not change across frequencies; in this case we can exclude the random effects for participant/site/audiologist-frequency interactions from the full frequency model, leading to Yijq=β0q+β1qXi+β2qWi+β3qZij+bsi0(site0)+bsi1(site1)+bai0(aud0)+bai1(aud1)+b1i(ind)+ϵijq. We can often further assume that the coefficients of some elements of W and Z do not change over frequency. Suppose that the data consist of results obtained at two testing frequencies, 1 and 6 kHz. Let I(q=2)=1 if test frequency = 6 kHz and I(q=2)=0 otherwise. The full frequency model may be simplified as

Simplefrequencymodel:Yijq=β0+β1Xi+β2Wi+β3I(q=2)+β4I(q=2)×Xi+β5Zij+bsi0(site0)+bsi1(site1)+bai0(aud0)+bai1(aud1)+bi(ind)+ϵij,

where exposure impacts hearing at 1 and 6 kHz differently, which is captured by the interaction term I(q=2)×X. The effect of exposure on hearing threshold is represented by β1 at 1 kHz and (β1 + β4) at 6 kHz.

Sample SAS code for each of the models can be found in Section 1 of the Supplementary Material.

3 |. THEORETICAL EVALUATION OF POTENTIAL BIAS AND EFFICIENCY LOSS IN SINGLE-EAR METHODS

If a participant-level exposure (eg, aspirin intake) influences hearing, the hearing measurements in both ears contain information about the exposure effects. The models described above allow us to make full use of the information from both ears of each participant, and the regression coefficients of the exposures in these both-ears models represent the exposure-outcome relationship. On the other hand, the commonly-used single-ear methods include the worse-ear and better-ear methods. The former uses data from the ear showing the larger change in hearing threshold, that is, Yiw=max(Yi1,Yi2), and the latter uses data from the ear showing the smaller change in hearing threshold, that is, Yib=min(Yi1,Yi2). Another single-ear method is the average-ear method, which used averaged data from both ears, that is, Yia=12(Yi1+Yi2). In this section, we theoretically evaluate the potential bias and efficiency loss of the estimators from the single-ear methods by comparing them to the true parameters defined in the both-ears models.

3.1 |. Models with only participant-level covariates

In the analysis of the single-ear methods, if there are no ear-level covariates, the base model becomes

Single-ear mixed-effects model:Yi=β˜0+β˜1Xi+β˜2Wi+b˜si0(site0)+b˜si1(site1)+b˜ai0(aud0)+b˜ai1(aud1)+ϵ˜i, (3)

where Yi is one of Yiw, Yib, and Yia, depending on which single-ear method we use, and we put a tilde on top of each parameter to distinguish the parameters and random effects from those for the both-ears method in Section 2.

The single-ear methods used in many published hearing loss studies4,10,12,13,27,28 treat data from individuals as independent and do not take into account site- and audiologist-level clusters. Without adjusting for the audiologist and site effects, we have the following standard linear regression model:

Single-ear linear regression model:Yi=β˜0+β˜1Xi+β˜2Wi+ϵ˜i. (4)

We denote the vectors of the estimated intercept and regression coefficients obtained from this single-ear linear regression model analysis as β^w, β^b, and β^a, corresponding to the three ways of constructing the outcome variables described earlier. Assuming normality for the outcome Y, it follows that

E(β^w)=(β0+σ˜ϵ/π,β1,β2),
E(β^b)=(β0σ˜ϵ/π,β1,β2),
E(β^a)=(β0,β1,β2)=β,

where ϵ˜iN(0,σ˜ϵ2),ϕ() is the probability density function for standard normal distribution, and β0, β1 and β2 are the regression parameters in the base model in (1) and the random slope model in (2) when β3 = 0 in both models. Note that, when β3 = 0, the base model and random slope model have the same mean of E(YijXi,Wi) and thus, they have the same fixed-effect parameters. See Section 2.1 of the Supplementary Material for details. Therefore, β^a is an unbiased estimator of β, whereas β^w and β^b are biased in the intercept term but unbiased in the coefficients of exposure X and confounder W. This result also holds for the estimators from the single-ear mixed-effects model in (3), which has the same expectation E(YiXi,Wi) as the single-ear linear regression model in (4).

We next compare the efficiency between the methods when the both-ears model is

Yij=β0+β1X¯i+bi(ind)+ϵij,

where bi(ind)N(0,σb2), ϵijN(0,σϵ2), and the single-ear model is the same except that it does not include the random effects. Here, X¯i represents the set of participant-level covariates, which is possibly a vector; for example, X¯i=(Xi,Wi). It follows that the maximum likelihood estimator (MLE) of β^1 has the following variance,

var(β^1)=(cσe2+σb2)(Ni=1NX¯ii=1NX¯ii=1NX¯iX¯i)1,

where c = 1 for the worse-ear and better-ear methods and c = 0.5 for the both-ears and average-ear methods. See Section 3.1 of the Supplementary Material for details. Therefore, the better-ear and worse-ear methods lead to estimators with the same variance. The average-ear method, on the other hand, results in estimators that are more efficient than the better-ear and worse-ear methods. Furthermore, the average-ear method is equally efficient as the both-ears method.

3.2 |. Models with ear-level confounders

When the both-ears model also includes ear-level confounders (ie, when β30 in the base model and the random slope model), the outcomes for the two ears of the same person can have different means; that is, when Zi1Zi2,

E(Yi1Xi,Wi,Zi1)=β0+β1Xi+β2Wi+β3Zi1,
E(Yi2Xi,Wi,Zi2)=β0+β1Xi+β2Wi+β3Zi2.

For presentational convenience, assume that the ear-level confounder is the baseline threshold. For the worse-earmethod, the baseline threshold of the worse ear, Ziw, is defined as Ziw=Zik, where k=argmaxj(Yij), and the single-ear linear regression model becomes

Worse-ear linear regression model with ear-level covariates:Yiw=β˜0+β˜1Xi+β˜2Wi+β˜3Ziw+ϵ˜i. (5)

We denote the vectors of the estimated fixed-effect intercept and regression coefficients obtained from this single-ear regression analysis as β^w. Adjusting for site and audiologist effects on top of that yields the following mixed-effects model:

Worse-ear mixed-effects model with ear-level covariates:Yiw=β˜0+β˜1Xi+β˜2Wi+β˜3Ziw+b˜si0(site0)+b˜si1(site1)+b˜ai0(aud0)+b˜ai1(aud1)+ϵ˜i. (6)

Similarly, the baseline threshold of the better ear Zib is defined as Zib=Zik, where k=argminj(Yij). The average baseline threshold of the two ears is Zia=12(Zi1+Zi2). The single-ear linear regression and mixed-effects models can then be defined for the better-ear and average-ear methods as in (5–6).

We denote the vectors of estimated regression parameters from the better-ear and average-ear linear regression models as β^b and β^a, respectively. As shown in Section 2.2 of the Supplementary Material, β^W and β^b do not have closed forms for their expectations and typically do not converge to β=(β0,β1,β2,β3). As for the average-ear approach, β^a is an unbiased estimator of β; that is, E(β^a)=β. This is true for both the single-ear mixed-effects model (eg, model 6) and the single-ear linear regression model (eg, model 5).

Despite this, we compare the both-ears method under the model

Yij=β0+β1X¯i+β2Zij+bi(ind)+ϵij,

where X¯i can be a vector as mentioned above, with the average-ear method under the standard linear regression model. It can be shown that the ordinary least square estimator from the average-ear method is less efficient than the MLE from the both-ears method when there is only participant random effect included in the mixed-effects model. See Section 3.2 of the Supplementary Material for details.

Note that unlike our full frequency model and simple frequency model in Section 2, the current single-ear methods typically do not include data from multiple frequencies as a clustered outcome. Instead, the single-ear methods are implemented with respect to the ΔPTA values calculated by averaging the threshold changes among multiple frequencies separately for two ears; that is, ΔPTAij=1Qq=1QYijq, denoted as Yij¯. The outcomes in the worse-ear, better-ear and average-ear methods are based on the data from the ear with the larger ΔPTA, the smaller ΔPTA, or the average of both ears, respectively. In other words, Yiw=max(Yi1¯,Yi2¯), Yib=min(Yi1¯,Yi2¯), and Yia=12(Yi1¯+Yi2¯). In the simulation studies in Section 4, we show that the conventional single-ear methods based on ΔPTA would not be able to estimate exposure effects correctly if exposure-frequency interaction exists.

4 |. SIMULATION STUDIES

We conducted simulation studies to compare among the following methods: (1) the worse-ear method, (2) the average-ear method, (3) the both-ears method including the random participant effect, and (4) the both-ears method including random site, audiologist, and participant effects. The former two methods entail the use of linear regression models and treat data from each individual as independent, thus do not consider any clusters, whereas the last two methods use mixed-effect models that take these correlations into account.

4.1 |. Simulation set-up

We considered a sample size of N = 1000 and N = 400 with 1000 simulation replications. To investigate the flexibility of our proposed models on clusters, we set Na = 5, 40, 100 with Ns = 5, 10, 20, respectively, so that there are one, four, and five audiologists at each test site, respectively. We assumed that participants were randomly assigned to their test sites at baseline and returned to the same test site for their follow-up assessments. For both the baseline and follow-up assessments, participants were randomly assigned to one of the audiologists at their test sites. For i=1,,N, we considered the following covariates

  • A binary exposure Xi; for example, aspirin intake defined as 1 if Ai > cA and 0 otherwise, where Ai is a continuous variable representing the dose of aspirin-intake for participant i and cA is a presepcified cutoff point;

  • A continuous participant-level confounder Wi; for example, age;

  • A continuous ear-level confounder Zi; for example, hearing threshold at baseline Zi=(Zi1,Zi2).

The covariates are generated from the following multivariate normal distribution

(Ai,Wi,Zi1,Zi2)N(μ,V12RV12),

where

μ=(0621010),V=(1000025000064000064),R=(10.20.060.060.210.150.150.060.1510.70.060.150.71).

The mean, variance, and correlation coefficients were specified to mimic the CHEARS-AAA dataset. We set cA = 0.385 so that about 35% of the participants are aspirin-users, consistent with the CHEARS population. We generated outcome data according to Models 1–4 defined below.

Model 1–base model without ear-level covariates::Yij=β0+β1Xi+β2Wi+bsi(site)+bai0(aud0)+bai1(aud1)+bi(ind)+ϵij;
Model 2–random slope model without ear-level covariates::Yij=β0+(β1+b2i(ind))Xi+β2Wi+bsi(site)+bai0(aud0)+bai1(aud1)+b1i(ind)+ϵij;
Model 3–random slope model with ear-level covariates::Yij=β0+(β1+b2i(ind))Xi+β2Wi+β3Zij+bsi(site)+bai0(aud0)+bai1(aud1)+b1i(ind)+ϵij;
Model 4–simple frequency model without ear-level covariates:Yijq=β0+β1Xi+β2Wi+β3I(q=2)+β4I(q=2)Xi+bsi(site)+bai0(aud0)+bai1(aud1)+b1i(ind)+ϵij,

where i=1,,N, j=1,2, and q=1,2. The regression parameters were based on a preliminary analysis of the CHEARS-AAA data. In Models 1–3, the parameters were set to (β0,β1,β2,β3)=(3,0.2,0.1,0.15). In Mode l4, the parameters were set to (β0,β1,β2,β3,β4)=(3,0.2,0.1,2,0.4). The random effects were sampled from independent standard normal distributions and thus, were not related to exposures. The adjusted within-site ICC and within-audiologist ICC were about 0.2 and 0.4, respectively. Finally, concerning the robustness of the mixed-effects models to misspecifications of underlying data distributions, we also considered scenarios where the random effects and residuals were assumed to be normal in analyses, whereas in fact, they were generated from asymmetric standard Gumbel distributions with cumulative distribution function (c.d.f.) F(x)=eex for x(,). See Figure 1 in the Supplementary Material for the plots of the c.d.f. and probability density function (p.d.f.) of this distribution.

4.2 |. Discreteness of the hearing threshold data

In hearing tests, to determine the lowest levels at which sounds are reliably heard by each ear, pure tones were played to participants with 5dB increments. Hence, the hearing thresholds recorded were discrete and multiples of five. Generating the outcomes Yij and baseline threshold Zij for each participant according to Models 1–4, we created the ‘recorded’ hearing threshold of subject i’s jth ear at baseline as Z˜ij=5Zij/5, where x denotes the ceiling function taking as input a real number x and mapping it to the least integer greater than or equal to x. The ‘recorded’ outcome, change in hearing threshold, was then Y˜ij=5(Zij+Yij)/5Z˜ij.

4.3 |. Operating characteristics

We analysed the simulated hearing datasets using SAS PROC MIXED for the mixed-effects models and SAS PROC REG for regular linear regression models. See Section 1 of the Supplementary Material for sample codes. The following operating characteristics (OC) were calculated to compare the finite sample bias and efficiency of the resulting estimators: (1) Relative bias, calculated as (11000r=11000β^(r)β)/β, where β^(r) denotes the estimate for the true parameter β in the rth simulation replicate; (2) empirical standard error (SE), calculated as the sample standard deviation of {β^(1),β^(2),,β^(1000)}; (3) estimated SE, calculated by averaging the model-based SE estimates in the SAS output over 1000 replicates; (4) coverage rate (CR) of 95% confidence interval (CI), calculated as 11000r=11000I(LB(r)βUB(r)), where LB(r) and UB(r) denote the estimated upper bound and lower bound of the 95% CI for β in the rth simulation replicate, respectively.

4.4 |. Results of the simulation studies

When the data-generating mechanism only contains participant-level covariates (Models 1 and 2), all the models including the single-ear methods produced satisfactory point estimates of β1 and β2 in terms of bias and CR of the 95% CI. The model-based SE estimates of the linear models were close to the corresponding sandwich estimators. See Table 1 for the scenario with N = 1000, Na = 40, Ns = 10 and Tables S1-S5 in the Supplementary Material for the other combinations of N, Ns, and Na. However, as showed by the empirical SEs, the worse-ear method was less efficient compared to the both-ears methods and the average-ear method had comparable efficiency to the both-ears method that does not include the random site and audiologist effects. Further including the random site and audiologist, the both-ears methods were about 40% more efficient than the both-ears methods without the random site and audiologist effects. This is not surprising as sites and audiologists were not related to the exposure or age in this simulation; ignoring them does not cause bias in the point estimates but including them leads to smaller residual variance and thus improved efficiency of the regression coefficient estimates. When the residual terms and random effects were misspecified, the relative biases of the estimates from the single-ear methods and the both-ear methods not adjusting for sites and audiologists were lessthan 6% when N = 1000; however, they increased to 10%−13% when N = 400. The both-ears method including random sites and audiologists effects, on the other hand, still maintained satisfactory performance. See Tables S6 and S7 in the Supplementary Material for details. Rounding the hearing outcomes to multiples of 5 gave similar results. See Table S22 in the Supplementary Material for details.

TABLE 1.

Performances of different methods when data is simulated based on Models 1 and 2 (without ear-level covariates)

Parameter Both-ears Aa Both-ears Bb Worse-earc Average-eard
β1 (truth = 0.2) (Model 1) Relative bias 0.87% 0.25% −0.052% 0.25%
Empirical SE 0.0841 0.139 0.143 0.139
Estimated SE 0.0851 0.140 0.143 0.140
CR of 95% CI 95.3% 94.5% 94.9% 94.6%
β2 (truth = 0.1) (Model 1) Relative bias −0.27% −0.14% −0.11% −0.14%
Empirical SE 0.00799 0.0134 0.0138 0.0134
Estimated SE 0.00813 0.0134 0.0136 0.0134
CR of 95% CI 95.0% 96.0% 95.6% 96.0%
β1 (truth = 0.2) (Model 2) Relative bias 1.19% 0.55% 0.23% 0.53%
Empirical SE 0.0985 0.148 0.152 0.149
Estimated SE 0.101 0.150 0.148 0.145
CR of 95% CI 95.6% 94.8% 94.4% 93.9%
β2 (truth = 0.1) (Model 2) Relative bias −0.31% −0.19% −0.13% −0.16%
Empirical SE 0.00872 0.0140 0.0144 0.0141
Estimated SE 0.00876 0.0138 0.0142 0.0139
CR of 95% CI 94.9% 95.6% 94.6% 95.6%

Note: The upper half displays the results when data is generated from Model 1 (the base model) and the lower half displays those when data is generated from Model 2 (the random slope model). Sample size N = 1000, with four audiologists at each of the ten sites.

a

Both-ears method taking into account participants, sites, and audiologists-formed clusters, based on mixed-effects model E(YijXi,Wi,bsi(site),bai0(aud0),bai1(aud1),bi(ind))=β0+β1Xi+β2Wi+bsi(site)+bai0(aud0)+bai1(aud1)+bi(ind) or E(YijXi,Wi,bsi(site),bai0(aud0),bai1(aud1),b1i(ind),b2i(ind))=β0+(β1+b2i(ind))Xi+β2Wi+bsi(site)+bai0(aud0)+bai1(aud1)+b1i(ind).

b

Both-ears method ignoring sites and audiologists, based on mixed-effects model E(YijXi,Wi,bi(ind))=β0+β1Xi+β2Wi+bi(ind) or E(YijXi,Wi,b1i(ind),b2i(ind))=β0+(β1+b2i(ind))Xi+β2Wi+b1i(ind).

c

Worse-ear method ignoring sites and audiologists, based on linear regression model E(YiwXi,Wi)=β0+β1Xi+β2Wi,Yiw=max(Yi1,Yi2).

d

Average-ear method ignoring sites and audiologists, based on linear regression model E(YiaXi,Wi)=β0+β1Xi+β2Wi,Yia=(Yi1+Yi2)/2.

When the data-generating mechanism contains an extra ear-specific confounder (Model 3), omitting the ear-specific covariate (in our case, the baseline threshold) in the worse-ear and average-ear linear regression resulted in parameter estimates with relative biases between −27% and −35%; see Table 2 for the scenario with N = 1000, Na = 40, Ns = 10 and Tables S8-S12 in the Supplementary Material for the other combinations of N, Ns, and Na. We also examined the performances of the single-ear and both-ears methods when the ear-specific covariate was included in the model. The outcome and baseline hearing threshold measurements in the worse-ear approach are those from the worse ear, that is,

E(YiwXi,Wi,Baseline Thresholdiw)=β0+β1Xi+β2Wi+β3Baseline Thresholdiw.

Our theoretical evaluation in Section 3 showed that this worse-ear approach can lead to biased estimates of regression coefficients. Both our theoretical evaluation (Section 2.2 of the Supplementary Material) and simulation studies suggest that the bias could be reduced by using the average baseline threshold as in the following model,

E(YiwXi,Wi,Baseline Thresholdia)=β0+β1Xi+β2Wi+β3Baseline Thresholdia.

See the comparison between the columns Worse-ear A and Worse-ear B in Table 2 and Tables S8-S12 in the Supplementary Material for details. Both the average-ear approach and the both-ears approach yielded satisfactory parameter estimates; their SE estimates were also comparable, and accounting for the random site and audiologist effects in the both-ears method further led to about 35% efficiency gains (first two columns in Table 2 and Tables S8-S12 in the Supplementary Material). Finally, relative biases increased under misspecification of residuals terms and random effects for all methods except the both-ears method including random sites and audiologists effects. See Tables S13 and S14 in the Supplementary Material for details. Similar results were obtained when using the discretized hearing data. See Table S23 in the Supplementary Material for details.

TABLE 2.

Performances of different methods when data is simulated based on Model 3 (random slope model with ear-level covariates)

Parameter Bothears Aa Bothears Bb Worse-ear, misspecifiedc Average-ear, misspecifiedd Worseear Ae Worseear Bf Averageearg
β1 (truth = 0.2)
(Model 3)
Relative bias 1.10% 0.52% −28.00% −28.70% −1.89 % 1.01% 0.37%
Empirical SE 0.0985 0.148 0.173 0.168 0.153 0.155 0.149
Estimated SE 0.101 0.150 0.166 0.163 0.150 0.150 0.146
CR of 95% CI 95.6% 94.6% 92.7% 92.5% 94.8% 94.7% 93.8%
β2 (truth = 0.1) (Model 3) Relative bias −0.33% −0.19% −34.90% −35.00% −2.66% −0.18% −0.19%
Empirical SE 0.00880 0.0141 0.0162 0.0158 0.0147 0.0147 0.0142
Estimated SE 0.00883 0.0139 0.0159 0.0156 0.0145 0.0145 0.0141
CR of 95% CI 95.1% 95.4% 43.0% 41.5% 94.5% 95.2% 95.4%

Note: Sample size N = 1000, with four audiologists at each of the ten sites.

a

Both-ears method taking into account participants, sites, and audiologists-formed clusters and ear-level covariates, based on mixed-effects model E(YijXi,Wi,Zij,bsi(site),bai0(aud0)),bai1(aud1),b1i(ind),b2i(ind))=β0+(β1+b2i(ind))Xi+β2Wi+β3Zij+bsi(site)+bai0(aud0)+bai1(aud1)+b1i(ind).

b

Both-ears method taking into account ear-level covariates but ignoring sites and audiologists, based on mixed-effects model E(YijXi,Wi,Zij,b1i(ind),b2i(ind))=β0+(β1+b2i(ind))Xi+β2Wi+β3Zij+b1i(ind).

c

Worse-ear method ignoring sites, audiologists and ear-level covariates, based on linear regression model E(YiwXi,Wi)=β0+β1Xi+β2Wi,Yiw=max(Yi1,Yi2).

d

Average-ear method ignoring sites, audiologists and ear-level covariates, based on linear regression model E(YiaXi,Wi)=β0+β1Xi+β2Wi,Yia=(Yi1+Yi2)/2.

e

Worse-ear method taking into account ear-level covariates but ignoring sites and audiologists, based on linear regression model E(YiwXi,Wi,Ziw)=β0+β1Xi+β2Wi+β3Ziw,Yiw=max(Yi1,Yi2),Ziw=max(Zi1,Zi2).

f

Worse-ear method ignoring sites and audiologists, using averaged ear-level covariates, based on linear regression model E(YiwXi,Wi,Zia)=β0+β1Xi+β2Wi+β3Zia,Yiw=max(Yi1,Yi2),Zia=(Zi1+Zi2)/2.

g

Average-ear method taking into account ear-level covariates but ignoring sites and audiologists, based on linear regression model E(YiaXi,Wi,Zia)=β0+β1Xi+β2Wi+β3Zia,Yia=(Yi1+Yi2)/2,Zia=(Zi1+Zi2)/2.

We further examined the situation where the pure-tone audiometry data are available at multiple frequencies at which the covariate of interest impacts hearing loss differently (Table 3 for the scenario with N = 1000, Na = 40, Ns = 10 and Tables S15-S19 in the Supplementary Material for the other combinations of N, Ns, and Na.). We simulated datasets using a two-frequency model (Model 4), which can be easily extended to include more frequencies by adding the appropriate indicator variables and interaction terms. The data-generating mechanism assumed that the exposure impacts the hearing threshold at 6 kHz (β1 + β4 = 0.6) more than it does at 1 kHz (β1 = 0.2). The parameter estimates obtained using the worse-ear and average-ear methods based on ΔPTA were close to the average of the effects at 1 and 6 kHz, and therefore, biased when compared to the true parameters. See the last two columns of Table 3 and Tables S15-S19 in the Supplementary Material for details. The both-ears mixed-effects model including random site, audiologist, and participant effects presented in the first column in Table 3 and Tables S15-S19 in the Supplementary Material yielded relative biases as small as −0.34% and the smallest standard error estimates. Omitting the random site and audiologist effects gave valid estimates despite larger standard error estimates as shown in the second column in Table 3 and Tables S15-S19 in the Supplementary Material. Similar results were obtained under misspecification of the residual terms and random effects (Tables S20-S21 in the Supplementary Material). Discretizing the audiometry data also gave similar results (Table S24 in the Supplementary Material).

TABLE 3.

Performances of different methods when data is simulated based on Model 4 (simple frequency model without ear-level covariates)

Parameter Both-ears Aa Both-ears Bb Worse-ear ΔPTAc Average-ear ΔPTAd
β1 (truth = 0.2) (Model 4) Relative bias 0.41% 0.72% 101% 101%
Empirical SE 0.0815 0.136 0.135 0.134
Estimated SE 0.0846 0.140 0.137 0.136
CR of 95% CI 95.8% 95.7% 69.8% 69.2%
β1 + β4 (truth = 0.6) (Model 4) Relative bias −0.0086% 0.096% −33.10% −33.20%
Empirical SE 0.0829 0.139 0.135 0.134
Estimated SE 0.0846 0.140 0.137 0.136
CR of 95% CI 95.7% 95.3% 69.7% 69.2%
β2 (truth = 0.1) (Model 4) Relative bias 0.026% 0.67% 0.71% 0.67%
Empirical SE 0.00735 0.0127 0.0129 0.0127
Estimated SE 0.00742 0.0130 0.0131 0.0130
CR of 95% CI 95.1% 95.9% 95.6% 95.9%

Note: Sample size N = 1000, with four audiologists at each of the ten sites.

a

Both-ears method taking into account participants, sites, and audiologists-formed clusters, based on the PTA mixed-effects model E(YijqXi,Wi,bsi(site),bai0(aud0),bai1(aud1),bi(ind))=β0+β1Xi+Wi+β3I(q=2)+β4I(q=2)×Xi+bsi(site)+bai0(aud0)+bai1(aud1)+bi(ind).

b

Both-ears method ignoring sites and audiologists, based on the PTA mixed-effects model E(YijqXi,Wi,bi(ind))=β0+β1Xi+β2Wi+β3I(q=2)+β4I(q=2)×Xi+bi(ind).

c

Worse-ear method ignoring sites and audiologists, based on the PTA linear regression model E(YiwXi,Wi)=β0+β1Xi+β2Wi,Yiw=max(Yi1,Yi2) in terms of ΔPTA.

d

Average-ear method ignoring sites and audiologists, based on the PTA linear regression model E(YiaXi,Wi)=β0+β1Xi+β2Wi,Yia=(Yi1+Yi2)/2 in terms of ΔPTA.

In summary, if there are no ear-level confounders, the worse-ear method resulted in less efficient parameter estimates with small relative biases and satisfactory coverage rates. The average-ear method was as efficient as the both-ear method. However, if there are ear-level confounders, parameter estimates resulting from the worse-ear approach had larger relative biases. The average-ear method led to valid estimates which was slightly less efficient than the both-ear method. Note that, when the outcome is the change in the hearing threshold from baseline, the ear-level hearing threshold measured at baseline is likely to be confounders and should be adjusted for in the model. In our simulation setting, site and audiologist formed clusters in the outcome but was independent from the exposure. In this setting, ignoring site or audiologist in the model caused efficiency loss but not bias in the point estimates. If sites and audiologists are also related to exposure and age, omitting them in the model may lead to biased point estimates. In addition, when data are available at multiple frequencies with different exposure effects, the conventional methods based on the PTA led to parameter estimates with substantial relative biases, and the both-ears method allowing exposure-frequency interactions resulted valid estimates. Finally, while relative biases tended to rise under misspecification of the residual terms and random effects, the both-ears method including sites and audiologists effects gave satisfactory performance and was consistently better than other methods.

5 |. ILLUSTRATIVE EXAMPLE: ANALYSIS OF THE CHEARS-AAA DATA

The multicenter prospective study CHEARS and the study population are described in the introduction section briefly, with more details including ascertainment of covariates available in Curhan et al29 We evaluated the aspirin-hearing threshold association using both our proposed both-ears mixed-effects models and the single-ear linear regression models that treat data from individuals as independent without taking into account any clusters. The CHEARS-AAA dataset consists of audiometric data at multiple test frequencies. In all models, aspirin exposure is binary indicating whether a participant is ever an aspirin user over the previous two years, whereas age and baseline pure-tone thresholds are continuous variables. Other potential confounders that are not explicitly listed in the regression models include smoking (categorical with never, past, current), body mass index (categorical with <25, 25–29, 30–34, 35–39, >40), physical activity (quintiles), hypertention (binary), diabetes (binary), dietary Approaches to Stop Hypertension (DASH) score (quintiles), total energy intake (continuous), acetaminophen use (categorical with none, 1 day/week, 2–3 days/week, 4–5 days/week, ≥6 days/week), ibuprofen use (categorical with none, 1 day/week, 2–3 days/week, 4–5 days/week, ≥6 days/week), occupational or leisure-time noise exposure (≥3 hours/week during any decade or not), and impulse or gunfire noise exposure (≥3 times/year during any decade or not).

We first analyze the data using the both-ears mixed-effects model

E[YijqXij,bsi0(site0),bsi1(site1),bai0(aud0),bai1(aud1),bi(ind)]=β0+β1Aspirini+β2Agei+β3Baseline Thresholdijq+β4I(freq=0.5kHz)++β9I(freq=6kHz)+β10Aspirini×I(freq=0.5kHz)++β15Aspirini×I(freq=6kHz)++bsi0(site0)+bsi1(site1)+bai0(aud0)+bai1(aud1)+bi(ind),

where Yijq is the 3-year change in the hearing threshold for participant i = 1, …, 3136 in ear j = 1, 2, at frequencies 0.5, 1, 2, 3, 4, 6, and 8 kHz (corresponding to q = 1,· · ·, 7, respectively). The inclusion of the frequency-aspirin and frequency-confounders interactions allows us to estimate the effect of aspirin use on hearing at different frequencies. The results for the high frequencies (6 and 8 kHz) are reported in Table 4, where the P-values are from the Wald test. As shown in Table 4, aspirin exposure is associated with an estimated increase of 0.727 dB (P = 0.0005, 95% CI: [0.317, 1.137]) in the elevation of the hearing threshold at 6 kHz after a 3-year follow-up. The results of the analyses for all the frequencies can be found in the top half of Table S25 in the Supplementary Material.

TABLE 4.

Results of the analyses of the CHEARS-AAA data at high frequencies using two-ear mixed-effects models and ordinary least square linear regression models

Model Frequency Estimate (SE) P-value
Both-ears mixed-effects models with frequency-specific effectsa 6 kHz 0.727 (0.209) 0.0005
8 kHz 0.388 (0.209) 0.0630
Both-ears frequency-specific mixed-effects modelsb 6 kHz 0.685 (0.272) 0.0119
8 kHz 0.399 (0.297) 0.1799
Worse-ear linear regression modelsc 6 kHz 0.533 (0.328) 0.1041
8 kHz 0.406 (0.349) 0.2458
Better-ear linear regression modelsd 6 kHz 0.758 (0.289) 0.0087
8 kHz 0.283 (0.315) 0.3691
Average-ear linear regression modelse 6 kHz 0.630 (0.278) 0.0236
8 kHz 0.323 (0.302) 0.2849
a

Both-ears method taking into account participants, sites, and audiologists-formed clusters, based on mixed-effects model with frequency-specific effects for all the fixed effects covariates E[YijqXij,bsi0(site0),bsi1(site1)bai0(aud0),bai1(aud1),b(ind)]=β0+β1Aspirini+β2Agei+β3Baseline Thresholdijq+β4I(freq=0.5kHz)++β9I(freq=6kHz)+β10Aspirini×I(freq=0.5kHz)++β15Aspirini×I(freq=6kHz)++bsi0(site0)+bsi1(site1)+bai0(aud0)+bai1(aud1)+bi(ind).

b

Both-ears method taking into account participants, sites, and audiologists-formed clusters, based on frequency-specific mixed-effects model E[YijqXij,bsi0(site0.q),bsi1(sitel.q),bai0(aud0.q),bai1(aud1.q),bi(ind.q)]=β0q+β1qAspirini+β2qAgei+β3qBaseline Thresholdijq++bsi0(site0.q)+bsi1(site1.q)+bai0(aud0.q)+bai1(aud1.q)+bi(ind.q)forq=1,,7.

c

Worse-ear method ignoring sites and audiologists, based on the frequency-specific linear regression model E[Yiqw]=β0q+β1qXi+β2qWi+β3qBaseline Thresholdiqw+forq=1,,10.Yiqw=max(Yi1q,Yi2q). Baseline Thresholdiqw=Baseline Thresholdikq,in whichk=argmaxj(Yijq).

d

Better-ear method ignoring sites and audiologists, based on the frequency-specific linear regression model E[Yiqb]=β0q+β1qXi+β2qWi+β3qBaseline Thresholdiqb+forq=1,,10.Yiqb=min(Yi1q,Yi2q). Baseline Thresholdiqb=Baseline Thresholdikq,in whichk=argminj(Yijq).

e

Average-ear method ignoring sites and audiologists, based on the frequency-specific linear regression model E[Yiqa]=β0q+β1qXi+β2qWi+β3qBaseline Thresholdiqa+forq=1,,10.Yiqa=(Yi1q+Yi2q)/2. Baseline Thresholdiqa=(Baseline Thresholdi1q+Baseline Thresholdi2q)/2.

In addition, we fit separate mixed-effects models for each frequency,

E[YijqXij,bsi0(site0.q),bsi1(site1.q),bai0(aud0.q),bai1(aud1.q),bi(ind.q)]=β0q+β1qAspirini+β2qAgei+β3qBaseline Thresholdijq++bsi0(site0.q)+bsi1(site1.q)+bai0(aud0.q)+bai1(aud1.q)+bi(ind.q),

for q = 1, …, 7. Aspirin exposure is associated with an estimated increase of 0.685 dB (P = 0.0119, 95% CI: [0.152, 1.218]) in the elevation of the hearing threshold at 6 kHz after a 3-year follow-up. See Table 4 and Table S25 in the Supplementary Material.

For comparison, we also fit single linear regression models where the outcome Yiq is defined as the change in the hearing threshold of the worse ear (ie, ear with the larger increase in the threshold), the better ear (ie, ear with the smaller increase in the threshold), or the average change in the hearing threshold from two ears for participant i at frequency q. The model is expressed as follows, with some confounders introduced at the beginning of the section omitted.

E[Yiq]=β0q+β1qAspirini+β2qAgei+β3qBaseline Thresholdiq+

for q = 1, …, 10, where q = 1, …, 7 correspond to the frequencies 0.5 kHz , …, 8 kHz as in the mixed-effects models, and q = 8,9,10 correspond to the PTAs of low-frequency (LPTA0.5,1,2kHz), mid-frequency (MPTA3,4kHz), and high-frequency (HPTA6,8kHz), respectively. The baseline pure-tone threshold is defined correspondingly, according to which ear was used to define Yiq (see footnotes in Table 4). Following the better-ear approach, aspirin exposure is associated with an estimated increase of 0.758 dB (P = 0.0087, 95% CI: [0.192, 1.325]) in the elevation of the hearing threshold at 6 kHz after a 3-year follow-up. Following the average-ear approach, aspirin exposure is associated with an estimated increase of 0.630 dB (P = 0.0236, 95% CI: [0.0845, 1.175]) in the elevation of the hearing threshold at 6 kHz after a 3-year follow-up. The effect of aspirin exposure is not detected as significant following the worse-ear approach. See Table S26 of the Supplementary Material for the results of the analyses for all the frequencies.

6 |. DISCUSSION

In this article, we compare, both theoretically and in simulation studies, the commonly used existing methods in hearing-loss literature, which utilize single-ear data and treat each participant as independent, to proposed both-ears methods that are based on linear mixed-effects models. We found that (1) the worse-ear and better-ear methods may lead to less efficient estimators than the both-ears and average-ear methods if there are no ear-level confounders, and they may lead to biased estimates when there exist ear-level confounders; (2) the average-ear method is valid whether or not there are ear-level confounders; in terms of efficiency, it is comparable to the both-ear method that includes a random individual effect to take into account the between-ear correlation if there are no ear-level confounders; however, if there exist ear-level confounders, the average-ear method may lead to less efficient estimators than the both-ears method; (3) the both-ear methods can gain efficiency by including site and audiologist random effects if site and audiologist are correlated with the hearing outcome but not the exposure.

This article focuses on the situations when sites and audiologists are correlated only with the hearing outcome. In the cases when sites and audiologists are confounders, they should be included in the model; not including them may cause bias in effect estimates.30 Other than the confounding issue, we have assumed in our study that the random effects follow normal distributions. However, according to our simulation studies in Section 4, the both-ears methods including random sites and audiologists effects maintained the best performances when such assumptions were violated. Our proposed both-ears mixed effects models are expected to work in settings of a typical hearing study. The full frequency model includes the audiologist-frequency, site-frequency, and participant-frequency interactions. Additional random intercepts and slopes can be similarly specified to take into account other interaction effects. According to the simulation studies in Section 4, the SAS program works well with a low computational burden across simulation settings with flexible sample sizes, cluster numbers and cluster sizes that are realistic in hearing studies. Note that, if the between-cluster variation is close to zero, the estimated variance of the random effect for the cluster could be negative.3133 While we did not encounter this boundary issue in our settings, one could add restrictions as described in Thompson34 to avoid obtaining estimates out of the parameter space. Additional investigations of such negative variance problems can be found in Fletcher and Underwood35 and Leithy et al.32 Furthermore, we can perform statistical tests on zero variance components in linear mixed-effects models.36,37 If the variance is zero, one can consider remove the random effect from the model.

Moreover, linear mixed effects models are advantageous in uncovering individual-specific effects. This is particularly desirable when dealing with correlated hearing data since hearing loss patterns are expected to vary across participants. Other existing statistical options such as marginal methods focus on the population-average effects. See Chen et al38 for a both-ears method that analyzes hearing data based on GEEs without taking into account sites and audiologists effects. While mixed-effects models adjust for sites and audiologists using random effects, the GEE method can adjust for them using the fixed effects. Although the GEE method does not require any distributional assumptions, it may not be suitable forthescenarioswheremanysmallclustersareincludedasfixed-effects.Athoroughcomparisonofthemarginalmethods such as the GEE method in the application of hearing studies with mixed effects models is a topic of our future research.

Finally, we comment that the CHEARS-AAA example only has two time points. For studies with three or more time points, one could extend our proposed model by including time, time-exposure interactions, and time-confounders interactions as the fixed effects. As for the random effects in the models, one could consider the interactions between site/audiologist and time, similar to those between site/audiologist and frequency in the full frequency model in Section 2. In addition, taking the simple frequency model in Section 2 as an example, the participant random effect bi(ind) can be expanded to bi(ind)+bit(i)(ind.time), where the subscripts t(i) indicates that time points are nested within participants. The additional participant-time random interaction allows for the ICC of the outcomes of the same participant at the same time point to be larger than that of the same participant at distinct time points.

Supplementary Material

supinfo

ACKNOWLEDGEMENTS

This work was partially supported by the National Institute Health grants R01 DC017717, U01 CA176726 (NHS II), and U01 DC 010811. The authors thank the reviewers for their insightful comments that have improved this paper.

Funding information

National Institute of Health, Grant/Award Numbers: R01 DC017717, U01 CA176726, U01 DC 010811

Footnotes

SUPPORTING INFORMATION

Additional supporting information can be found online in the Supporting Information section at the end of this article.

CONFLICT OF INTEREST

The authors declare no potential conflict of interests.

DATA AVAILABILITY STATEMENT

The data for the Audiology Assessment Arm are not publicly available.

REFERENCES

  • 1.Cunningham L, Tucci D. Hearing loss in adults. N Engl J Med. 2017;377(25):2465–2473. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Mick P, Kawachi I, Lin F. The association between hearing loss and social isolation in older adults. Otolaryngol Head Neck Surg. 2014;150(3):378–384. [DOI] [PubMed] [Google Scholar]
  • 3.Lin F, Yaffe K, Xia J, et al. Hearing loss and cognitive decline in older adults. JAMA Intern Med. 2013;173(4):293–299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kiely K, Anstey K, Luszcz M. Dual sensory loss and depressive symptoms: the importance of hearing, daily functioning, and activity engagement. Front Hum Neurosci. 2013;7:837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Dalton D, Cruickshanks K, Klein B, Klein R, Wiley T, Nondahl D. The impact of hearing loss on quality of life in older adults. Gerontologist. 2003;43(5):661–668. [DOI] [PubMed] [Google Scholar]
  • 6.Bainbridge K, Hoffman H, Cowie C. Diabetes and hearing impairment in the United States: audiometric evidence from the National Health and nutrition examination survey, 1999 to 2004. Ann Intern Med. 2008;149(1):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Cruickshanks KJ, Tweed TS, Wiley TL, et al. The 5-year incidence and progression of hearing loss. Arch Otolaryngol Head Neck Surg. 2003;129(10):1041–1046. [DOI] [PubMed] [Google Scholar]
  • 8.Zhan W, Cruickshanks KJ, Klein BEK, et al. Generational differences in the prevalence of hearing impairment in older adults. Am J Epidemiol. 2009;171(2):260–266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lee SA, Kang HT, Lee YJ, Lee JD, Kim BG. National representative analysis of unilateral hearing loss and hearing aid usage in South Korea. PLoS One. 2020;15(4):e0232112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gates G, Cooper J, Kannel W, Miller N. Hearing in the elderly: the Framingham cohort, 1983–1985. Part I. basic audiometric test results. Ear Hear. 1990;11(4):247–256. [PubMed] [Google Scholar]
  • 11.Gopinath B, McMahon C, Rochtchina E, Karpa M, Mitchell P. Incidence, persistence, and progression of tinnitus symptoms in older adults: the blue mountains hearing study. Ear Hear. 2010;31(3):407–412. [DOI] [PubMed] [Google Scholar]
  • 12.Linssen A, Van Boxtel M, Joore M, Anteunis L. Predictors of hearing acuity: cross-sectional and longitudinal analysis. J Gerontol A. 2014;69(6):759–765. [DOI] [PubMed] [Google Scholar]
  • 13.Rigters S, Van der Schroeff MP, Papageorgiou G, Baatenburg de Jong R, Goedegebure A. Progression of hearing loss in the aging population: repeated auditory measurements in the Rotterdam study. Audiol Neurotol. 2018;23(5):290–297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lee CY, Hwang JH, Hou SJ, Liu TC. Using cluster analysis to classify audiogram shapes. Int J Audiol. 2010;49(9):628–633. [DOI] [PubMed] [Google Scholar]
  • 15.Diggle P, Diggle PJ, Heagerty P, Liang KY, Zeger S. Analysis of Longitudinal Data. Oxford, England: Oxford University press; 2002. [Google Scholar]
  • 16.Fitzmaurice G, Davidian M, Verbeke G, Molenberghs G. Longitudinal Data Analysis.. Boca Raton, FL: CRC press; 2008. [Google Scholar]
  • 17.Fitzmaurice GM, Laird NM, Ware JH. Applied Longitudinal Analysis. Hoboken, NJ: John Wiley & Sons; 2012. [Google Scholar]
  • 18.Hartley HO, Rao JNK. Maximum-likelihood estimation for the mixed analysis of variance model. Biometrika. 1967;54(1):93–108. [PubMed] [Google Scholar]
  • 19.Littell RC, Milliken GA, Stroup WW, Wolfinger RD. SAS System for Mixed Models. Cary: SAS Institute Inc.; 1996. [Google Scholar]
  • 20.Pinheiro J, Bates D. Mixed-Effects Models in S and S-PLUS. New York: Springer-Verlag; 2000. [Google Scholar]
  • 21.Pinheiro JC. Linear mixed effects models for longitudinal data. Wiley StatsRef. 2014;1(1):1–7. [Google Scholar]
  • 22.Morrell CH, Brian LJ. Modelling hearing thresholds in the elderly. Stat Med. 1991;10:1453–1464. [DOI] [PubMed] [Google Scholar]
  • 23.Morrell CH, Pearson JD, Brant LJ, Gordon-Salant S. Construction of hearing percentiles in women with non-constant variance from the linear mixed-effects model. Stat Med. 1997;16:2475–2488. [DOI] [PubMed] [Google Scholar]
  • 24.Schielzeth H, Dingemanse NJ, Nakagawa S, et al. Robustness of linear mixed-effects models to violations of distributional assumptions. Methods Ecol Evol. 2020;11(9):1141–1152. [Google Scholar]
  • 25.Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73:13–22. [Google Scholar]
  • 26.Wallhagen MI, Strawbridge WJ, Shema SJ, Kaplan GA. Impact of self-assessed hearing loss on a spouse: a longitudinal analysis of couples. J Gerontol Soc Sci. 2004;59(3):190–196. [DOI] [PubMed] [Google Scholar]
  • 27.Cruickshanks K, Klein R, Klein B, Wiley T, Nondahl D, Tweed T. Cigarette smoking and hearing loss: the epidemiology of hearing loss study. J Am Med Assoc. 1998;279(21):1715–1719. [DOI] [PubMed] [Google Scholar]
  • 28.Tambs K, Hoffman H, Borchgrevink H, Holmen J, Samuelsen S. Hearing loss induced by noise, ear infections, and head injuries: results from the Nord-Trøndelag hearing loss study. Int J Audiol. 2003;42:89–105. [DOI] [PubMed] [Google Scholar]
  • 29.Curhan S, Halpin C, Wang M, Eavey R, Curhan G. Prospective study of dietary patterns and hearing threshold elevation. Am J Epidemiol. 2020;189(3):204–214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.McNamee R. Confounding and confounders. Occup Environ Med. 2003;60:227–234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Rao CR. Estimation of variance and covariance components in linear models. J Am Stat Assoc. 1972;67(337):112–115. [Google Scholar]
  • 32.Leithy HAE, Wahed ZAA, Abdallah MS. On non-negative estimation of variance components in mixed linear models. J Adv Res. 2016;7:59–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Cheng C, Wang M. Statistical methods for analysis of combined categorical biomarker data from multiple studies. Ann Appl Stat. 2020;14(3):1146–1163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Thompson WA. The problem of negative estimates of variance components. Ann Math Stat. 1962;33(1):273–289. [Google Scholar]
  • 35.Fletcher DJ, Underwood AJ. How to cope with negative estimates of components of variance in ecological field studies. J Exp Mar Biol Ecol. 2002;273:89–95. [Google Scholar]
  • 36.Greven S, Crainiceanu CM, Kuchenhoff H, Peters A. Restricted likelihood ratio testing for zero variance components in linear mixed models. J Comput Graph Stat. 2008;17(4):870–891. [Google Scholar]
  • 37.Drikvandi R, Verbeke G, Khodadadi A, Partovi NV. Testing multiple variance components in linear mixed-effects models. Biostatistics. 2012;14(1):144–159. [DOI] [PubMed] [Google Scholar]
  • 38.Chen C,Zhang N,Curhan GC,Curhan SG,Wang M. Both-ear method for the analys is of audiometricdata. EarHear.2022;43(5):1443–1455. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supinfo

Data Availability Statement

The data for the Audiology Assessment Arm are not publicly available.

RESOURCES