Skip to main content
Journal of Applied Statistics logoLink to Journal of Applied Statistics
. 2022 Jun 8;50(13):2717–2738. doi: 10.1080/02664763.2022.2078798

Analysis of medians under two-way model with and without interaction for Birnbaum–Saunders distributed response

S M Patil 1, H V Kulkarni 1,CONTACT
PMCID: PMC10503464  PMID: 37720248

Abstract

The Birnbaum–Saunders (BS) distribution, well-known as the fatigue-life distribution, has been used in numerous disciplines ranging from engineering to medical sciences. In this article, we develop a test for analysis of medians for BS distributed response to assess the impact of two interacting factors on the median, where no test is presently available. The proposed integrated likelihood ratio test (ILRT) eliminates the nuisance shape parameters by integrating them out. The second-order accurate asymptotic chi-square distribution of ILRT is derived. An in-depth simulation study strongly supports its excellent performance even under small group sizes. Furthermore, ILRT developed under the one-way model is found uniformly superior over its peers, is straightway extendable under general multiway setup, and has potential to be extended to other non-normal response variables. Its genuine need in industry, where non-normal responses are commonly encountered, is highlighted through analysis of three real data sets: ILRT strongly picked out the deposition time as influential factor in epitaxial layer experiment, revealed significant impact of spools on fiber life for the failure times of Kevlar 49 fiber data, and gave more accurate parameter estimates in delivery time data experiment, as assessed by various model adequacy tools, where its competitors failed to deliver desired results.

Keywords: Homogeneity of medians, one-way setup, two-way setup with and without interaction, small group sizes, skewed data

1. Introduction

The Birnbaum–Saunders (BS) distribution is widely used in the context of material fatigue and reliability applications, also known as fatigue-life distribution. It is used in various situations where the accumulation of a particular factor forces a quantifiable characteristic to exceed a critical threshold. For example, migration of metallic flaws in nano-circuits due to heat in a computer chip, accumulation of harmful substances in the lungs from air pollution, the occurrence of natural disasters such as earthquakes and tsunamis, among many other situations. Recently, its use is not limited to analyze ‘fatigue’ or ‘cumulative damage’ type areas from where it was originated but also has found profound applications in business, economics, finance, industry, insurance, nutrition, psychology, quality control, among many other disciplines and is acquiring increasing popularity. For more details see Leiva [3], among many other references.

One-way and two-way analyses are commonly used to analyze data generated from industries and many other disciplines, where the data are most often likely to be non-normally distributed. More often such data are non-negative and have a lifetime related probability distribution, where there is an acute paucity of literature related to two and multiway setup. Often the non-negative and/or skew nature of the data is totally ignored and normal-based methods are blindly used, leading to a possibility of incorrect decisions. Section 5 demonstrates analyses of three real data sets related to engineering, where the results based on F-test were not reliable. Thus, there seems to be an absolute need for developing correct statistical procedures for handling non-normal ANOVA problems. In spite of the fact that BS distribution is widely used in various disciplines, the two-way ANOVA setup under BS is absolutely un-handled in the literature, whereas there are only a few ways available in a one-way setup. Thus, the present work seems to be an attempt to fill this gap in the literature.

Noting that the mean and standard deviation of the BS distribution are directly proportional to the scale parameter, as well as the scale parameter of the distribution equals the median, assessing the impact of external factors on the median or equivalently on the scale is an important problem in industrial applications, leading to a factorial model for the scale parameter which henceforth is refereed to as the median. Presently no test is available in the literature under this scenario, while under the one-way model, two tests by Niu et al. [6] based on generalized pivotal quantities (GPQ) and Delta method are available. Regression models have been handled through the log-transformation on the BS distributed response, see for example Rieck and Nedelman [7] and Lemonte et al. [4] among others.

There is an acute paucity of literature on the analysis of factorial data with non-normal response variables. A few existing tests like generalized linear models make use of maximum likelihood-based procedures which work better under relatively larger sample sizes while in industry there could be many situations where gathering more data would be probably expensive.

The present work develops a simple test for the two-way setup with and without interaction for the median, based on the idea of eliminating nuisance shape parameters through the use of integrated likelihood (IL). An integrated likelihood ratio test (ILRT) is constructed and is shown to be asymptotically chi-square distributed up to the second-order. Simple ad-hoc multiplicative adjustments for small group sizes have been developed leading to improved performance under small samples rendering the ILRT uniformly usable. A simulation study shows that ILRT controlled the type-I error remarkably and uniformly well and attains very good power under varying realistic situations. A similar test under the one-way model is also developed and found to outperform the existing GPQ and Delta method-based tests. For similar work under normally distributed data, we refer to Kulkarni and Patil [9], SenGupta and Kulkarni [8], and Kulkarni and SenGupta [10] in the context of directional data.

The remainder of this article is organized as follows. Section 2 develops the ILRT under a two-way with and without interaction model and its asymptotic distributions. Section 3 is devoted to a thorough performance assessment of the ILRT based on a rigorous simulation study. Section 4 develops its version under a one-way model and reports its performance assessment. A rigorous analysis with several model adequacy checks of three real data sets from industrial applications presented in Section 5 highlights the advantages of the proposed test in industry. The paper is concluded with a section of conclusions.

2. The proposed ILRT

Let L(ψψ,λλ) be a likelihood function under consideration, ψψ being the vector of parameters of interest and λλΛΛ the vector of nuisance parameters. An IL is of the form

L¯(ψψ)=ΛΛL(ψψ,λλ)Π(λλ|ψψ)dλλ,

where Π is a non-negative weight function on ΛΛ making the above integral convergent for every fixed ψψ. L¯ depends on the data and ψψ, and can be used as a standard likelihood function for all likelihood-based inference procedures.

In the following, we develop analysis of medians, a version of ANOVA for the mean parameter under two-way with interaction setup. The medians are assumed to be influenced by two factors while the shape parameters are assumed to be unaffected.

2.1. Two-way setup with interaction

The probability density function (pdf) of a BS distribution (see Leiva [3]) under two-way setup with interaction form for the median ψij is

f(tijk,ψij,γij)=122πψijγijA(tijk;ψij) exp[12γij2a(tijk;ψij)],ψij=(μ+αi+βj+(αβ)ij)>0,tijk>0,γij>0,i=1,,I; j=1,,J; k=1,,nij,

where A(tijk;ψij)=(ψij/tijk)1/2+(ψij/tijk)3/2, a(tijk;ψij)=(tijk/ψij)+(ψij/tijk)2, μ is the unknown overall effect, αi is the unknown effect of the ith level of first factor, βj is the unknown effect of the jth level of second factor, (αβ)ij is the unknown interaction effect of the ith level of first factor and jth level of second factor and γij are unknown shape parameters.

Let Tij1,,Tijnij be the observations from the (i,j)th cell following a BS(ψij,γij) distribution, i=1,,I, j=1,,J. Let N=i=1Ij=1Jnij; ni.=j=1Jnij, n.j=i=1Inij, T¯a=i=1Ij=1Jk=1nijTijk/N and T¯h=(i=1Ij=1Jk=1nijTijk1/N)1 be the overall sample size, arithmetic and harmonic mean, respectively, T¯ai..=j=1Jk=1nijTijk/ni., T¯hi..=(j=1Jk=1nijTijk1/ni.)1 be the arithmetic and harmonic means of ith row, with similar counter parts for column, T¯aij.=k=1nijTijk/nij, T¯hij.=(k=1nijTijk1/nij)1 be the arithmetic and harmonic means of (i,j)th cell and t¯a, t¯h, t¯ai.., t¯hi.., t¯a.j., t¯h.j., t¯aij., t¯hij. denote their observed values, respectively. The hypothesis corresponding to the test for no interaction is

H0:(αβ)11=(αβ)12==(αβ)IJv/sH1:(αβ)ij(αβ)ijfor some ii,jj. (1)

Let tt=(t111,,t11n11,,tI11,,tIJnIJ) be the vector of all observations. Here (μ,α1,,αI,β1,,βJ,(αβ)11,,(αβ)IJ) is the vector of unknown parameters of interest with ψij=μ+αi+βj+(αβ)ij; ψψ=(ψ11,,ψIJ) and γγ=(γ11,,γIJ) is the vector of unknown nuisance parameters. Berger et al. [1] recommend the uniform prior taking a constant value 1 over the entire nuisance parameter space as an apparent option when no prior information is available about the nuisance parameters. In the present case, an added advantage is that it gives a closed-form expression for the resulting IL.

The likelihood function of (ψψ,γγ|tt) is

L(ψψ,γγ|tt)=i=1Ij=1J{(22π)nij(ψijγij)nijk=1nijA(tijk;ψij)exp[12γij2k=1nija(tijk;ψij)]},ψij,γij>0. (2)

Integration of the likelihood function with respect to the uniform prior over [0,) for all γij, i=1,,I, j=1,,J yields

L¯(ψψ|tt)=i=1Ij=1JL¯ij(ψij)i=1Ij=1J{(1ψij)nijk=1nijA(tijk;ψij)[k=1nija(tijk;ψij)](nij1)/2}. (3)

The log-IL functions under H0 and the whole parameter space are denoted by l¯0 and l¯1, respectively, and are given by

l¯0=i=1Ij=1J{nijlog(ψij0)+k=1nijlog[A(tijk;ψij0)](nij1)2log[k=1nija(tijk;ψij0)]} (4)

and

l¯1=i=1Ij=1J{nijlog(ψij1)+k=1nijlog[A(tijk;ψij1)](nij1)2log[k=1nija(tijk;ψij1)]}, (5)

where ψij0=(μ+αi+βj) and ψij1=(μ+αi+βj+(αβ)ij).

Let l¯0 be maximized at δδ=(μˆ0,ααˆ0,ββˆ0). Its value can be obtained by maximizing right-hand side (rhs) of Equation (4) numerically. Any numerical algorithm for maximization (equivalently, minimization of the function after reversing its sign) needs an initial starting value to start with, for which the choice (t¯a×t¯h,t¯a1..×t¯h1..,,t¯aI..×t¯hI..,t¯a.1.×t¯h.1.,,t¯a.J.×t¯h.J.) is found to work very well. Similarly l¯1 is maximized at (μˆ1,ααˆ1,ββˆ1,(ααββˆ)1) and its value can be computed by maximizing rhs of (5) where initial value suggested is either (δδ,t¯a11.×t¯h11.,,t¯aIJ.×t¯hIJ.) or (t¯a×t¯h,t¯a1..×t¯h1..,,t¯aI..×t¯hI..,t¯a.1.×t¯h.1.,,t¯a.J.×t¯h.J., t¯a11.×t¯h11.,,t¯aIJ.×t¯hIJ.).

Though analytical proof of the existence of a unique maximum likelihood estimate (MLE) is not tractable at stage, thousands of sample data sets were randomly generated and observed to give unique MLE's both under l¯0 as well as l¯1, and practically no difficulty was found in employing the test developed here.

The IL ratio is

λ¯=supψψΘ0Θ0L¯(ψψ|tt)supψψΘΘL¯(ψψ|tt)=i=1Ij=1J{[(1ψˆij0)nijk=1nijA(tijk;ψˆij0)[k=1nija(tijk;ψˆij0)](nij1)/2][(1ψˆij1)nijk=1nijA(tijk;ψˆij1)[k=1nija(tijk;ψˆij1)](nij1)/2]}, (6)

where ΘΘ={ψψ:ψij(0,),i=1,,I, j=1,,J}, Θ0Θ0 is the subset of ΘΘ and ψˆij0=μˆ0+αˆi0+βˆj0, ψˆij1=μˆ1+αˆi1+βˆj1+(αβˆ)ij1; i=1,,I, j=1,,J. The proposed ILRT statistic 2logλ¯ equals

TILRT1=2i=1Ij=1J{nijlog(ψˆij0ψˆij1)+k=1nijlog[A(tijk;ψˆij1)A(tijk;ψˆij0)]+(nij1)2log[k=1nija(tijk;ψˆij0)k=1nija(tijk;ψˆij1)]}. (7)

The asymptotic χ2 distribution of TILRT1 is stated in Theorem 2.1. The theme of the proof goes in line with that of the Theorem 1 of SenGupta and Kulkarni [8] and given as supplementary material.

Theorem 2.1

The second-order accurate asymptotic distribution of TILRT1 is χ(I1)(J1)2.

2.2. Two-way setup without interaction

The pdf of a BS distribution under two-way setup without interaction model is as in Section 2.1 except that the vector of unknown parameters of interest is (μ,α1,,αI,β1,,βJ) with ψij=μ+αi+βj, i=1,,I; j=1,,J; ψψ=(ψ11,,ψIJ). The hypothesis of interest is to test

H0:β1=β2==βJv/sH1:βjβjfor some jj. (8)

The expressions for the likelihood function L, the IL function L¯ and the log-IL function l¯1 under H1 are given exactly by Equations (2), (3) and (4), respectively, while under H0, we get

l¯0=i=1Ij=1J{nijlog(ψi0)+k=1nijlog[A(tijk;ψi0)](nij1)2log[k=1nija(tijk;ψi0)]}, (9)

where ψi0=(μ+αi).

Here, l¯0 is maximized at δδ=(μˆ0,ααˆ0). Its value can be obtained by maximizing rhs of Equation (9) numerically using an initial starting value (t¯a×t¯h,t¯a1..×t¯h1..,,t¯aI..×t¯hI..). Similarly l¯1 is maximized at (μˆ1,ααˆ1,ββˆ1) and its value can be computed by maximizing rhs of (4) where initial value suggested is either (δδ,t¯a.1.×t¯h.1.,,t¯a.J.×t¯h.J.) or (t¯a×t¯h,t¯a1..×t¯h1..,, t¯aI..×t¯hI..,t¯a.1.×t¯h.1.,,t¯a.J.×t¯h.J.).

The expressions of resulting IL ratio and the ILRT statistics say, TILRT2 are same as those given in Equations (6) and (7), respectively, with estimates of median parameter replaced by

ψˆij0=μˆ0+αˆi0 (10)

and

ψˆij1=μˆ1+αˆi1+βˆj1 (11)

are the MLE's of ψij under H0 and whole parameter space, respectively. The asymptotic χ2 distribution of TILRT2 is stated in Theorem 2.2.

Theorem 2.2

The second-order accurate asymptotic distribution of TILRT2 is χ(J1)2.

The proof is exactly in line with the one of Theorem 2.1 and is omitted.

2.3. Assessment of convergence of type-I error rate through simulations

The type-I errors of ILRT are simulated for various common group sizes n=5(5)50 (here and in the sequel the notation a(b)c denotes the set of values of n starting with a, ending with c, with successive increments by the number b), overall effect μ=0.5,1,2,3.5,5; ith row level α1=0.5,1,2,3.5,5, αi=αi1×h2, i=2,,I; incrementing h2 through 0.5, 1, 1.5. By default, under H0, under the without interaction setup, βj=0.5,1,2,3.5,5, j=1,,J while under the with-interaction setup, jth column level β1=0.5,1,2,3.5,5, βj=βj1×h2, j=2,,J and (ij)th interaction effect (αβ)ij=0.5,1,2,3.5,5, i=1,,I, j=1,,J (and other parametric combinations as specified in the set P given in Section 3.1 except GSP). The simulated type-I errors plotted in Figures 1 and 2 for with and without interaction setup, respectively, converge very well to the target level 0.05, indicating good conformation to the asymptotic χ2 distribution specified by Theorems 2.1 and 2.2, for group sizes exceeding 5 and 30, respectively. Under smaller group sizes, the type-I errors are slightly smaller for the with-interaction setup and larger for the without-interaction setup than the desired level. Ad-hoc corrective adjustment factors suggested in the next subsection (Equations (12) and (13)) stabilized the type-I errors quite well to the desired level under small group sizes.

Figure 1.

Figure 1.

Box plots portraying the rate of convergence of type-I errors of ILRT with common group size (n) under two-way with interaction setup.

Figure 2.

Figure 2.

Box plots portraying the rate of convergence of type-I errors of ILRT with common group size (n) under two-way without interaction setup.

2.4. Corrective adjustments under small samples (two-way setup)

Let Tα and qα be the αth quantiles of the test statistic T and its desired χ2 distribution, respectively. Nothing that P(T>T1α)=α, and hence that P[(q1α/T1α)T>q1α]=α, it follows that the entity q1α/T1α works as a corrective factor (cf) for adjusting the observed sizes to the desired level α under small group sizes. These adjustments are most often essential under small group sizes since the asymptotic distributions of likelihood-based tests work well under little larger group sizes.

The cf is calculated under different group sizes using the ratio of theoretical 95th quantiles of the desired χ(I1)(J1)2 and χ(J1)2 (note that these are q1α at 5 % level) distribution to the simulated 95th quantiles of TILRT1 of (7) and TILRT2 of (7, 10 and 11) under H0, respectively, based on 2000 simulations for each of the parametric combinations specified in Section 2.3. The resulting cf was found to depend mainly on the common group size and is given in Equations (12) and (13) for two-way with and without interaction setup, respectively

cf={1.0146for n<10,1for n10, (12)
cf={0.8440for n<10,0.9596for 10n20,0.9800for 25n30,1for n>30. (13)

The ILRT is suggested to be used with this cf to be employed individually for each group based on the distinct group size nij in place of n in the expression for cf (resulting in cfij say) giving rise to the adjusted statistic for two-way with and without interaction setup

TILRT_adj=2i=1Ij=1Jcfij×{nijlog(ψˆij0ψˆij1)+k=1nijlog[A(tijk;ψˆij1)A(tijk;ψˆij0)]+(nij1)2log[k=1nija(tijk;ψˆij0)k=1nija(tijk;ψˆij1)]}.

The box plots of the simulated sizes of adjusted ILRT plotted in Figure 3(a,b) for two-way with and without interaction setup, respectively, are indicative of the improvement in the size performance of ILRT after adjustments. Throughout in the sequel, we use this adjusted version of ILRT and the word ‘ILRT’ is used to represent this adjusted ILRT throughout.

Figure 3.

Figure 3.

Box plot of type-I errors of ILRT under two-way with and without interaction setup. (a) Two-way with interaction setup. (b) Two-way without interaction setup.

3. Performance assessment through simulation study

This section presents an exhaustive simulation study to test the performance of the proposed ILRT in terms of type-I error and power function under a fairly representative set of parametric combinations.

3.1. Type-I error rate

The parametric setup considered under two-way model for empirical assessment constitutes I=2(1)6 row levels, J=2(1)6 column levels, overall effect μ=0.5,5, ith row effect α1=0.5,5, αi=αi1×h2, i=2,,I; incrementing h2 through 0.5, 1.5. By default, under H0, the column levels are same under the without interaction setup, and the common values are taken to be 0.5 and 5. Under the with-interaction setup, the column levels are set at β1=0.5,5, βj=βj1×h2, j=2,,J. For with-interaction setup, under H0, the interaction term is set at common values 0.5 and 5.

Based on the pattern of group sizes among the IJ cells, three group size patterns (GSP) were considered: GSP-1: 100% (all) small size groups ( nij=5, i=1,,I, j=1,,J), GSP-2: 50% small size ( nij=5) and 50% medium size ( nij=15) groups, GSP-3: 100% (all) medium size groups ( nij=15). Similarly based on the variability in shape parameters among the IJ groups, three variability patterns in shape (VPS) were considered: VPS-1: 100% (all) groups with equal shape parameters ( γij=1, i=1,,I, j=1,,J), VPS-2: first 50% groups with equal shape parameters ( γij=0.5(<1)) and last 50% groups with equal shape parameters ( γij=1.5(>1)), VPS-3: first 50% groups with equal shape parameters ( γij=1.5) and last 50% groups with equal shape parameters ( γij=0.5). The set of all parametric combinations resulting from the above settings is denoted by P in the sequel.

Type-I errors of ILRT are simulated for the parametric combinations in P based on 2000 simulations. The commonly used 5% level of significance was used. Figure 3( a,b) giving the box plots of the simulated sizes under two-way with and without interaction setup indicates well-concentrated type-I errors of ILRT around the nominal level.

3.2. Power performance

Based on 2000 simulations, the power functions were simulated for the same parametric setup  P as above. In addition, under two-way with-interaction setup, under the alternative hypothesis, the interaction factor (ααββ) was systematically varied taking (αβ)11=0.5, (αβ)ij=(αβ)i(j1)×h1 for j>1 and (αβ)ij=(αβ)(i1)(j+2)×h1 for j = 1, i=1,,I, j=1,,J; incrementing h1 through h1=1(0.2)3;3.5(0.5)5;7. Note that h1=1 corresponds to the null hypothesis. The overall effect is set to be μ=0.5, ith row effect α1=0.5; αi=αi1×0.5, i=2,,I and jth column effect β1=0.5; βj=βj1×0.5, j=2,,J.

Figure 4 displays the power functions (as a function of h1) under I = 2; J = 2 and I = 6; J = 2 as representative of all patterns represented in the simulation study. For I = 6; J = 2 increments were slightly altered to be h1=1(0.2)2.6 for a better visual display. The panel headings of each sub-figure are the pairs (VPS, GSP).

Figure 4.

Figure 4.

Power functions of ILRT under two-way with interaction setup. (a) Two row (I = 2) and two column (J = 2) levels. (b) Six row (I = 6) and two column (J = 2) levels.

Similarly, under two-way without interaction setup, the column factor ββ was systematically varied taking β1=0.5; βj=βj1×h1, j=2,,J; incrementing h1 through h1=1(.02)1.1;1.2(0.2)3;3.5(0.5)5;7(2)15 under the alternative. The overall effect and ith row effect were set as above. Figure 5 displays the power functions (as a function of h1) under I = 2; J = 2 and I = 6; J = 6 as representative of all patterns represented in the simulation study. Here too, for I = 6; J = 6 increments were slightly altered to be h1=1(.02)1.1;1.2(0.2)2 for a better visual display. As before, the panel headings of each sub-figure are the pairs (VPS, GSP).

Figure 5.

Figure 5.

Power functions of ILRT under two-way without interaction setup. (a) Two row (I = 2) and two column (J = 2) levels. (b) Six row (I = 6) and six column (J = 6) levels.

Clearly, ILRT gives consistently satisfactory power under all parametric combinations considered and hence it can be uniformly used under all real-life situations.

ILRT-based homogeneity test under one-way setup can be similarly developed and is derived in the next section. Note that the test is equivalent to ANOVA for means under BS distribution when the shape parameters of all groups are the same.

4. ILRT under one-way setup

Consider the one-way setup with I groups, there being ni observations in ith group, i=1,,I. The pdf of the jth observation in the ith group tij is

f(tij,βi,γi)=122πβiγi[A(tij;βi)]exp[12γi2a(tij;βi)],tij>0,βi,γi>0, i=1,,I;j=1,,ni,

where A(tij;βi)=(βi/tij)1/2+(βi/tij)3/2, a(tij;βi)=(tij/βi)+(βi/tij)2, βi is the median and γi is the shape parameter.

Let Ti1,,Tini be the random observations forming the ith group, following BS(βi,γi) distribution, i=1,,I. Let N=i=1Ini, T¯a=i=1Ij=1niTij/N and T¯h=(i=1Ij=1niTij1/N)1 be the overall sample size, arithmetic and harmonic mean, respectively, T¯ai.=j=1niTij/ni and T¯hi.=(j=1niTij1/ni)1 be the arithmetic and harmonic means of ith group and t¯a, t¯h, t¯ai, t¯hi denote their observed values, respectively. The hypothesis of interest is to test

H0:β1=β2==βIv/sH1:βiβifor some ii. (14)

The likelihood function of aggregate sample is

L(ββ,γγ|tt)=i=1I{(22π)ni(βiγi)nij=1niA(tij;βi) exp[12γi2j=1nia(tij;βi)]}βi,γi>0, i=1,,I.

Here, ββ=(β1,,βI) is the vector of unknown parameters of interest and γγ=(γ1,,γI) is the vector of unknown nuisance parameters. Similar to two-way setup, integration of the likelihood function with respect to the uniform prior yields

L¯(ββ|tt)=i=1IL¯i(βi)i=1I{(1βi)nij=1niA(tij;βi)[j=1nia(tij;βi)](ni1)/2},

and with the abuse of similar notation,

l¯0=i=1I{nilog(β0)+j=1nilog[A(tij;β0)](ni1)2log[j=1nia(tij;β0)]} (15)

and

l¯1=i=1I{nilog(βi)+j=1nilog[A(tij;βi)](ni1)2log[j=1nia(tij;βi)]}. (16)

In the similar context, βˆ0 and βˆβˆ are maximizers of rhs of (15) and (16), respectively. For numerical maximization, the vector (t¯a×t¯h,(t¯a1.×t¯h1.,,t¯aI.×t¯hI.)) worked well as initial starting value. Exactly parallel computations yield the following ILRT statistics for the one-way setup:

TILRT3=2{i=1Inilog(βˆ0βˆi)+i=1Ij=1nilog[A(tij;βi)A(tij;β0)]+i=1I(ni1)2log[j=1nia(tij;β0)j=1nia(tij;βi)]}. (17)

The asymptotic χ2 distribution of TILRT3 is stated in Theorem 4.1. Its proof is similar to that of Theorem 2.1 and is omitted.

Theorem 4.1

The second-order accurate asymptotic distribution of TILRT3 is χ(I1)2.

4.1. Corrective adjustments under small samples (one-way setup)

Based on 100,000 simulations, the simulated type-I errors of ILRT at 5% level of significance, for various group sizes n=5(5)45;100,200; group shape parameters γ1=0.5(0.5)5, γi=γi1×h2, i=2,,I, incrementing h2 through 0.5, 1, 1.5; (and other parametric combinations as specified in the set Q given in Section 4.2.1 except GSP) converged well to the target level 0.05, indicating good conformation to the asymptotic χ2 distribution specified by Theorem 4.1 for group sizes exceeding 45. As before, a minor ad-hoc corrective adjustment factor is developed using a parallel approach, stated in Equation (18), and succeeds very well in controlling the sizes, suggesting the use of corrected statistics given in Equation (19)

cf={0.936+0.00128×nfor 5n45,1for n>45. (18)
TILRT3_adj=2i=1Icfi×{i=1Inilog(βˆ0βˆi)+i=1Ij=1nilog[A(tij;βi)A(tij;β0)]+i=1I(ni1)2log[j=1nia(tij;β0)j=1nia(tij;βi)]}. (19)

4.2. Performance assessment through simulation study

A rigorous simulation study is attempted to compare the performance of ILRT with the GPQ-based test in terms of type-I error and power function.

4.2.1. Type-I error rate

The study is carried out under the following fairly representative set of parametric combinations: number of groups I=2(2)8; group shape parameters γ1=0.5,2, γi=γi1×h2, i=2,,I, incrementing h2 through 0.8, 1, 1.15; group median parameters βi=1, i=1,,I (kept fixed based on a pilot study showing no impact of ββ on the observed sizes under H0), and ni=5 and ni=15, respectively, representing small and medium group sizes, i=1,,I.

Similar to the two-way setup, four GSP were considered: GSP-1: 100% (all) small size groups ( ni=5, i=1,,I), GSP-2: 75% small size and 25% medium size ( ni=15) groups, GSP-3: 50:50 proportion of small and medium size groups and the last one GSP-4: 25% small size and 75% medium size groups. The four VPS considered were VPS-1 (small equal shape parameters (<1)): γ1=0.5; h2=1, VPS-2 (large shape under large size groups): γ1=0.5; h2=1.15, VPS-3 (small shape under large size groups): γ1=2; h2=0.8 and VPS-4 (large equal shape parameters (>1)): γ1=2; h2=1. As before, the set of all parametric combinations resulting from the above settings is denoted by Q in the sequel.

Based on 5000 simulations, type-I errors of ILRT, LRT, GPQ and Delta method are simulated for each of the parametric combinations in Q. For GPQ, the type-I error is estimated as the proportion of the simulated 5000 P-values falling below the nominal level of α. A single P-value was calculated based on 1000 GPQs. Commonly used 5% level of significance was used. The box plots of simulated type-I errors plotted in Figure 6 revealed uniform performance of tests over Q, differing across the tests, namely, conservative sizes of GPQ, well-concentrated close to nominal sizes of ILRT and liberal (large) sizes of LRT and Delta method.

Figure 6.

Figure 6.

Simulated type-I errors of tests under one-way setup.

Thus, LRT and Delta method being not usable with respect to size criterion is not included in the further power study presented next.

4.2.2. Power comparison

Powers were simulated based on 5000 simulations for the aforementioned tests for the same parametric setup Q. Additionally, the median parameter vector ββ was systematically varied taking β1=1 and βi=βi1×h1, i=2,,I, incrementing h1 through h1=1(.02)1.1;1.2(0.2)3 and additionally by h1=5(5)30,40,50 under two groups so as to acquire reasonable power. Again, h1=1 corresponds to the null hypothesis.

The power functions (as a function of increments h1) for I = 2, 4, 6 and 8 groups are displayed in Figure 7. Here too, the panel headings of each sub-figure are the pairs (VPS, GSP). For two groups, GSP-2 and 4 are not valid as the partition can be only 50:50 or 100 %. Notable observations revealed in the power comparison are reported below.

Figure 7.

Figure 7.

Simulated power functions under one-way setup.

4.3. Observations based on power functions

ILRT performs excellent under two situations: (i) small size groups, irrespective of the shape parameter pattern in the groups and (ii) under larger values of shape parameter for all groups. In all other situations, both tests perform almost similar.

5. Examples

The three real data sets; two under the two-way and one under the one-way setup highlight the contribution of the ILRT tests developed here in industrial applications to arrive at more correct decisions. The model adequacy checks attempted by employing two graphical analyses, namely the quantile residual plots suggested by Dunn and Smyth [2] and interval plots of residuals, as well as the AIC and BIC criteria strongly support in favor of the decision given by ILRT. Matlab codes for data analysis are provided as supplementary material.

5.1. Example 1. Two-way setup: epitaxial layer thickness

The data are taken from Montgomery [5], where the epitaxial layer thickness (measured in μm) is studied to examine the impact of two factors, namely, deposition time (A) and arsenic flow rate (B), each one at two levels. Each of the resulting four combinations is replicated four times. The summary statistics sample median ( βˆij) and shape parameter ( γˆij), i=1,,I, j=1,,J are reported in Table 1. The researchers were interested in reducing the variability in the thickness of the layer by keeping the average thickness level as close to the target level as possible.

Table 1.

Example 1. Summary sample statistics for four treatment combinations.

Cell number (i,j) (1, 1) (1, 2) (2, 1) (2, 2)
nij 4 4 4 4
Median (scale) ( βˆij) 14.4913 13.9213 14.8247 14.7874
Shape parameter ( γˆij) 0.0632 0.0048 0.0030 0.0148

Often, practitioners adopt a normal-based F-test with homogeneous variances by ignoring the non-negative feature and/or skewed nature of the response distribution. This popularly used analysis employed on the present data picked none of the two factors as having a significant impact on the epitaxial layer thickness. The data are also analyzed using the BS-based ILRT developed in Section 2 which seems more appropriate here on account of the non-negative nature of the response. The later was also validated by model adequacy checks described in the next paragraph. As noted earlier, this analysis is equivalent to testing the impact of factors on the cell medians, that is more relevant for a skewed distribution like BS. Results of analyses are reported in Table 2, where the ILRT strongly picked out the deposition time (factor A) as an influential factor, which the conventional F-based analysis was incapable to pick.

Table 2.

Example 1. Data analysis results: two-way setup.

  P-values for the test for effects        
Tests A B AB P-values for normality of residuals a AIC BIC MSE
Conventional F-test 0.0600 0.2830 0.3390 0.0110 33.5024 35.0476 0.3701
BS ILRT a 0.0089 0.6585 1.0000 0.4790 1.3391 6.7472 0.3173

aQuantile residuals are used.

To assess the adequacy of the model supported by ILRT, the quantile residuals (rq) are obtained as, rqijk=Φ1{F(tijk,μˆ,αˆi,γˆij)}; i=1,,I, j=1,,J and k=1,,nij, where F() is the cumulative distribution function (CDF) of BS distribution and Φ() is the standard normal CDF. Since the model has picked out a single significant factor, the estimates μˆ,αˆi are taken as the maximizers of the log-IL given in Equation (9) involving single factor and γˆij are the regular MLE's of the group shape parameters computed holding the parameters μ,αi fixed at μˆ,αˆi. For the model involving no significant factors supported by the F-test, rijk=(tijktˆijk); i=1,,I, j=1,,J and k=1,,nij. tˆijk=14.514; i=1,,I, j=1,,J and k=1,,nij being the overall mean of all observations. The P-value (0.4790) of the normality test for quantile residuals under ILRT based on BS (Figure 8(a)) distribution strongly supports the validity of the BS assumption for the data as well as the model proposed by ILRT. Furthermore, the almost uniform appearance of the interval plot (Figure 9(a)) of the residuals grouped cell-wise over the four cells (renumbered as (1,1)=1,(1,2)=2,(2,1)=3 and (2,2)=4) where each individual interval covers zero, under the BS model goes in the favor of the model given by the ILRT indicating factor A significantly affects the epitaxial layer thickness. Note that the similar plot for F-based residuals indicates heterogeneous variances for the residuals and the middle two cells do not cover zero, indicating that their mean is different from zero. Furthermore, the smaller values of AIC, BIC and MSE reported in Table 2 for BS more strongly support in favor of the BS-based model, against the F-based model with no significant factors.

Figure 8.

Figure 8.

Example 1. Normal probability plots of residuals: two-way setup. (a) Quantile residuals under ILRT based on BS. (b) Residuals under F-test based on normal.

Figure 9.

Figure 9.

Example 1. 95% confidence interval plots of the mean of residuals for each treatment combination: two-way setup. (a) Quantile residuals under ILRT based on BS. (b) Residuals under F-test based on normal.

The main effect plot of deposition time (factor A) plotted in Figure 10 further suggests that epitaxial layer thickness increases as the deposition time increases. In the plot, the range of epitaxial layer thickness under both levels of deposition time falls in the allowable range of 14–15 μm. However, the main aim was to reduce the variability of the layer while being close to the target level. As the smaller level of deposition time controls the mean level within the target range, the smaller level, that is 10 min of deposition time is recommended to insure smaller median parameters and hence smaller variability. The level of arsenic flow rate can be arbitrarily chosen, as it does not produce any significant impact on the median.

Figure 10.

Figure 10.

Example 1. Main effect plot of deposition time factor under two-way model.

5.2. Example 2. One-way setup: failure times of Kevlar 49 fiber

The test procedure under one-way setup is illustrated with a real data set from Leiva [3]. The response variable is the failure time (measured in hours) of Kevlar 49 fiber subject to various stress levels (measured in MPa) and spools. Eight spools at a constant stress level 29.7 are considered for the present analysis. Table 3 presents the summary statistics, sample median ( βˆi) and shape parameter ( γˆi), i=1,,8.

Table 3.

Example 2. Summary sample statistics for eight spools.

Treatment number (i) 1 2 3 4 5 6 7 8
ni 4 8 4 5 4 3 8 3
Median (scale) ( βˆi) 767.8610 21.0540 26.8041 982.8460 45.5263 28.4913 11.8251 309.9670
Shape parameter ( γˆi) 0.3498 1.5816 0.8882 0.7868 1.5224 1.4335 1.1013 0.9032

The data are analyzed as a one-way setup for the median under BS distribution assuming unequal shape parameters. ILRT (Equation (19)) is employed with the multiplicative corrective adjustment given in Equation (18). The P-value for the GPQ test was computed based on 100,000 samples. A conventional F-test is also employed on the data. The results of the data analysis are reported in Table 4.

Table 4.

Example 2. Data analysis results: one-way setup.

Test P-values for testing equality of eight spools P-values for normality of residuals a AIC BIC MSE
Conventional F-test 0 0.0005 556.4478 573.0834 55116
ILRT a 0.0108 0.1210 452.6069 479.2239 55192
GPQ a 0.1605 0.008 486.8273 501.7994 117160

aQuantile residuals are used.

The ILRT and F-test indicate a significant impact of spools on the fiber life while the GPQ is not capable of identifying this impact. Note that this is a situation with large number of small sized groups. In the simulation study for power comparison considered in Section 4, this situation is represented in Figure 7 under I = 8 groups with the first column representing small sized groups. Thus, the present situation resembles first column of Figure 7 under I = 8 groups where ILRT is clearly an out-performer over GPQ indicating more trust in the results produced by ILRT over those of GPQ for this case. The main effect plot (Figure 11) indicates that the average failure time of fiber is maximum at spool-4.

Figure 11.

Figure 11.

Example 2. Main effect plot of spool under one-way model.

As before, for model adequacy checks, the quantile residuals are obtained as, rqij=Φ1{F(tij,βˆi,γˆi)}; i=1,,I, j=1,,ni, where F() CDF of BS distribution and Φ() is the standard normal CDF. For ILRT βˆi are obtained based on the log-IL given in Equation (16) and γˆi is the regular MLE computed holding the medians βi fixed at βˆi while for GPQ, GPQ-based estimates of parameters are used. As before, for conventional F-test rij=(tijtˆij); i=1,,I and j=1,,ni, where tˆij=t¯i., i=1,,I and j=1,,ni being the mean of ith group. The normal probability plots of residuals are plotted in Figure 12, which clearly indicate that the ILRT-based residuals are normal while others may not be so.

Figure 12.

Figure 12.

Example 2. Normal probability plots of residuals: one-way setup. (a) Quantile residuals based on maximum IL estimates of parameters under H1. (b) Quantile residuals based on GPQ-based estimates of parameters under H0. (c) Residuals under F-test based on normal distribution with unequal treatment means.

On account of AIC, BIC and P-value for normality of quantile residuals, the model and hence the parameter estimates offered by ILRT seem to be more reliable. Here too, the uniform appearance of the interval plots (Figure 13(a)) without any trend under the H1 model also goes in favor of the decision given by the ILRT indicating significant differences among the spools. Note also that the GPQ-based residuals indicate non-zero mean for spool numbered 1, 2, 4 and 7.

Figure 13.

Figure 13.

Example 2. Interval plots of residuals: one-way setup. (a) Quantile residuals based on maximum IL estimates of parameters under H1. (b) Quantile residuals based on GPQ-based estimates of parameters under H0. (c) Residuals under F-test based on normal distribution with unequal treatment means.

5.3. Example 3. Two-way setup: delivery time data

The data are taken from Montgomery [5], where the engineer is interested to test the impact of two different types of 32-ounce bottles (glass (level-1) and plastic (level-2)) (factor A) on the time to deliver 12-bottle cases of the product. Two workers (factor B) are used to move 40 cases of product 50 ft on a standard type of hand truck and stacking the cases in a display to perform a task. Each treatment combination is replicated four times, and the delivery time is recorded. Sample medians and shape parameters of the resulting data under BS distribution assumption are tabulated in Table 5.

Table 5.

Example 3. Summary sample statistics for four treatment combinations.

Cell number (i,j) (1, 1) (1, 2) (2, 1) (2, 2)
nij 4 4 4 4
Median (scale) ( βˆij) 4.9968 5.9632 4.4666 4.9075
Shape parameter ( γˆij) 0.0164 0.0805 0.0615 0.0450

The data are analyzed using conventional normal-based F-test with homogeneous variances and also using the BS-based ILRT developed in Section 2. The results are assessed by model adequacy checks. Table 6 shows the results of the analysis, which show that both tests identified both factors as influential factors, and no significant interaction. However, it is notable that the model parameter estimates given by the two tests are different.

Table 6.

Example 3. Data analysis results: two-way setup.

Tests P-values for the test for effects        
  A B AB P-value for normality of residuals a AIC BIC MSE
Conventional F-test 0.0010 0.0020 0.1460 0.0260 22.3679 27.0035 0.1119
BS ILRT 0.0095 0.0194 0.1539 0.1250 17.7571 24.7104 0.1288

aQuantile residuals are used.

To assess the adequacy of the model supported by ILRT, the quantile residuals are obtained as before, rqijk=Φ1{F(tijk,μˆ,αˆi,βˆj,γˆij)}; i=1,,I, j=1,,J and k=1,,nij. Since the interaction term does not seem to be significantly different from zero, the estimates μˆ,αˆi, βˆj are taken as the maximizers of the log-IL given in Equation (4) and estimates of the shape parameter γˆij are the regular MLE's of the group shape parameters computed holding the parameters μ,αi,βj fixed at μˆ,αˆi,βˆj; i=1,,I,j=1,,J. For the model supported by the F-test, the residuals are rijk=(tijktˆijk); where tˆijk=t¯ai..+t¯a.j.t¯a, referring to the notation developed in Section 2.1, i=1,,I, j=1,,J and k=1,,nij. The P-value (0.1250) of the normality test for quantile residuals under ILRT based on BS (Figure 14(a)) distribution strongly supports the validity of the BS assumption for the data as well as the model parameter estimates obtained under ILRT. Furthermore, the almost uniform length and appearance of the interval plot (Figure 15(a)) of the residuals grouped cell-wise over the four cells, where each individual interval covers zero, under the BS model goes in the favor of the model estimates given by the ILRT indicating factors A and B significantly affect the delivery time. Similar plot for F-based residuals indicates heterogeneous variances for the residuals. Furthermore, the smaller values of AIC and BIC reported in Table 6 under BS-based model more strongly support in favor of the BS-based model, against the F-based model.

Figure 14.

Figure 14.

Example 3. Normal probability plots of residuals: two-way setup. (a) Quantile residuals under ILRT based on BS. (b) Residuals under F-test based on normal.

Figure 15.

Figure 15.

Example 3. 95% confidence interval plots of the mean of residuals for each treatment combination: two-way setup. (a) Quantile residuals under ILRT based on BS. (b) Residuals under F-test based on normal.

The main effect plot of two worker (factor B) and bottle type (factor A) plotted in Figure 16 further suggests that delivery time required for worker 2 and plastic bottle is smaller.

Figure 16.

Figure 16.

Example 3. Main effect plots of factors A and B.

6. Concluding remarks

The proposed ILRT provides a uniformly satisfactory option for two-way and one-way setup under BS distributed response variable commonly encountered in industries. The multiway generalization of the test is straightforward, and an attempt is being made to extend the test to other non-normal response variables namely when the data are Gamma, or Weibull distributed, after employing the close to-normal transformations suggested in Kulkarni and Powar [11], Kulkarni and Powar [12], respectively as well as under censoring themes, an area that is very less addressed in the literature in spite of its wide applicability. For one-way model, the ILRT developed here is observed to prominently outperform its GPQ-based peer test under almost all of scenarios. ILRT has an added advantage of having less computational efforts making it more useful for practitioners not using advanced software. The analysis of real data sets clearly highlights the need of ILRT in industrial applications.

Supplementary Material

Supplemental Material

Acknowledgments

The authors are sincerely thankful to three anonymous referees, associate editor and editor for careful reading and insightful comments that greatly improvement of this manuscript.

Funding Statement

The authors are very much thankful to National Board for Higher Mathematics (NBHM), Department of Atomic Energy (DAE), India, under the grant sanction letter no. 2/48(34)/2016/NBHM(RP)/R&D II/4531 dated 31/03/2017, Department of Science and Technology, New Delhi, through FIST scheme (SR/FST/MSI-103 dated 18/11/2015) and UGC, New Delhi under Special Assistance Programme (SAP) (F.520/8/DRS-I/2016(SAP-I)) for financial support for this research work.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  • 1.Berger J.O., Liseo B., and Wolpert R.L., Integrated likelihood methods for eliminating nuisance parameters, Statist. Sci. 14 (1999), pp. 1–28. [Google Scholar]
  • 2.Dunn P.K. and Smyth G.K., Randomized quantile residuals, J. Comput. Graph. Statist. 5 (1996), pp. 236–244. [Google Scholar]
  • 3.Leiva V., The Birnbaum–Saunders Distribution, Academic Press, Amsterdam, 2015. [Google Scholar]
  • 4.Lemonte A.J., Cordeiro G.M., and Moreno-Arenas G., Improved likelihood-based inference in Birnbaum–Saunders nonlinear regression models, Appl. Math. Model. 40 (2016), pp. 8185–8200. [Google Scholar]
  • 5.Montgomery D.C., Design and Analysis of Experiments, John Wiley & Sons, New York, 2017. [Google Scholar]
  • 6.Niu C., Guo X., Xu W., and Zhu L., Comparison of several Birnbaum–Saunders distributions, J. Stat. Comput. Simul. 84 (2014), pp. 2721–2733. [Google Scholar]
  • 7.Rieck J.R. and Nedelman J.R., A log-linear model for the Birnbaum–Saunders distribution, Technometrics 33 (1991), pp. 51–60. [Google Scholar]
  • 8.SenGupta A. and Kulkarni H.V., Universal and efficient tests for homogeneity of mean directions of circular populations, Statist. Sinica 30 (2020), pp. 1995–2021. [Google Scholar]
  • 9.Kulkarni H.V. and Patil S.M., Uniformly implementable small sample integrated likelihood ratio test for one-way and two-way ANOVA under heteroscedasticity and normality, AStA Adv. Stat. Anal. 105 (2021), pp. 273–305. [Google Scholar]
  • 10.Kulkarni H.V. and SenGupta A., An efficient test for homogeneity of mean directions on the hyper-sphere, Int. Stat. Rev. 90 (2021), pp. 1–43. [Google Scholar]
  • 11.Kulkarni H.V. and Powar S.K., A new method for interval estimation of the mean of the Gamma distribution, Lifetime Data Anal. 16 (2010), pp. 431–447. [DOI] [PubMed] [Google Scholar]
  • 12.Kulkarni H.V. and Powar S.K., A simple normal approximation for Weibull distribution with application to estimation of upper prediction limit, J. Probab. Stat. 2011 (2011), p. 863274. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

Articles from Journal of Applied Statistics are provided here courtesy of Taylor & Francis

RESOURCES