Analysis of medians under two-way model with and without interaction for Birnbaum–Saunders distributed response

S M Patil; H V Kulkarni

doi:10.1080/02664763.2022.2078798

. 2022 Jun 8;50(13):2717–2738. doi: 10.1080/02664763.2022.2078798

Analysis of medians under two-way model with and without interaction for Birnbaum–Saunders distributed response

S M Patil ¹, H V Kulkarni ^1,^CONTACT

PMCID: PMC10503464 PMID: 37720248

Abstract

The Birnbaum–Saunders (BS) distribution, well-known as the fatigue-life distribution, has been used in numerous disciplines ranging from engineering to medical sciences. In this article, we develop a test for analysis of medians for BS distributed response to assess the impact of two interacting factors on the median, where no test is presently available. The proposed integrated likelihood ratio test (ILRT) eliminates the nuisance shape parameters by integrating them out. The second-order accurate asymptotic chi-square distribution of ILRT is derived. An in-depth simulation study strongly supports its excellent performance even under small group sizes. Furthermore, ILRT developed under the one-way model is found uniformly superior over its peers, is straightway extendable under general multiway setup, and has potential to be extended to other non-normal response variables. Its genuine need in industry, where non-normal responses are commonly encountered, is highlighted through analysis of three real data sets: ILRT strongly picked out the deposition time as influential factor in epitaxial layer experiment, revealed significant impact of spools on fiber life for the failure times of Kevlar 49 fiber data, and gave more accurate parameter estimates in delivery time data experiment, as assessed by various model adequacy tools, where its competitors failed to deliver desired results.

Keywords: Homogeneity of medians, one-way setup, two-way setup with and without interaction, small group sizes, skewed data

1. Introduction

The Birnbaum–Saunders (BS) distribution is widely used in the context of material fatigue and reliability applications, also known as fatigue-life distribution. It is used in various situations where the accumulation of a particular factor forces a quantifiable characteristic to exceed a critical threshold. For example, migration of metallic flaws in nano-circuits due to heat in a computer chip, accumulation of harmful substances in the lungs from air pollution, the occurrence of natural disasters such as earthquakes and tsunamis, among many other situations. Recently, its use is not limited to analyze ‘fatigue’ or ‘cumulative damage’ type areas from where it was originated but also has found profound applications in business, economics, finance, industry, insurance, nutrition, psychology, quality control, among many other disciplines and is acquiring increasing popularity. For more details see Leiva [3], among many other references.

One-way and two-way analyses are commonly used to analyze data generated from industries and many other disciplines, where the data are most often likely to be non-normally distributed. More often such data are non-negative and have a lifetime related probability distribution, where there is an acute paucity of literature related to two and multiway setup. Often the non-negative and/or skew nature of the data is totally ignored and normal-based methods are blindly used, leading to a possibility of incorrect decisions. Section 5 demonstrates analyses of three real data sets related to engineering, where the results based on F-test were not reliable. Thus, there seems to be an absolute need for developing correct statistical procedures for handling non-normal ANOVA problems. In spite of the fact that BS distribution is widely used in various disciplines, the two-way ANOVA setup under BS is absolutely un-handled in the literature, whereas there are only a few ways available in a one-way setup. Thus, the present work seems to be an attempt to fill this gap in the literature.

Noting that the mean and standard deviation of the BS distribution are directly proportional to the scale parameter, as well as the scale parameter of the distribution equals the median, assessing the impact of external factors on the median or equivalently on the scale is an important problem in industrial applications, leading to a factorial model for the scale parameter which henceforth is refereed to as the median. Presently no test is available in the literature under this scenario, while under the one-way model, two tests by Niu et al. [6] based on generalized pivotal quantities (GPQ) and Delta method are available. Regression models have been handled through the log-transformation on the BS distributed response, see for example Rieck and Nedelman [7] and Lemonte et al. [4] among others.

There is an acute paucity of literature on the analysis of factorial data with non-normal response variables. A few existing tests like generalized linear models make use of maximum likelihood-based procedures which work better under relatively larger sample sizes while in industry there could be many situations where gathering more data would be probably expensive.

The present work develops a simple test for the two-way setup with and without interaction for the median, based on the idea of eliminating nuisance shape parameters through the use of integrated likelihood (IL). An integrated likelihood ratio test (ILRT) is constructed and is shown to be asymptotically chi-square distributed up to the second-order. Simple ad-hoc multiplicative adjustments for small group sizes have been developed leading to improved performance under small samples rendering the ILRT uniformly usable. A simulation study shows that ILRT controlled the type-I error remarkably and uniformly well and attains very good power under varying realistic situations. A similar test under the one-way model is also developed and found to outperform the existing GPQ and Delta method-based tests. For similar work under normally distributed data, we refer to Kulkarni and Patil [9], SenGupta and Kulkarni [8], and Kulkarni and SenGupta [10] in the context of directional data.

The remainder of this article is organized as follows. Section 2 develops the ILRT under a two-way with and without interaction model and its asymptotic distributions. Section 3 is devoted to a thorough performance assessment of the ILRT based on a rigorous simulation study. Section 4 develops its version under a one-way model and reports its performance assessment. A rigorous analysis with several model adequacy checks of three real data sets from industrial applications presented in Section 5 highlights the advantages of the proposed test in industry. The paper is concluded with a section of conclusions.

2. The proposed ILRT

Let $L (ψ ψ, λ λ)$ be a likelihood function under consideration, $ψ ψ$ being the vector of parameters of interest and $λ λ \in Λ Λ$ the vector of nuisance parameters. An IL is of the form

\bar{L} (ψ ψ) = \int_{Λ Λ} L (ψ ψ, λ λ) \cdot Π (λ λ | ψ ψ) d λ λ,

where Π is a non-negative weight function on $Λ Λ$ making the above integral convergent for every fixed $ψ ψ .$ $\bar{L}$ depends on the data and $ψ ψ$ , and can be used as a standard likelihood function for all likelihood-based inference procedures.

In the following, we develop analysis of medians, a version of ANOVA for the mean parameter under two-way with interaction setup. The medians are assumed to be influenced by two factors while the shape parameters are assumed to be unaffected.

2.1. Two-way setup with interaction

The probability density function (pdf) of a BS distribution (see Leiva [3]) under two-way setup with interaction form for the median $ψ_{i j}$ is

\begin{aligned} f (t_{i j k}, ψ_{i j}, γ_{i j}) & = \frac{1}{2 \sqrt{2 π} ψ_{i j} γ_{i j}} A (t_{i j k}; ψ_{i j}) e x p [- \frac{1}{2 γ_{i j}^{2}} a (t_{i j k}; ψ_{i j})], \\ ψ_{i j} = (μ + α_{i} + β_{j} + (α β)_{i j}) > 0, \\ t_{i j k} > 0, γ_{i j} > 0, i = 1, \dots, I; j = 1, \dots, J; k = 1, \dots, n_{i j}, \end{aligned}

where $A (t_{i j k}; ψ_{i j}) = (ψ_{i j} / t_{i j k})^{1 / 2} + (ψ_{i j} / t_{i j k})^{3 / 2}$ , $a (t_{i j k}; ψ_{i j}) = (t_{i j k} / ψ_{i j}) + (ψ_{i j} / t_{i j k}) - 2$ , μ is the unknown overall effect, $α_{i}$ is the unknown effect of the ith level of first factor, $β_{j}$ is the unknown effect of the jth level of second factor, $(α β)_{i j}$ is the unknown interaction effect of the ith level of first factor and jth level of second factor and $γ_{i j}$ are unknown shape parameters.

Let $T_{i j 1}, \dots, T_{i j n_{i j}}$ be the observations from the $(i, j)$ th cell following a $B S (ψ_{i j}, γ_{i j})$ distribution, $i = 1, \dots, I$ , $j = 1, \dots, J$ . Let $N = \sum_{i = 1}^{I} \sum_{j = 1}^{J} n_{i j}$ ; $n_{i .} = \sum_{j = 1}^{J} n_{i j}$ , $n_{. j} = \sum_{i = 1}^{I} n_{i j}$ , ${\bar{T}}_{a} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} \sum_{k = 1}^{n_{i j}} T_{i j k} / N$ and ${\bar{T}}_{h} = (\sum_{i = 1}^{I} \sum_{j = 1}^{J} \sum_{k = 1}^{n_{i j}} T_{i j k}^{- 1} / N)^{- 1}$ be the overall sample size, arithmetic and harmonic mean, respectively, ${\bar{T}}_{a i . .} = \sum_{j = 1}^{J} \sum_{k = 1}^{n_{i j}} T_{i j k} / n_{i .}$ , ${\bar{T}}_{h i . .} = (\sum_{j = 1}^{J} \sum_{k = 1}^{n_{i j}} T_{i j k}^{- 1} / n_{i .})^{- 1}$ be the arithmetic and harmonic means of ith row, with similar counter parts for column, ${\bar{T}}_{a i j .} = \sum_{k = 1}^{n_{i j}} T_{i j k} / n_{i j}$ , ${\bar{T}}_{h i j .} = (\sum_{k = 1}^{n_{i j}} T_{i j k}^{- 1} / n_{i j})^{- 1}$ be the arithmetic and harmonic means of $(i, j)$ th cell and ${\bar{t}}_{a}$ , ${\bar{t}}_{h}$ , ${\bar{t}}_{a i . .}$ , ${\bar{t}}_{h i . .}$ , ${\bar{t}}_{a . j .}$ , ${\bar{t}}_{h . j .}$ , ${\bar{t}}_{a i j .}$ , ${\bar{t}}_{h i j .}$ denote their observed values, respectively. The hypothesis corresponding to the test for no interaction is

\begin{aligned} H_{0} : (α β)_{11} = (α β)_{12} = \dots = (α β)_{I J} v / s H_{1} : (α β)_{i j} \neq (α β)_{i^{'} j^{'}} \\ for some i \neq i^{'}, j \neq j^{'} . \end{aligned}

(1)

Let $t t = (t_{111}, \dots, t_{11 n_{11}}, \dots, t_{I 11}, \dots, t_{I J n_{I J}})$ be the vector of all observations. Here $(μ, α_{1}, \dots, α_{I}, β_{1}, \dots, β_{J}, (α β)_{11}, \dots, (α β)_{I J})$ is the vector of unknown parameters of interest with $ψ_{i j} = μ + α_{i} + β_{j} + (α β)_{i j}$ ; $ψ ψ = (ψ_{11}, \dots, ψ_{I J})$ and $γ γ = (γ_{11}, \dots, γ_{I J})$ is the vector of unknown nuisance parameters. Berger et al. [1] recommend the uniform prior taking a constant value 1 over the entire nuisance parameter space as an apparent option when no prior information is available about the nuisance parameters. In the present case, an added advantage is that it gives a closed-form expression for the resulting IL.

The likelihood function of $(ψ ψ, γ γ | t t)$ is

\begin{aligned} L (ψ ψ, γ γ | t t) & = \prod_{i = 1}^{I} \prod_{j = 1}^{J} {{(2 \sqrt{2 π})}^{- n_{i j}} (ψ_{i j} γ_{i j})^{- n_{i j}} \prod_{k = 1}^{n_{i j}} A (t_{i j k}; ψ_{i j}) \\ e x p [- \frac{1}{2 γ_{i j}^{2}} \sum_{k = 1}^{n_{i j}} a (t_{i j k}; ψ_{i j})]}, ψ_{i j}, γ_{i j} > 0. \end{aligned}

(2)

Integration of the likelihood function with respect to the uniform prior over $[0, \infty)$ for all $γ_{i j}$ , $i = 1, \dots, I, j = 1, \dots, J$ yields

\begin{aligned} \bar{L} (ψ ψ | t t) & = \prod_{i = 1}^{I} \prod_{j = 1}^{J} {\bar{L}}_{i j} (ψ_{i j}) \\ \propto \prod_{i = 1}^{I} \prod_{j = 1}^{J} {{(\frac{1}{ψ_{i j}})}^{n_{i j}} \prod_{k = 1}^{n_{i j}} A (t_{i j k}; ψ_{i j}) {[\sum_{k = 1}^{n_{i j}} a (t_{i j k}; ψ_{i j})]}^{- (n_{i j} - 1) / 2}} . \end{aligned}

(3)

The log-IL functions under $H_{0}$ and the whole parameter space are denoted by ${\bar{l}}_{0}$ and ${\bar{l}}_{1}$ , respectively, and are given by

{\bar{l}}_{0} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- n_{i j} l o g (ψ_{i j 0}) + \sum_{k = 1}^{n_{i j}} l o g [A (t_{i j k}; ψ_{i j 0})] - \frac{(n_{i j} - 1)}{2} l o g [\sum_{k = 1}^{n_{i j}} a (t_{i j k}; ψ_{i j 0})]}

(4)

and

{\bar{l}}_{1} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- n_{i j} l o g (ψ_{i j 1}) + \sum_{k = 1}^{n_{i j}} l o g [A (t_{i j k}; ψ_{i j 1})] - \frac{(n_{i j} - 1)}{2} l o g [\sum_{k = 1}^{n_{i j}} a (t_{i j k}; ψ_{i j 1})]},

(5)

where $ψ_{i j 0} = (μ + α_{i} + β_{j})$ and $ψ_{i j 1} = (μ + α_{i} + β_{j} + (α β)_{i j})$ .

Let ${\bar{l}}_{0}$ be maximized at $δ δ = ({\hat{μ}}_{0}, {\hat{α α}}_{0}, {\hat{β β}}_{0})$ . Its value can be obtained by maximizing right-hand side (rhs) of Equation (4) numerically. Any numerical algorithm for maximization (equivalently, minimization of the function after reversing its sign) needs an initial starting value to start with, for which the choice $(\sqrt{{\bar{t}}_{a} \times {\bar{t}}_{h}}, \sqrt{{\bar{t}}_{a 1..} \times {\bar{t}}_{h 1..}}, \dots, \sqrt{{\bar{t}}_{a I . .} \times {\bar{t}}_{h I . .}}, \sqrt{{\bar{t}}_{a .1 .} \times {\bar{t}}_{h .1 .}}, \dots, \sqrt{{\bar{t}}_{a . J .} \times {\bar{t}}_{h . J .}})$ is found to work very well. Similarly ${\bar{l}}_{1}$ is maximized at $({\hat{μ}}_{1}, {\hat{α α}}_{1}, {\hat{β β}}_{1}, (\hat{α α β β})_{1})$ and its value can be computed by maximizing rhs of (5) where initial value suggested is either $(δ δ, \sqrt{{\bar{t}}_{a 11.} \times {\bar{t}}_{h 11.}}, \dots, \sqrt{{\bar{t}}_{a I J .} \times {\bar{t}}_{h I J .}})$ or $(\sqrt{{\bar{t}}_{a} \times {\bar{t}}_{h}}, \sqrt{{\bar{t}}_{a 1..} \times {\bar{t}}_{h 1..}}, \dots, \sqrt{{\bar{t}}_{a I . .} \times {\bar{t}}_{h I . .}}, \sqrt{{\bar{t}}_{a .1 .} \times {\bar{t}}_{h .1 .}}, \dots, \sqrt{{\bar{t}}_{a . J .} \times {\bar{t}}_{h . J .}}, \sqrt{{\bar{t}}_{a 11.} \times {\bar{t}}_{h 11.}}, \dots, \sqrt{{\bar{t}}_{a I J .} \times {\bar{t}}_{h I J .}}) .$

Though analytical proof of the existence of a unique maximum likelihood estimate (MLE) is not tractable at stage, thousands of sample data sets were randomly generated and observed to give unique MLE's both under ${\bar{l}}_{0}$ as well as ${\bar{l}}_{1}$ , and practically no difficulty was found in employing the test developed here.

The IL ratio is

\begin{aligned} \bar{λ} & = \frac{sup_{ψ ψ \in Θ_{0} Θ_{0}} \bar{L} (ψ ψ | t t)}{sup_{ψ ψ \in Θ Θ} \bar{L} (ψ ψ | t t)} \\ = \prod_{i = 1}^{I} \prod_{j = 1}^{J} {\frac{[{(\frac{1}{{\hat{ψ}}_{i j 0}})}^{n_{i j}} \prod_{k = 1}^{n_{i j}} A (t_{i j k}; {\hat{ψ}}_{i j 0}) {[\sum_{k = 1}^{n_{i j}} a (t_{i j k}; {\hat{ψ}}_{i j 0})]}^{- (n_{i j} - 1) / 2}]}{[{(\frac{1}{{\hat{ψ}}_{i j 1}})}^{n_{i j}} \prod_{k = 1}^{n_{i j}} A (t_{i j k}; {\hat{ψ}}_{i j 1}) {[\sum_{k = 1}^{n_{i j}} a (t_{i j k}; {\hat{ψ}}_{i j 1})]}^{- (n_{i j} - 1) / 2}]}}, \end{aligned}

(6)

where $Θ Θ = {ψ ψ : ψ_{i j} \in (0, \infty), i = 1, \dots, I, j = 1, \dots, J},$ $Θ_{0} Θ_{0}$ is the subset of $Θ Θ$ and ${\hat{ψ}}_{i j 0} = {\hat{μ}}_{0} + {\hat{α}}_{i 0} + {\hat{β}}_{j 0}$ , ${\hat{ψ}}_{i j 1} = {\hat{μ}}_{1} + {\hat{α}}_{i 1} + {\hat{β}}_{j 1} + (\hat{α β})_{i j 1}$ ; $i = 1, \dots, I, j = 1, \dots, J$ . The proposed ILRT statistic $- 2 \log \bar{λ}$ equals

\begin{aligned} T_{I L R T 1} & = 2 \sum_{i = 1}^{I} \sum_{j = 1}^{J} {n_{i j} l o g (\frac{{\hat{ψ}}_{i j 0}}{{\hat{ψ}}_{i j 1}}) + \sum_{k = 1}^{n_{i j}} l o g [\frac{A (t_{i j k}; {\hat{ψ}}_{i j 1})}{A (t_{i j k}; {\hat{ψ}}_{i j 0})}] \\ + \frac{(n_{i j} - 1)}{2} l o g [\frac{\sum_{k = 1}^{n_{i j}} a (t_{i j k}; {\hat{ψ}}_{i j 0})}{\sum_{k = 1}^{n_{i j}} a (t_{i j k}; {\hat{ψ}}_{i j 1})}]} . \end{aligned}

(7)

The asymptotic $χ^{2}$ distribution of $T_{I L R T 1}$ is stated in Theorem 2.1. The theme of the proof goes in line with that of the Theorem 1 of SenGupta and Kulkarni [8] and given as supplementary material.

Theorem 2.1

The second-order accurate asymptotic distribution of $T_{I L R T 1}$ is $χ_{(I - 1) (J - 1)}^{2}$ .

2.2. Two-way setup without interaction

The pdf of a BS distribution under two-way setup without interaction model is as in Section 2.1 except that the vector of unknown parameters of interest is $(μ, α_{1}, \dots, α_{I}, β_{1}, \dots, β_{J})$ with $ψ_{i j} = μ + α_{i} + β_{j},$ $i = 1, \dots, I; j = 1, \dots, J$ ; $ψ ψ = (ψ_{11}, \dots, ψ_{I J})$ . The hypothesis of interest is to test

H_{0} : β_{1} = β_{2} = \dots = β_{J} v / s H_{1} : β_{j} \neq β_{j^{'}} for some j \neq j^{'} .

(8)

The expressions for the likelihood function L, the IL function $\bar{L}$ and the log-IL function ${\bar{l}}_{1}$ under $H_{1}$ are given exactly by Equations (2), (3) and (4), respectively, while under $H_{0}$ , we get

{\bar{l}}_{0} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} {- n_{i j} l o g (ψ_{i 0}) + \sum_{k = 1}^{n_{i j}} l o g [A (t_{i j k}; ψ_{i 0})] - \frac{(n_{i j} - 1)}{2} l o g [\sum_{k = 1}^{n_{i j}} a (t_{i j k}; ψ_{i 0})]},

(9)

where $ψ_{i 0} = (μ + α_{i})$ .

Here, ${\bar{l}}_{0}$ is maximized at $δ δ = ({\hat{μ}}_{0}, {\hat{α α}}_{0})$ . Its value can be obtained by maximizing rhs of Equation (9) numerically using an initial starting value $(\sqrt{{\bar{t}}_{a} \times {\bar{t}}_{h}}, \sqrt{{\bar{t}}_{a 1..} \times {\bar{t}}_{h 1..}}, \dots, \sqrt{{\bar{t}}_{a I . .} \times {\bar{t}}_{h I . .}})$ . Similarly ${\bar{l}}_{1}$ is maximized at $({\hat{μ}}_{1}, {\hat{α α}}_{1}, {\hat{β β}}_{1})$ and its value can be computed by maximizing rhs of (4) where initial value suggested is either $(δ δ, \sqrt{{\bar{t}}_{a .1 .} \times {\bar{t}}_{h .1 .}}, \dots, \sqrt{{\bar{t}}_{a . J .} \times {\bar{t}}_{h . J .}})$ or $(\sqrt{{\bar{t}}_{a} \times {\bar{t}}_{h}}, \sqrt{{\bar{t}}_{a 1..} \times {\bar{t}}_{h 1..}}, \dots, \sqrt{{\bar{t}}_{a I . .} \times {\bar{t}}_{h I . .}}, \sqrt{{\bar{t}}_{a .1 .} \times {\bar{t}}_{h .1 .}}, \dots, \sqrt{{\bar{t}}_{a . J .} \times {\bar{t}}_{h . J .}}) .$

The expressions of resulting IL ratio and the ILRT statistics say, $T_{I L R T 2}$ are same as those given in Equations (6) and (7), respectively, with estimates of median parameter replaced by

{\hat{ψ}}_{i j 0} = {\hat{μ}}_{0} + {\hat{α}}_{i 0}

(10)

and

{\hat{ψ}}_{i j 1} = {\hat{μ}}_{1} + {\hat{α}}_{i 1} + {\hat{β}}_{j 1}

(11)

are the MLE's of $ψ_{i j}$ under $H_{0}$ and whole parameter space, respectively. The asymptotic $χ^{2}$ distribution of $T_{I L R T 2}$ is stated in Theorem 2.2.

Theorem 2.2

The second-order accurate asymptotic distribution of $T_{I L R T 2}$ is $χ_{(J - 1)}^{2}$ .

The proof is exactly in line with the one of Theorem 2.1 and is omitted.

2.3. Assessment of convergence of type-I error rate through simulations

The type-I errors of ILRT are simulated for various common group sizes $n = 5 (5) 50$ (here and in the sequel the notation $a (b) c$ denotes the set of values of n starting with a, ending with c, with successive increments by the number b), overall effect $μ = 0.5, 1, 2, 3.5, 5$ ; ith row level $α_{1} = 0.5, 1, 2, 3.5, 5$ , $α_{i} = α_{i - 1} \times h_{2}$ , $i = 2, \dots, I$ ; incrementing $h_{2}$ through 0.5, 1, 1.5. By default, under H₀, under the without interaction setup, $β_{j} = 0.5, 1, 2, 3.5, 5$ , $j = 1, \dots, J$ while under the with-interaction setup, jth column level $β_{1} = 0.5, 1, 2, 3.5, 5$ , $β_{j} = β_{j - 1} \times h_{2}$ , $j = 2, \dots, J$ and (ij)th interaction effect $(α β)_{i j} = 0.5, 1, 2, 3.5, 5$ , $i = 1, \dots, I$ , $j = 1, \dots, J$ (and other parametric combinations as specified in the set $P$ given in Section 3.1 except GSP). The simulated type-I errors plotted in Figures 1 and 2 for with and without interaction setup, respectively, converge very well to the target level 0.05, indicating good conformation to the asymptotic $χ^{2}$ distribution specified by Theorems 2.1 and 2.2, for group sizes exceeding 5 and 30, respectively. Under smaller group sizes, the type-I errors are slightly smaller for the with-interaction setup and larger for the without-interaction setup than the desired level. Ad-hoc corrective adjustment factors suggested in the next subsection (Equations (12) and (13)) stabilized the type-I errors quite well to the desired level under small group sizes.

Figure 1. — Box plots portraying the rate of convergence of type-I errors of ILRT with common group size (n) under two-way with interaction setup.

Figure 2. — Box plots portraying the rate of convergence of type-I errors of ILRT with common group size (n) under two-way without interaction setup.

2.4. Corrective adjustments under small samples (two-way setup)

Let $T_{α}$ and $q_{α}$ be the αth quantiles of the test statistic T and its desired $χ^{2}$ distribution, respectively. Nothing that $P (T > T_{1 - α}) = α$ , and hence that $P [(q_{1 - α} / T_{1 - α}) T > q_{1 - α}] = α$ , it follows that the entity $q_{1 - α} / T_{1 - α}$ works as a corrective factor (cf) for adjusting the observed sizes to the desired level α under small group sizes. These adjustments are most often essential under small group sizes since the asymptotic distributions of likelihood-based tests work well under little larger group sizes.

The cf is calculated under different group sizes using the ratio of theoretical 95th quantiles of the desired $χ_{(I - 1) (J - 1)}^{2}$ and $χ_{(J - 1)}^{2}$ (note that these are $q_{1 - α}$ at 5 $%$ level) distribution to the simulated 95th quantiles of $T_{I L R T 1}$ of (7) and $T_{I L R T 2}$ of (7, 10 and 11) under $H_{0}$ , respectively, based on 2000 simulations for each of the parametric combinations specified in Section 2.3. The resulting cf was found to depend mainly on the common group size and is given in Equations (12) and (13) for two-way with and without interaction setup, respectively

\begin{aligned} c f = {\begin{cases} 1.0146 & f o r n < 10, \\ 1 & f o r n \geq 10, \end{cases} \end{aligned}

(12)

\begin{aligned} c f = {\begin{cases} 0.8440 & f o r n < 10, \\ 0.9596 & f o r 10 \leq n \leq 20, \\ 0.9800 & f o r 25 \leq n \leq 30, \\ 1 & f o r n > 30. \end{cases} \end{aligned}

(13)

The ILRT is suggested to be used with this cf to be employed individually for each group based on the distinct group size $n_{i j}$ in place of n in the expression for cf (resulting in $c f_{i j}$ say) giving rise to the adjusted statistic for two-way with and without interaction setup

\begin{aligned} T_{I L R T_a d j} & = 2 \sum_{i = 1}^{I} \sum_{j = 1}^{J} c f_{i j} \times {n_{i j} l o g (\frac{{\hat{ψ}}_{i j 0}}{{\hat{ψ}}_{i j 1}}) + \sum_{k = 1}^{n_{i j}} l o g [\frac{A (t_{i j k}; {\hat{ψ}}_{i j 1})}{A (t_{i j k}; {\hat{ψ}}_{i j 0})}] \\ + \frac{(n_{i j} - 1)}{2} l o g [\frac{\sum_{k = 1}^{n_{i j}} a (t_{i j k}; {\hat{ψ}}_{i j 0})}{\sum_{k = 1}^{n_{i j}} a (t_{i j k}; {\hat{ψ}}_{i j 1})}]} . \end{aligned}

The box plots of the simulated sizes of adjusted ILRT plotted in Figure 3(a,b) for two-way with and without interaction setup, respectively, are indicative of the improvement in the size performance of ILRT after adjustments. Throughout in the sequel, we use this adjusted version of ILRT and the word ‘ILRT’ is used to represent this adjusted ILRT throughout.

3. Performance assessment through simulation study

This section presents an exhaustive simulation study to test the performance of the proposed ILRT in terms of type-I error and power function under a fairly representative set of parametric combinations.

3.1. Type-I error rate

The parametric setup considered under two-way model for empirical assessment constitutes $I = 2 (1) 6$ row levels, $J = 2 (1) 6$ column levels, overall effect $μ = 0.5, 5$ , ith row effect $α_{1} = 0.5, 5$ , $α_{i} = α_{i - 1} \times h_{2}$ , $i = 2, \dots, I$ ; incrementing $h_{2}$ through 0.5, 1.5. By default, under $H_{0}$ , the column levels are same under the without interaction setup, and the common values are taken to be 0.5 and 5. Under the with-interaction setup, the column levels are set at $β_{1} = 0.5, 5$ , $β_{j} = β_{j - 1} \times h_{2}$ , $j = 2, \dots, J$ . For with-interaction setup, under $H_{0}$ , the interaction term is set at common values 0.5 and 5.

Based on the pattern of group sizes among the IJ cells, three group size patterns (GSP) were considered: GSP-1: $100 %$ (all) small size groups ( $n_{i j} = 5$ , $i = 1, \dots, I$ , $j = 1, \dots, J$ ), GSP-2: $50 %$ small size ( $n_{i j} = 5$ ) and $50 %$ medium size ( $n_{i j} = 15$ ) groups, GSP-3: $100 %$ (all) medium size groups ( $n_{i j} = 15$ ). Similarly based on the variability in shape parameters among the IJ groups, three variability patterns in shape (VPS) were considered: VPS-1: $100 %$ (all) groups with equal shape parameters ( $γ_{i j} = 1$ , $i = 1, \dots, I$ , $j = 1, \dots, J$ ), VPS-2: first $50 %$ groups with equal shape parameters ( $γ_{i j} = 0.5 (< 1)$ ) and last $50 %$ groups with equal shape parameters ( $γ_{i j} = 1.5 (> 1)$ ), VPS-3: first $50 %$ groups with equal shape parameters ( $γ_{i j} = 1.5$ ) and last $50 %$ groups with equal shape parameters ( $γ_{i j} = 0.5$ ). The set of all parametric combinations resulting from the above settings is denoted by $P$ in the sequel.

Type-I errors of ILRT are simulated for the parametric combinations in $P$ based on 2000 simulations. The commonly used $5 %$ level of significance was used. Figure 3( a,b) giving the box plots of the simulated sizes under two-way with and without interaction setup indicates well-concentrated type-I errors of ILRT around the nominal level.

3.2. Power performance

Based on 2000 simulations, the power functions were simulated for the same parametric setup $P$ as above. In addition, under two-way with-interaction setup, under the alternative hypothesis, the interaction factor $(α α β β)$ was systematically varied taking $(α β)_{11} = 0.5$ , $(α β)_{i j} = (α β)_{i (j - 1)} \times h_{1}$ for j>1 and $(α β)_{i j} = (α β)_{(i - 1) (j + 2)} \times h_{1}$ for j = 1, $i = 1, \dots, I$ , $j = 1, \dots, J$ ; incrementing $h_{1}$ through $h_{1} = 1 (0.2) 3; 3.5 (0.5) 5; 7$ . Note that $h_{1} = 1$ corresponds to the null hypothesis. The overall effect is set to be $μ = 0.5$ , ith row effect $α_{1} = 0.5$ ; $α_{i} = α_{i - 1} \times 0.5$ , $i = 2, \dots, I$ and jth column effect $β_{1} = 0.5$ ; $β_{j} = β_{j - 1} \times 0.5$ , $j = 2, \dots, J$ .

Figure 4 displays the power functions (as a function of $h_{1}$ ) under I = 2; J = 2 and I = 6; J = 2 as representative of all patterns represented in the simulation study. For I = 6; J = 2 increments were slightly altered to be $h_{1} = 1 (0.2) 2.6$ for a better visual display. The panel headings of each sub-figure are the pairs (VPS, GSP).

Similarly, under two-way without interaction setup, the column factor $β β$ was systematically varied taking $β_{1} = 0.5$ ; $β_{j} = β_{j - 1} \times h_{1}$ , $j = 2, \dots, J$ ; incrementing $h_{1}$ through $h_{1} = 1 (.02) 1.1; 1.2 (0.2) 3; 3.5 (0.5) 5; 7 (2) 15$ under the alternative. The overall effect and ith row effect were set as above. Figure 5 displays the power functions (as a function of $h_{1}$ ) under I = 2; J = 2 and I = 6; J = 6 as representative of all patterns represented in the simulation study. Here too, for I = 6; J = 6 increments were slightly altered to be $h_{1} = 1 (.02) 1.1; 1.2 (0.2) 2$ for a better visual display. As before, the panel headings of each sub-figure are the pairs (VPS, GSP).

Figure 5. — Power functions of ILRT under two-way without interaction setup. (a) Two row (I = 2) and two column (J = 2) levels. (b) Six row (I = 6) and six column (J = 6) levels.

Clearly, ILRT gives consistently satisfactory power under all parametric combinations considered and hence it can be uniformly used under all real-life situations.

ILRT-based homogeneity test under one-way setup can be similarly developed and is derived in the next section. Note that the test is equivalent to ANOVA for means under BS distribution when the shape parameters of all groups are the same.

4. ILRT under one-way setup

Consider the one-way setup with I groups, there being $n_{i}$ observations in ith group, $i = 1, \dots, I$ . The pdf of the jth observation in the ith group $t_{i j}$ is

\begin{aligned} f (t_{i j}, β_{i}, γ_{i}) & = \frac{1}{2 \sqrt{2 π} β_{i} γ_{i}} [A (t_{i j}; β_{i})] e x p [- \frac{1}{2 γ_{i}^{2}} a (t_{i j}; β_{i})], \\ t_{i j} > 0, β_{i}, γ_{i} > 0, i = 1, \dots, I; j = 1, \dots, n_{i}, \end{aligned}

where $A (t_{i j}; β_{i}) = (β_{i} / t_{i j})^{1 / 2} + (β_{i} / t_{i j})^{3 / 2}$ , $a (t_{i j}; β_{i}) = (t_{i j} / β_{i}) + (β_{i} / t_{i j}) - 2$ , $β_{i}$ is the median and $γ_{i}$ is the shape parameter.

Let $T_{i 1}, \dots, T_{i n_{i}}$ be the random observations forming the ith group, following $B S (β_{i}, γ_{i})$ distribution, $i = 1, \dots, I$ . Let $N = \sum_{i = 1}^{I} n_{i}$ , ${\bar{T}}_{a} = \sum_{i = 1}^{I} \sum_{j = 1}^{n_{i}} T_{i j} / N$ and ${\bar{T}}_{h} = (\sum_{i = 1}^{I} \sum_{j = 1}^{n_{i}} T_{i j}^{- 1} / N)^{- 1}$ be the overall sample size, arithmetic and harmonic mean, respectively, ${\bar{T}}_{a i .} = \sum_{j = 1}^{n_{i}} T_{i j} / n_{i}$ and ${\bar{T}}_{h i .} = (\sum_{j = 1}^{n_{i}} T_{i j}^{- 1} / n_{i})^{- 1}$ be the arithmetic and harmonic means of $i t h$ group and ${\bar{t}}_{a}$ , ${\bar{t}}_{h}$ , ${\bar{t}}_{a i}$ , ${\bar{t}}_{h i}$ denote their observed values, respectively. The hypothesis of interest is to test

H_{0} : β_{1} = β_{2} = \dots = β_{I} v / s H_{1} : β_{i} \neq β_{i^{'}} for some i \neq i^{'} .

(14)

The likelihood function of aggregate sample is

\begin{aligned} L (β β, γ γ | t t) & = \prod_{i = 1}^{I} {{(2 \sqrt{2 π})}^{- n_{i}} {(β_{i} γ_{i})}^{- n_{i}} \prod_{j = 1}^{n_{i}} A (t_{i j}; β_{i}) e x p [- \frac{1}{2 γ_{i}^{2}} \sum_{j = 1}^{n_{i}} a (t_{i j}; β_{i})]} \\ β_{i}, γ_{i} > 0, \forall i = 1, \dots, I . \end{aligned}

Here, $β β = (β_{1}, \dots, β_{I})$ is the vector of unknown parameters of interest and $γ γ = (γ_{1}, \dots, γ_{I})$ is the vector of unknown nuisance parameters. Similar to two-way setup, integration of the likelihood function with respect to the uniform prior yields

\bar{L} (β β | t t) = \prod_{i = 1}^{I} {\bar{L}}_{i} (β_{i}) \propto \prod_{i = 1}^{I} {{(\frac{1}{β_{i}})}^{n_{i}} \prod_{j = 1}^{n_{i}} A (t_{i j}; β_{i}) {[\sum_{j = 1}^{n_{i}} a (t_{i j}; β_{i})]}^{- (n_{i} - 1) / 2}},

and with the abuse of similar notation,

{\bar{l}}_{0} = \sum_{i = 1}^{I} {- n_{i} l o g (β_{0}) + \sum_{j = 1}^{n_{i}} l o g [A (t_{i j}; β_{0})] - \frac{(n_{i} - 1)}{2} l o g [\sum_{j = 1}^{n_{i}} a (t_{i j}; β_{0})]}

(15)

and

{\bar{l}}_{1} = \sum_{i = 1}^{I} {- n_{i} l o g (β_{i}) + \sum_{j = 1}^{n_{i}} l o g [A (t_{i j}; β_{i})] - \frac{(n_{i} - 1)}{2} l o g [\sum_{j = 1}^{n_{i}} a (t_{i j}; β_{i})]} .

(16)

In the similar context, ${\hat{β}}_{0}$ and $\hat{β} \hat{β}$ are maximizers of rhs of (15) and (16), respectively. For numerical maximization, the vector $(\sqrt{{\bar{t}}_{a} \times {\bar{t}}_{h}}, (\sqrt{{\bar{t}}_{a_{1} .} \times {\bar{t}}_{h_{1} .}}, \dots, \sqrt{{\bar{t}}_{a_{I} .} \times {\bar{t}}_{h_{I} .}}))$ worked well as initial starting value. Exactly parallel computations yield the following ILRT statistics for the one-way setup:

\begin{aligned} T_{I L R T 3} & = 2 {\sum_{i = 1}^{I} n_{i} l o g (\frac{{\hat{β}}_{0}}{{\hat{β}}_{i}}) + \sum_{i = 1}^{I} \sum_{j = 1}^{n_{i}} l o g [\frac{A (t_{i j}; β_{i})}{A (t_{i j}; β_{0})}] \\ + \sum_{i = 1}^{I} \frac{(n_{i} - 1)}{2} l o g [\frac{\sum_{j = 1}^{n_{i}} a (t_{i j}; β_{0})}{\sum_{j = 1}^{n_{i}} a (t_{i j}; β_{i})}]} . \end{aligned}

(17)

The asymptotic $χ^{2}$ distribution of $T_{I L R T 3}$ is stated in Theorem 4.1. Its proof is similar to that of Theorem 2.1 and is omitted.

Theorem 4.1

The second-order accurate asymptotic distribution of $T_{I L R T 3}$ is $χ_{(I - 1)}^{2}$ .

4.1. Corrective adjustments under small samples (one-way setup)

Based on 100,000 simulations, the simulated type-I errors of ILRT at 5% level of significance, for various group sizes $n = 5 (5) 45; 100, 200$ ; group shape parameters $γ_{1} = 0.5 (0.5) 5$ , $γ_{i} = γ_{i - 1} \times h_{2}$ , $i = 2, \dots, I$ , incrementing $h_{2}$ through 0.5, 1, 1.5; (and other parametric combinations as specified in the set $Q$ given in Section 4.2.1 except GSP) converged well to the target level 0.05, indicating good conformation to the asymptotic $χ^{2}$ distribution specified by Theorem 4.1 for group sizes exceeding 45. As before, a minor ad-hoc corrective adjustment factor is developed using a parallel approach, stated in Equation (18), and succeeds very well in controlling the sizes, suggesting the use of corrected statistics given in Equation (19)

\begin{aligned} c f & = {\begin{cases} 0.936 + 0.00128 \times n & f o r 5 \leq n \leq 45, \\ 1 & f o r n > 45. \end{cases} \end{aligned}

(18)

\begin{aligned} T_{I L R T 3_a d j} & = 2 \sum_{i = 1}^{I} c f_{i} \times {\sum_{i = 1}^{I} n_{i} l o g (\frac{{\hat{β}}_{0}}{{\hat{β}}_{i}}) + \sum_{i = 1}^{I} \sum_{j = 1}^{n_{i}} l o g [\frac{A (t_{i j}; β_{i})}{A (t_{i j}; β_{0})}] \\ + \sum_{i = 1}^{I} \frac{(n_{i} - 1)}{2} l o g [\frac{\sum_{j = 1}^{n_{i}} a (t_{i j}; β_{0})}{\sum_{j = 1}^{n_{i}} a (t_{i j}; β_{i})}]} . \end{aligned}

(19)

4.2. Performance assessment through simulation study

A rigorous simulation study is attempted to compare the performance of ILRT with the GPQ-based test in terms of type-I error and power function.

4.2.1. Type-I error rate

The study is carried out under the following fairly representative set of parametric combinations: number of groups $I = 2 (2) 8$ ; group shape parameters $γ_{1} = 0.5, 2$ , $γ_{i} = γ_{i - 1} \times h_{2}$ , $i = 2, \dots, I$ , incrementing $h_{2}$ through 0.8, 1, 1.15; group median parameters $β_{i} = 1$ , $i = 1, \dots, I$ (kept fixed based on a pilot study showing no impact of $β β$ on the observed sizes under $H_{0}$ ), and $n_{i} = 5$ and $n_{i} = 15$ , respectively, representing small and medium group sizes, $i = 1, \dots, I$ .

Similar to the two-way setup, four GSP were considered: GSP-1: $100 %$ (all) small size groups ( $n_{i} = 5$ , $i = 1, \dots, I$ ), GSP-2: $75 %$ small size and $25 %$ medium size ( $n_{i} = 15$ ) groups, GSP-3: 50:50 proportion of small and medium size groups and the last one GSP-4: $25 %$ small size and $75 %$ medium size groups. The four VPS considered were VPS-1 (small equal shape parameters $(< 1)$ ): $γ_{1} = 0.5$ ; $h_{2}$ =1, VPS-2 (large shape under large size groups): $γ_{1} = 0.5$ ; $h_{2}$ =1.15, VPS-3 (small shape under large size groups): $γ_{1} = 2$ ; $h_{2}$ =0.8 and VPS-4 (large equal shape parameters $(> 1)$ ): $γ_{1} = 2$ ; $h_{2}$ =1. As before, the set of all parametric combinations resulting from the above settings is denoted by $Q$ in the sequel.

Based on 5000 simulations, type-I errors of ILRT, LRT, GPQ and Delta method are simulated for each of the parametric combinations in $Q$ . For GPQ, the type-I error is estimated as the proportion of the simulated 5000 P-values falling below the nominal level of α. A single P-value was calculated based on 1000 GPQs. Commonly used $5 %$ level of significance was used. The box plots of simulated type-I errors plotted in Figure 6 revealed uniform performance of tests over $Q$ , differing across the tests, namely, conservative sizes of GPQ, well-concentrated close to nominal sizes of ILRT and liberal (large) sizes of LRT and Delta method.

Figure 6. — Simulated type-I errors of tests under one-way setup.

Thus, LRT and Delta method being not usable with respect to size criterion is not included in the further power study presented next.

4.2.2. Power comparison

Powers were simulated based on 5000 simulations for the aforementioned tests for the same parametric setup $Q$ . Additionally, the median parameter vector $β β$ was systematically varied taking $β_{1} = 1$ and $β_{i} = β_{i - 1} \times h_{1}$ , $i = 2, \dots, I$ , incrementing $h_{1}$ through $h_{1} = 1 (.02) 1.1; 1.2 (0.2) 3$ and additionally by $h_{1} = 5 (5) 30, 40, 50$ under two groups so as to acquire reasonable power. Again, $h_{1} = 1$ corresponds to the null hypothesis.

The power functions (as a function of increments $h_{1}$ ) for I = 2, 4, 6 and 8 groups are displayed in Figure 7. Here too, the panel headings of each sub-figure are the pairs (VPS, GSP). For two groups, GSP-2 and 4 are not valid as the partition can be only 50:50 or 100 $%$ . Notable observations revealed in the power comparison are reported below.

4.3. Observations based on power functions

ILRT performs excellent under two situations: (i) small size groups, irrespective of the shape parameter pattern in the groups and (ii) under larger values of shape parameter for all groups. In all other situations, both tests perform almost similar.

5. Examples

The three real data sets; two under the two-way and one under the one-way setup highlight the contribution of the ILRT tests developed here in industrial applications to arrive at more correct decisions. The model adequacy checks attempted by employing two graphical analyses, namely the quantile residual plots suggested by Dunn and Smyth [2] and interval plots of residuals, as well as the AIC and BIC criteria strongly support in favor of the decision given by ILRT. Matlab codes for data analysis are provided as supplementary material.

5.1. Example 1. Two-way setup: epitaxial layer thickness

The data are taken from Montgomery [5], where the epitaxial layer thickness (measured in μm) is studied to examine the impact of two factors, namely, deposition time (A) and arsenic flow rate (B), each one at two levels. Each of the resulting four combinations is replicated four times. The summary statistics sample median ( ${\hat{β}}_{i j}$ ) and shape parameter ( ${\hat{γ}}_{i j}$ ), $i = 1, \dots, I$ , $j = 1, \dots, J$ are reported in Table 1. The researchers were interested in reducing the variability in the thickness of the layer by keeping the average thickness level as close to the target level as possible.

Table 1.

Example 1. Summary sample statistics for four treatment combinations.

Cell number $(i, j)$	(1, 1)	(1, 2)	(2, 1)	(2, 2)
$n_{i j}$	4	4	4	4
Median (scale) ( ${\hat{β}}_{i j}$ )	14.4913	13.9213	14.8247	14.7874
Shape parameter ( ${\hat{γ}}_{i j}$ )	0.0632	0.0048	0.0030	0.0148

Open in a new tab

Often, practitioners adopt a normal-based F-test with homogeneous variances by ignoring the non-negative feature and/or skewed nature of the response distribution. This popularly used analysis employed on the present data picked none of the two factors as having a significant impact on the epitaxial layer thickness. The data are also analyzed using the BS-based ILRT developed in Section 2 which seems more appropriate here on account of the non-negative nature of the response. The later was also validated by model adequacy checks described in the next paragraph. As noted earlier, this analysis is equivalent to testing the impact of factors on the cell medians, that is more relevant for a skewed distribution like BS. Results of analyses are reported in Table 2, where the ILRT strongly picked out the deposition time (factor A) as an influential factor, which the conventional F-based analysis was incapable to pick.

Table 2.

Example 1. Data analysis results: two-way setup.

	P-values for the test for effects
Tests	A	B	AB	P-values for normality of residuals $^{a}$	AIC	BIC	MSE
Conventional F-test	0.0600	0.2830	0.3390	0.0110	33.5024	35.0476	0.3701
BS ILRT $^{a}$	0.0089	0.6585	1.0000	0.4790	1.3391	6.7472	0.3173

Open in a new tab

$^{a}$ Quantile residuals are used.

To assess the adequacy of the model supported by ILRT, the quantile residuals $(r_{q})$ are obtained as, $r_{q i j k} = Φ^{- 1} {F (t_{i j k}, \hat{μ}, {\hat{α}}_{i}, {\hat{γ}}_{i j})}$ ; $i = 1, \dots, I$ , $j = 1, \dots, J$ and $k = 1, \dots, n_{i j}$ , where $F (\cdot)$ is the cumulative distribution function (CDF) of BS distribution and $Φ (\cdot)$ is the standard normal CDF. Since the model has picked out a single significant factor, the estimates $\hat{μ}, {\hat{α}}_{i}$ are taken as the maximizers of the log-IL given in Equation (9) involving single factor and ${\hat{γ}}_{i j}$ are the regular MLE's of the group shape parameters computed holding the parameters $μ, α_{i}$ fixed at $\hat{μ}, {\hat{α}}_{i}$ . For the model involving no significant factors supported by the F-test, $r_{i j k} = (t_{i j k} - {\hat{t}}_{i j k})$ ; $i = 1, \dots, I$ , $j = 1, \dots, J$ and $k = 1, \dots, n_{i j}$ . ${\hat{t}}_{i j k} = 14.514$ ; $i = 1, \dots, I$ , $j = 1, \dots, J$ and $k = 1, \dots, n_{i j}$ being the overall mean of all observations. The P-value (0.4790) of the normality test for quantile residuals under ILRT based on BS (Figure 8(a)) distribution strongly supports the validity of the BS assumption for the data as well as the model proposed by ILRT. Furthermore, the almost uniform appearance of the interval plot (Figure 9(a)) of the residuals grouped cell-wise over the four cells (renumbered as $(1, 1) = 1, (1, 2) = 2, (2, 1) = 3$ and $(2, 2) = 4$ ) where each individual interval covers zero, under the BS model goes in the favor of the model given by the ILRT indicating factor A significantly affects the epitaxial layer thickness. Note that the similar plot for F-based residuals indicates heterogeneous variances for the residuals and the middle two cells do not cover zero, indicating that their mean is different from zero. Furthermore, the smaller values of AIC, BIC and MSE reported in Table 2 for BS more strongly support in favor of the BS-based model, against the F-based model with no significant factors.

Figure 8. — Example 1. Normal probability plots of residuals: two-way setup. (a) Quantile residuals under ILRT based on BS. (b) Residuals under F-test based on normal.

Figure 9. — Example 1. 95% confidence interval plots of the mean of residuals for each treatment combination: two-way setup. (a) Quantile residuals under ILRT based on BS. (b) Residuals under F-test based on normal.

The main effect plot of deposition time (factor A) plotted in Figure 10 further suggests that epitaxial layer thickness increases as the deposition time increases. In the plot, the range of epitaxial layer thickness under both levels of deposition time falls in the allowable range of 14–15 $μ m$ . However, the main aim was to reduce the variability of the layer while being close to the target level. As the smaller level of deposition time controls the mean level within the target range, the smaller level, that is 10 min of deposition time is recommended to insure smaller median parameters and hence smaller variability. The level of arsenic flow rate can be arbitrarily chosen, as it does not produce any significant impact on the median.

5.2. Example 2. One-way setup: failure times of Kevlar 49 fiber

The test procedure under one-way setup is illustrated with a real data set from Leiva [3]. The response variable is the failure time (measured in hours) of Kevlar 49 fiber subject to various stress levels (measured in MPa) and spools. Eight spools at a constant stress level 29.7 are considered for the present analysis. Table 3 presents the summary statistics, sample median ( ${\hat{β}}_{i}$ ) and shape parameter ( ${\hat{γ}}_{i}$ ), $i = 1, \dots, 8$ .

Table 3.

Example 2. Summary sample statistics for eight spools.

Treatment number (i)	1	2	3	4	5	6	7	8
$n_{i}$	4	8	4	5	4	3	8	3
Median (scale) ( ${\hat{β}}_{i}$ )	767.8610	21.0540	26.8041	982.8460	45.5263	28.4913	11.8251	309.9670
Shape parameter ( ${\hat{γ}}_{i}$ )	0.3498	1.5816	0.8882	0.7868	1.5224	1.4335	1.1013	0.9032

Open in a new tab

The data are analyzed as a one-way setup for the median under BS distribution assuming unequal shape parameters. ILRT (Equation (19)) is employed with the multiplicative corrective adjustment given in Equation (18). The P-value for the GPQ test was computed based on 100,000 samples. A conventional F-test is also employed on the data. The results of the data analysis are reported in Table 4.

Table 4.

Example 2. Data analysis results: one-way setup.

Test	P-values for testing equality of eight spools	P-values for normality of residuals $^{a}$	AIC	BIC	MSE
Conventional F-test	0	0.0005	556.4478	573.0834	55116
ILRT $^{a}$	0.0108	0.1210	452.6069	479.2239	55192
GPQ $^{a}$	0.1605	0.008	486.8273	501.7994	117160

Open in a new tab

$^{a}$ Quantile residuals are used.

The ILRT and F-test indicate a significant impact of spools on the fiber life while the GPQ is not capable of identifying this impact. Note that this is a situation with large number of small sized groups. In the simulation study for power comparison considered in Section 4, this situation is represented in Figure 7 under I = 8 groups with the first column representing small sized groups. Thus, the present situation resembles first column of Figure 7 under I = 8 groups where ILRT is clearly an out-performer over GPQ indicating more trust in the results produced by ILRT over those of GPQ for this case. The main effect plot (Figure 11) indicates that the average failure time of fiber is maximum at spool-4.

Figure 11. — Example 2. Main effect plot of spool under one-way model.

As before, for model adequacy checks, the quantile residuals are obtained as, $r_{q i j} = Φ^{- 1} {F (t_{i j}, {\hat{β}}_{i}, {\hat{γ}}_{i})}$ ; $i = 1, \dots, I$ , $j = 1, \dots, n_{i}$ , where $F (\cdot)$ CDF of BS distribution and $Φ (\cdot)$ is the standard normal CDF. For ILRT ${\hat{β}}_{i}$ are obtained based on the log-IL given in Equation (16) and ${\hat{γ}}_{i}$ is the regular MLE computed holding the medians $β_{i}$ fixed at ${\hat{β}}_{i}$ while for GPQ, GPQ-based estimates of parameters are used. As before, for conventional F-test $r_{i j} = (t_{i j} - {\hat{t}}_{i j})$ ; $i = 1, \dots, I$ and $j = 1, \dots, n_{i}$ , where ${\hat{t}}_{i j} = {\bar{t}}_{i .}$ , $i = 1, \dots, I$ and $j = 1, \dots, n_{i}$ being the mean of ith group. The normal probability plots of residuals are plotted in Figure 12, which clearly indicate that the ILRT-based residuals are normal while others may not be so.

Figure 12. — Example 2. Normal probability plots of residuals: one-way setup. (a) Quantile residuals based on maximum IL estimates of parameters under $H_{1}$ . (b) Quantile residuals based on GPQ-based estimates of parameters under $H_{0}$ . (c) Residuals under F-test based on normal distribution with unequal treatment means.

On account of AIC, BIC and P-value for normality of quantile residuals, the model and hence the parameter estimates offered by ILRT seem to be more reliable. Here too, the uniform appearance of the interval plots (Figure 13(a)) without any trend under the $H_{1}$ model also goes in favor of the decision given by the ILRT indicating significant differences among the spools. Note also that the GPQ-based residuals indicate non-zero mean for spool numbered 1, 2, 4 and 7.

Figure 13. — Example 2. Interval plots of residuals: one-way setup. (a) Quantile residuals based on maximum IL estimates of parameters under $H_{1}$ . (b) Quantile residuals based on GPQ-based estimates of parameters under $H_{0}$ . (c) Residuals under F-test based on normal distribution with unequal treatment means.

5.3. Example 3. Two-way setup: delivery time data

The data are taken from Montgomery [5], where the engineer is interested to test the impact of two different types of 32-ounce bottles (glass (level-1) and plastic (level-2)) (factor A) on the time to deliver 12-bottle cases of the product. Two workers (factor B) are used to move 40 cases of product 50 ft on a standard type of hand truck and stacking the cases in a display to perform a task. Each treatment combination is replicated four times, and the delivery time is recorded. Sample medians and shape parameters of the resulting data under BS distribution assumption are tabulated in Table 5.

Table 5.

Example 3. Summary sample statistics for four treatment combinations.

Cell number $(i, j)$	(1, 1)	(1, 2)	(2, 1)	(2, 2)
$n_{i j}$	4	4	4	4
Median (scale) ( ${\hat{β}}_{i j}$ )	4.9968	5.9632	4.4666	4.9075
Shape parameter ( ${\hat{γ}}_{i j}$ )	0.0164	0.0805	0.0615	0.0450

Open in a new tab

The data are analyzed using conventional normal-based F-test with homogeneous variances and also using the BS-based ILRT developed in Section 2. The results are assessed by model adequacy checks. Table 6 shows the results of the analysis, which show that both tests identified both factors as influential factors, and no significant interaction. However, it is notable that the model parameter estimates given by the two tests are different.

Table 6.

Example 3. Data analysis results: two-way setup.

Tests	P-values for the test for effects
	A	B	AB	P-value for normality of residuals $^{a}$	AIC	BIC	MSE
Conventional F-test	0.0010	0.0020	0.1460	0.0260	22.3679	27.0035	0.1119
BS ILRT	0.0095	0.0194	0.1539	0.1250	17.7571	24.7104	0.1288

Open in a new tab

$^{a}$ Quantile residuals are used.

To assess the adequacy of the model supported by ILRT, the quantile residuals are obtained as before, $r_{q i j k} = Φ^{- 1} {F (t_{i j k}, \hat{μ}, {\hat{α}}_{i}, {\hat{β}}_{j}, {\hat{γ}}_{i j})}$ ; $i = 1, \dots, I$ , $j = 1, \dots, J$ and $k = 1, \dots, n_{i j}$ . Since the interaction term does not seem to be significantly different from zero, the estimates $\hat{μ}, {\hat{α}}_{i}$ , ${\hat{β}}_{j}$ are taken as the maximizers of the log-IL given in Equation (4) and estimates of the shape parameter ${\hat{γ}}_{i j}$ are the regular MLE's of the group shape parameters computed holding the parameters $μ, α_{i}, β_{j}$ fixed at $\hat{μ}, {\hat{α}}_{i}, {\hat{β}}_{j}$ ; $i = 1, \dots, I, j = 1, \dots, J$ . For the model supported by the F-test, the residuals are $r_{i j k} = (t_{i j k} - {\hat{t}}_{i j k})$ ; where ${\hat{t}}_{i j k} = {\bar{t}}_{a i . .} + {\bar{t}}_{a . j .} - {\bar{t}}_{a}$ , referring to the notation developed in Section 2.1, $i = 1, \dots, I$ , $j = 1, \dots, J$ and $k = 1, \dots, n_{i j}$ . The P-value (0.1250) of the normality test for quantile residuals under ILRT based on BS (Figure 14(a)) distribution strongly supports the validity of the BS assumption for the data as well as the model parameter estimates obtained under ILRT. Furthermore, the almost uniform length and appearance of the interval plot (Figure 15(a)) of the residuals grouped cell-wise over the four cells, where each individual interval covers zero, under the BS model goes in the favor of the model estimates given by the ILRT indicating factors A and B significantly affect the delivery time. Similar plot for F-based residuals indicates heterogeneous variances for the residuals. Furthermore, the smaller values of AIC and BIC reported in Table 6 under BS-based model more strongly support in favor of the BS-based model, against the F-based model.

Figure 14. — Example 3. Normal probability plots of residuals: two-way setup. (a) Quantile residuals under ILRT based on BS. (b) Residuals under F-test based on normal.

Figure 15. — Example 3. 95% confidence interval plots of the mean of residuals for each treatment combination: two-way setup. (a) Quantile residuals under ILRT based on BS. (b) Residuals under F-test based on normal.

The main effect plot of two worker (factor B) and bottle type (factor A) plotted in Figure 16 further suggests that delivery time required for worker 2 and plastic bottle is smaller.

6. Concluding remarks

The proposed ILRT provides a uniformly satisfactory option for two-way and one-way setup under BS distributed response variable commonly encountered in industries. The multiway generalization of the test is straightforward, and an attempt is being made to extend the test to other non-normal response variables namely when the data are Gamma, or Weibull distributed, after employing the close to-normal transformations suggested in Kulkarni and Powar [11], Kulkarni and Powar [12], respectively as well as under censoring themes, an area that is very less addressed in the literature in spite of its wide applicability. For one-way model, the ILRT developed here is observed to prominently outperform its GPQ-based peer test under almost all of scenarios. ILRT has an added advantage of having less computational efforts making it more useful for practitioners not using advanced software. The analysis of real data sets clearly highlights the need of ILRT in industrial applications.

Supplementary Material

Supplemental Material

Click here for additional data file.^{(240.6KB, pdf)}

Acknowledgments

The authors are sincerely thankful to three anonymous referees, associate editor and editor for careful reading and insightful comments that greatly improvement of this manuscript.

Funding Statement

The authors are very much thankful to National Board for Higher Mathematics (NBHM), Department of Atomic Energy (DAE), India, under the grant sanction letter no. 2/48(34)/2016/NBHM(RP)/R&D II/4531 dated 31/03/2017, Department of Science and Technology, New Delhi, through FIST scheme (SR/FST/MSI-103 dated 18/11/2015) and UGC, New Delhi under Special Assistance Programme (SAP) (F.520/8/DRS-I/2016(SAP-I)) for financial support for this research work.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

1.Berger J.O., Liseo B., and Wolpert R.L., Integrated likelihood methods for eliminating nuisance parameters, Statist. Sci. 14 (1999), pp. 1–28. [Google Scholar]
2.Dunn P.K. and Smyth G.K., Randomized quantile residuals, J. Comput. Graph. Statist. 5 (1996), pp. 236–244. [Google Scholar]
3.Leiva V., The Birnbaum–Saunders Distribution, Academic Press, Amsterdam, 2015. [Google Scholar]
4.Lemonte A.J., Cordeiro G.M., and Moreno-Arenas G., Improved likelihood-based inference in Birnbaum–Saunders nonlinear regression models, Appl. Math. Model. 40 (2016), pp. 8185–8200. [Google Scholar]
5.Montgomery D.C., Design and Analysis of Experiments, John Wiley & Sons, New York, 2017. [Google Scholar]
6.Niu C., Guo X., Xu W., and Zhu L., Comparison of several Birnbaum–Saunders distributions, J. Stat. Comput. Simul. 84 (2014), pp. 2721–2733. [Google Scholar]
7.Rieck J.R. and Nedelman J.R., A log-linear model for the Birnbaum–Saunders distribution, Technometrics 33 (1991), pp. 51–60. [Google Scholar]
8.SenGupta A. and Kulkarni H.V., Universal and efficient tests for homogeneity of mean directions of circular populations, Statist. Sinica 30 (2020), pp. 1995–2021. [Google Scholar]
9.Kulkarni H.V. and Patil S.M., Uniformly implementable small sample integrated likelihood ratio test for one-way and two-way ANOVA under heteroscedasticity and normality, AStA Adv. Stat. Anal. 105 (2021), pp. 273–305. [Google Scholar]
10.Kulkarni H.V. and SenGupta A., An efficient test for homogeneity of mean directions on the hyper-sphere, Int. Stat. Rev. 90 (2021), pp. 1–43. [Google Scholar]
11.Kulkarni H.V. and Powar S.K., A new method for interval estimation of the mean of the Gamma distribution, Lifetime Data Anal. 16 (2010), pp. 431–447. [DOI] [PubMed] [Google Scholar]
12.Kulkarni H.V. and Powar S.K., A simple normal approximation for Weibull distribution with application to estimation of upper prediction limit, J. Probab. Stat. 2011 (2011), p. 863274. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

Click here for additional data file.^{(240.6KB, pdf)}

[CIT0001] 1.Berger J.O., Liseo B., and Wolpert R.L., Integrated likelihood methods for eliminating nuisance parameters, Statist. Sci. 14 (1999), pp. 1–28. [Google Scholar]

[CIT0002] 2.Dunn P.K. and Smyth G.K., Randomized quantile residuals, J. Comput. Graph. Statist. 5 (1996), pp. 236–244. [Google Scholar]

[CIT0003] 3.Leiva V., The Birnbaum–Saunders Distribution, Academic Press, Amsterdam, 2015. [Google Scholar]

[CIT0004] 4.Lemonte A.J., Cordeiro G.M., and Moreno-Arenas G., Improved likelihood-based inference in Birnbaum–Saunders nonlinear regression models, Appl. Math. Model. 40 (2016), pp. 8185–8200. [Google Scholar]

[CIT0005] 5.Montgomery D.C., Design and Analysis of Experiments, John Wiley & Sons, New York, 2017. [Google Scholar]

[CIT0006] 6.Niu C., Guo X., Xu W., and Zhu L., Comparison of several Birnbaum–Saunders distributions, J. Stat. Comput. Simul. 84 (2014), pp. 2721–2733. [Google Scholar]

[CIT0007] 7.Rieck J.R. and Nedelman J.R., A log-linear model for the Birnbaum–Saunders distribution, Technometrics 33 (1991), pp. 51–60. [Google Scholar]

[CIT0008] 8.SenGupta A. and Kulkarni H.V., Universal and efficient tests for homogeneity of mean directions of circular populations, Statist. Sinica 30 (2020), pp. 1995–2021. [Google Scholar]

[CIT0009] 9.Kulkarni H.V. and Patil S.M., Uniformly implementable small sample integrated likelihood ratio test for one-way and two-way ANOVA under heteroscedasticity and normality, AStA Adv. Stat. Anal. 105 (2021), pp. 273–305. [Google Scholar]

[CIT0010] 10.Kulkarni H.V. and SenGupta A., An efficient test for homogeneity of mean directions on the hyper-sphere, Int. Stat. Rev. 90 (2021), pp. 1–43. [Google Scholar]

[CIT0011] 11.Kulkarni H.V. and Powar S.K., A new method for interval estimation of the mean of the Gamma distribution, Lifetime Data Anal. 16 (2010), pp. 431–447. [DOI] [PubMed] [Google Scholar]

[CIT0012] 12.Kulkarni H.V. and Powar S.K., A simple normal approximation for Weibull distribution with application to estimation of upper prediction limit, J. Probab. Stat. 2011 (2011), p. 863274. [Google Scholar]

PERMALINK

Analysis of medians under two-way model with and without interaction for Birnbaum–Saunders distributed response

S M Patil

H V Kulkarni

Abstract

1. Introduction

2. The proposed ILRT

2.1. Two-way setup with interaction

Theorem 2.1

2.2. Two-way setup without interaction

Theorem 2.2

2.3. Assessment of convergence of type-I error rate through simulations

Figure 1.

Figure 2.

2.4. Corrective adjustments under small samples (two-way setup)

Figure 3.

3. Performance assessment through simulation study

3.1. Type-I error rate

3.2. Power performance

Figure 4.

Figure 5.

4. ILRT under one-way setup

Theorem 4.1

4.1. Corrective adjustments under small samples (one-way setup)

4.2. Performance assessment through simulation study

4.2.1. Type-I error rate

Figure 6.

4.2.2. Power comparison

Figure 7.

4.3. Observations based on power functions

5. Examples

5.1. Example 1. Two-way setup: epitaxial layer thickness

Table 1.

Table 2.

Figure 8.

Figure 9.

Figure 10.

5.2. Example 2. One-way setup: failure times of Kevlar 49 fiber

Table 3.

Table 4.

Figure 11.

Figure 12.

Figure 13.

5.3. Example 3. Two-way setup: delivery time data

Table 5.

Table 6.

Figure 14.

Figure 15.

Figure 16.

6. Concluding remarks

Supplementary Material

Acknowledgments

Funding Statement

Disclosure statement

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases