Skip to main content
MethodsX logoLink to MethodsX
. 2022 Nov 9;9:101915. doi: 10.1016/j.mex.2022.101915

Optimum estimator in simple random sampling using two auxiliary attributes with application in agriculture, fisheries and education sectors

Monika Saini 1, Bhatt Ravi Jitendrakumar 1, Ashish Kumar 1,
PMCID: PMC9674916  PMID: 36411800

Abstract

In modern age of information technology, data is available everywhere in huge amount. Every sector generates lot of data every day. The investigation of each unit of data is not feasible due to limited resources like time, labor, and cost. In such situations, survey sampling is recommended to draw the information about the population parameters. Therefore, the main objective of present study is to develop an estimation method for obtaining the information about population parameter. We propose an optimum estimator for enhanced estimation of population mean in simple random sampling by utilizing the information of the two auxiliary attribute. The expression for bias, mean squared error (MSE) and minimum mean squared error of the proposed estimator are derived up to the first order of approximation and it is shown that the proposed estimator under derived conditions perform better than the existing estimators theoretically. Four population are demonstrated to assess the performance as well as applicability of the proposed estimator. The percentage relative efficiency (PRE) of proposed estimator for all the populations is 209.533, 163.852, 210.398 and 340.578, respectively. The numerical illustrations confirm that the proposed estimator dominates over the existing estimators.

  • The main objective of present study is to propose a new estimator/method for estimation of population mean using two auxiliary attributes under simple random sampling.

  • The bias and mean square error of the proposed estimator/method is derived and compared with the existing estimators to compare the efficiency theoretically.

  • Applications of the proposed method/estimator is highlighted using thorough the real data sets of various sectors.

Keywords: Finite population, Auxiliary attribute, Simple random sampling, Bias, Mean squared error

Graphical abstract

Image, graphical abstract


Specifications table

Subject Area: Mathematical Statistics
More specific subject area: Sample Surveys
Method name: Optimum estimator for estimating population mean using two auxiliary attributes
Name and references of original method: The proposed method is motivated by following references:
  • Ahmad, S., Arslan, M., Khan, A., & Shabbir, J. (2021). A generalized exponential-type estimator for population mean using auxiliary attributes. Plos one, 16(5), e0246947.

  • Zaman, T., & Kadilar, C. (2019). Novel family of exponential estimators using information of auxiliary attribute. Journal of Statistics and Management Systems, 22(8), 1499–1509

  • Lu, J. (2017). Efficient estimator of a finite population mean using two auxiliary variables and numerical application in agricultural, biomedical, and power engineering. Mathematical Problems in Engineering.

Resource availability: Data utilized in the analysis is available in public domain.

Background

In sample surveys, it is well documented in Cochran [20] that the use of supplementary information provided by auxiliary variables or attributes is frequently used for increasing the precision of the estimators by taking the advantages of correlation between the study variable and auxiliary variable. Regression, ratio, and product estimators are good examples in this perspective. Though Cochran [1] investigated that ratio estimator is best suited when the study and auxiliary variable are highly positively correlated whereas in case of highly negative correlated variables, product method of estimation is better. In many real-life situations, the study variable is not always quantitative in nature. The responses recorded from respondents are qualitative in such situations the recorded information is called attributes. Several studies like Shabbir and Gupta [22] and Abd-Elfattah et al. [23] have been conducted to improve the precision of the estimator by utilizing the auxiliary attributes having applications in agriculture, health science, fisheries, power engineering etc. It is also investigated that use of more than one auxiliary attribute enhance the efficiency of the estimator. Singh et al. [2] used auxiliary attribute information for establishment of ration estimators in simple random sampling (SRS). The bias and mean square error of the estimator has been computed for existing data set available in literature. It is developed as a modified estimator of Koyuncu and Kadilar [21] estimator and proved that it outperformed the existing estimators. Singh and Kumar [4] used auxiliary information to estimate improved regression estimator for SRS. Here the auxiliary information is qualitative, and concept of non-response incorporated in estimation. Malik and Singh [3] initiated the use of two auxiliary variables in estimation of population mean and proposed enhanced estimators. Here, auxiliary information available in qualitative form and this estimator performed better than simple regression estimator. Ekpenyong and Enang [5] suggested better exponential estimators in SRS for estimating population mean. The concept of simple random sampling without replacement used for development of estimator. Lu [6] explored the applications of estimators developed under auxiliary information in agriculture and power engineering sectors. The estimator compared with regression estimator and other existing estimators and proved efficient. Zaman and Kadilar [7] utilized auxiliary information for development of a novel family of exponential estimators. Ahmad et al. [8] carried out the generalization of exponential ratio estimators under auxiliary estimators. The estimator was exponential-based estimator while estimator proposed by us is mixed type estimator while estimator proposed in present study is a mixture of simple, ratio and product estimators. Mahajan et al. [9] explored the applications of estimation and sample surveys in agriculture and health sciences. Kumar and Saini [10] suggested a predictive approach to estimate the population mean under auxiliary attribute. Yunusa et al. [11] utilized auxiliary variables in development of regression type estimators to estimation the population mean. Rather et al. [24] used auxiliary information for development of a mixed exponential ratio type estimator for estimating the population mean. The simple random sampling and double sampling techniques utilized for selection of the sample. Zaman et al. [25] proposed an exponential type estimator for assessing and estimating the COVID-19 risk in various countries. Two multivariate families of exponential type estimators proposed by utilizing the information on two auxiliary variables. Many authors like Wayangkau et al. [18], Waheeb et al. [15], Rajak [16], and Jabal et al. [17] discussed some other data analysis techniques for analysis of agricultural information. The above cited literature motivates to explore the applicability of two auxiliary attributes in estimation of mean of various populations associated with agriculture, fisheries, and education sectors. The primary goal of this paper is to propose a novel optimum estimator for estimating finite population mean using auxiliary attributes. The expressions for the bias and mean square error (MSE) of the proposed estimator are inferred up to the first order of approximation. On the bases of theoretical and numerical comparisons, we demonstrate that the proposed estimator is more efficient than existing estimators.

Material and methods

Consider γ=(γ1,γ2,γ3,.,γN) be a finite population of size N. we draw a sample of size n (with n<N) units from γ using simple random sample without replacement (SRSWOR). Let yi be the study variable and i be the characteristics of the auxiliary attributes i.e. i = 1 if the ith unit possess attribute and i = 0, otherwise. Let G=i=1Nδibe the total number of units in the population possessing attributes i and g=i=1nδibe the total number of units in the sample possessing attributes i. Let D=(G/N) be the proportion of units in the population and d=(g/n) be the population of units in the sample. Let Y¯=i=1NyiNand y¯=i=1nyinbe the population and sample mean of the study variable. Let D1 and D2 be the population proportion of the auxiliary attributes and sample proportion of auxiliary attributes is denoted by d1 and d2. Let Sy2=i=1N(yiY¯)2N1be the population variance of the study variable y. Let Sd12=i=1N(d1D1)2N1and Sd22=i=1N(d2D2)2N1respectively be the population variance of the auxiliary attributes d1 and d2. Let Cy=SyY¯ be the coefficient of variation of the study variable y. Let Cd1=Sd1D1 and Cd2=Sd2D2 be the coefficient of variation of the auxiliary d1 and d2. Let Sydj=i=1N(yiY¯)(djDj)N1be the population covariance between the study variable y and the auxiliary attributes dj(j=1,2). Let Sd1d2=i=1N(d1D1)(d2D2)N1be the population covariance between the auxiliary attributes d1 and d2. Let ρydj=SydjSySdj be the population point bi-serial correlation coefficient between the study variable y and the auxiliary attribute dj(j=1,2).Let π0=y¯Y¯Y¯; π1=d1D1D1 and π2=d2D2D2 be the error terms such that E[ei]=0(i=0,1,2), E[π02]=ξCy2,E[π12]=ξCd12,E[π22]=ξCd22,E[π0π1]=ξρyd1CyCd1,E[π0π2]=ξρyd2CyCd2andE[π1π2]=ξρd1d2Cd1Cd2where ξ=(1n1N) and f=nN .

The stepwise framework of proposed method is described as follows:

  • Step 1: Consider a finite population of size N.

  • Step 2: Select a random sample of size n from the population using simple random sampling without replacement.

  • Step 3: Observe yi and i from sampling units.

  • Step 4: Define the expressions for population and sample characteristics.

  • Step 5: Propose the estimator for estimating population mean using two auxiliary attributes and derive its properties.

  • Step 6: Compare the proposed estimator with the existing estimators theoretically and numerically.

Existing estimators

Unbiased estimator (μ^0)

The most widely used estimator discussed by Cochran [1] of population mean Y¯of the study variable, is given by

μ^0=y¯, (1)

sample mean is an unbiased estimator of population mean and upto the first order of approximation the variance or MSE is given by

V(μ^0)=V(y¯)=ξY¯2cy2. (2)

Naik and Gupta (μ^1)

Naik and Gupta [12] proposed the following ratio estimator of population mean Y¯when the population proportion D1 of auxiliary attribute is known

μ^1=y¯(D1d1), (3)

The bias and MSE of μ^1 to the first order of approximation is given by,

Bias(μ^1)ξY¯[cd12ρyd1cycd1],

and

MSE(μ^1)ξY¯2[cy2+cd122ρyd1cycd1]. (4)

Naik and Gupta (μ^2)

Naik and Gupta [12] proposed the following product estimator of population mean Y¯when the population proportion D1 of auxiliary attribute is known

μ^2=y¯(d1D1), (5)

The bias and MSE of μ^2, to the first order of approximation is given by

Bias(μ^2)Y¯ξ[ρyd1cycd1],

and

MSE(μ^2)ξY¯2[cy2+cd12+2ρyd1cycd1]. (6)

Singh et al. (μ^3)

Singh et al. [2] suggested an exponential type ratio estimator for estimating population mean Y¯using the population proportion D1 of auxiliary attribute is known

μ^3=y¯exp[D1d1D1+d1], (7)

The bias and MSE of μ^3, to the first order of approximation,

Bias(μ^3)Y¯ξ[38cd1212ρyd1cycd1],

and

MSE(μ^3)ξY¯2[cy2+14cd12ρyd1cycd1]. (8)

Singh et al. (μ^4)

Singh et al. [2] suggested the following product estimators for estimating population mean Y¯when the population proportion D1 of auxiliary attribute is known

μ^4=y¯exp[d1D1d1+D1], (9)

Similarly, the bias and MSE of μ^4, is given by

Bias(μ^4)12Y¯ξ[ρyd1cycd114cd12],

and

MSE(μ^4)12Y¯2ξ[cy2+14cd12+ρyd1cycd1]. (10)

Kumar and Bhougal (μ^5)

Kumar and Bhougal [13] proposed an exponential type of ratio-product estimator for estimating population mean Y¯when the population proportion D1 of auxiliary attribute is known

μ^5=y¯[αexp(D1d1D1+d1)+(1α)exp(D1d1D1+d1)], (11)

where α is unknown constant.

The bias and MSE of μ^5 to the first order of approximation is given by

Bias(μ^5)Y¯ξ[18(4α1)cd12(α12)ρyd1cycd1],

and

MSE(μ^5)ξY¯2cy2[1ρyd12]. (12)

The optimum value of α is given by

αopt12+ρyd1cycp1.

Singh and Kumar (μ^6)

Singh and Kumar [4] suggested ratio estimator for estimating population mean Y¯when the population proportion D1 and D2of auxiliary attribute are known

μ^6=y¯(D1d1)(D2d2), (13)

The bias and MSE of μ^6 to the first order of approximation are given by

Bias(μ^6)Y¯ξ[cd12+cd22+ρd1d2cd1cd2ρyd1cycd1ρyd2cycd2],

and

MSE(μ^6)ξY¯2[cy2+cd12+cd222(ρyd1cycd1ρyd1cycd1ρyd2cycd2)]. (14)

Singh and Kumar (μ^7)

Singh and Kumar [4] suggested product estimator for estimating population mean Y¯when the population proportion D1 and D2of auxiliary attribute are known

μ^7=y¯(d1D1)(d2D2), (15)

The bias and MSE of μ^7 to the first order of approximation are given by

Bias(μ^7)Y¯ξ[ρd1d2cd1cd2ρyd1cycd1+ρyd2cycd2],
MSE(μ^7)ξY¯2[cy2+cd12+cd22+2(ρyd1cycd1ρyd1cycd1+ρyd2cycd2)]. (16)

Ahmed et al. (μ^8)

Ahmed et al. [8] proposed a generalized class of factor type of estimators for estimating population mean Y¯when the population proportion D1 and D2of auxiliary attribute are known is given by

μ^8=y¯[exp(S1M1S1+M1)exp(S2M2S2+M2)] (17)

where,

S1=(A1+C1)D1+FB1d1, S2=(A2+C2)D2+FB2d2, M1=(A1+FB1)D1+C1d1, M2=(A2+FB2)D2+C2d2, Ai=(Ki1)(Ki2), Bi=(Ki1)(Ki4), and Ci=(Ki2)(Ki3)(Ki4).

The bias and MSE of μ^8 to the first order of approximation are given by

Bias(μ^8)=y¯ξ[12σ1ρyd1CyCd1+12σ2ρyd2CyCd2+18Cd12(σ122σ1v1)+18Cd22(σ222σ2v2)],

where v1=FB1+C1A1+FB1+C1 and v2=FB2+C2A2+FB2+C2 .

The minimum MSE of (μ^8)tothefirstorderofapproximation is given by

MSE(μ^8)y¯2Cy2[1Ryd1d22], (18)

where

Ryd1d22=ρyd12+ρyd222ρyd1ρyd2ρd1d21ρd1d22,

The optimum value of σ10pt and σ20pt are

σ10pt=2Cy(ρyd1ρd1d2ρyd2)Cd1(ρd1d221)andσ20pt=2Cy(ρyd2ρd1d2ρyd1)Cd2(ρd1d221).

Proposed estimator & its properties

Motivated by Singh and Espejo [14] and Lu [6], adopting the same procedure we proposed as estimator for estimating population mean by utilizing two auxiliary attributes as

(μ^pop)=ω1y¯+ω2(D1d1)+ω3(D2d2)4(D1d1+d1D1)(D2d2+d2D2) (19)

To obtain the bias and MSE of proposed estimator, applying the error approximation into Eq. (17), in terms of π’s, the proposed estimator can be expressed as

μ^prop=ω1Y¯(π0+1)ω2D1π1ω3D2π24(1π1+1+π1+1)(1π2+1+π2+1) (20)

Expanding the right-hand side of [20], we get

μ^prop=ω1Y¯(π0+1)ω2D1π1ω3D2π24[π1+1+(1π1+π12+)][π2+1+(1π2+π22+)]. (21)

In Eq. (21) we will neglect the terms of π’s, power having greater than two, we get

μ^propY¯(ω11)Y¯+ω1Y¯π0ω2D1π1ω3D2π2+12ω1Y¯π12+12ω1Y¯π22, (22)

To get the bias of the proposed estimator, we need to take expectation on both the sides of (22), hence we will get the bias of the proposed estimator up to the first order of approximation (Fig. 1).

E(μ^propY¯)(ω11)Y¯+ω1Y¯E(π0)ω2D1E(π1)ω3D2E(π2)+12ω1Y¯E(π12)+12ω1Y¯E(π22).

Fig. 1.

Fig 1

Graphical representation of estimators w.r.t to their MSE.

The bias of the proposed estimator is given by

Bias(μ^prop)=[E(y^pr)Y¯]Y¯[(ω11)+12ξω1Cd12+12ξω1Cd22].

Now for deriving the MSE of the proposed estimator, let us square both sides of (22), neglecting terms π’s having power greater than two, and taking expectation on both sides, after simplification we get,

MSE(μ^prop)=E(μ^propY¯)2,Y¯2[(ω11)2ξω1(Cd12+Cd22)+ξω1(Cd12+Cd22+Cy2)2ξω1Y¯Cy(ω2Cd1ρyd1D1+ω3Cd2ρyd2D2)+ξ(ω22Cd12D12+2Cd1Cd2ω2ω3ρd1d2Dd1Dd2+ω32Cd22D22)]. (23)

To obtain the optimum value of ω1,ω2andω3, we partially differentiate the Eq. (23) with respect to ω1,ω2andω3 and put it equal to zero, the optimum values of ω1,ω2andω3 are given by

ω1*=β[2+ξ(Cd12+Cd22)]2[β+Cy2ξα+βξ(Cd12+Cd22)],
ω2*=CyY¯(ρyd1ρd1d2ρyd2)[2+ξ(Cd12+Cd22)]2Cd1D1[β+Cy2ξα+βξ(Cd12+Cd22)],
ω3*=CyY¯(ρyd2ρd1d2ρyd1)[2+ξ(Cd12+Cd22)]2Cd2D2[β+Cy2ξα+βξ(Cd12+Cd22)].

The minimum MSE of μ^prop can be shown as,

MSE(μ^prop)min=ξY¯2[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)], (24)

where α=1ρd1d22ρyd12+2ρd1d2ρyd1ρyd2ρyd22 and β=1ρd1d22.

Theoretical comparison of estimators

In this section we compare our proposed estimator with the exiting estimators discussed in Section 2, we will have the conditions as follows:

From Eqs. (2) and (24)

MSE(μ^0)>MSE(μ^prop)minifandonlyif
cy2>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)].

From Eqs. (4) and (24)

MSE(μ^1)>MSE(μ^prop)minifandonlyif
[cy2+cd122ρyd1cycd1]>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)]

From Eqs. (6) and (24)

MSE(μ^2)>MSE(μ^prop)minifandonlyif
[cy2+cd12+2ρyd1cycd1]>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)]

From Eqs. (8) and (24)

MSE(μ^3)>MSE(μ^prop)minifandonlyif
[cy2+14cd12ρyd1cycd1]>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)]

From Eqs. (10) and (24)

MSE(μ^4)>MSE(μ^prop)minifandonlyif
[cy2+14cd12+ρyd1cycd1]>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)]

From Eqs. (12) and (24)

MSE(μ^5)>MSE(μ^prop)minifandonlyif
cy2[1ρyd12]>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)]

From Eqs. (14) and (24)

MSE(μ^6)>MSE(μ^prop)minifandonlyif
[cy2+cd12+cd222(ρyd1cycd1ρyd1cycd1+ρyd2cycd2)]>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)]

From Eqs. (16) and (24)

MSE(μ^7)>MSE(μ^prop)minifandonlyif
[cy2+cd12+cd22+2(ρyd1cycd1ρyd1cycd1+ρyd2cycd2)]>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)]

From Eqs. (18) and (24)

MSE(μ^8)>MSE(μ^prop)minifandonlyif
y¯2Cy2[1Ryd1d22]>[4Cy2αβξ(Cd12+Cd22)2]4[β+Cy2ξα+βξ(Cd12+Cd22)]

Results and discussion

To examine the dominance and applicability of proposed estimator in simple random sampling, we considered four real data sets from agricultural, fishers and education sectors available in Ahmad et al. [8]. A similar kind of population for reference is given as Appendix A.

Population 1. (Source from education sector)

Let y represent the number of instructors, d1 the total number of primary and secondary school students in Turkey in 2007, which was larger than 11,440.5, and d2 the total number of primary and secondary school students in Turkey in 2008, which was greater than 333.1647. The population information about data set is given as:

N=923,n=180,Y¯=436.4346,D1=2.6625,D2=3.125,ρyd1=0.6904898,ρyd2=0.652149,ρd1d2=0.8465885,Cd1=1.826732,Cd2=1.641621,Cy=1.718333

Population 2. (Source from fishers’ sector)

Let y represent the expected catch by recreational marine fishermen in 1995, d1 represent the percentage of fish captured larger than 1000 in 1993, and d2 represent the percentage of fish caught greater than 2000 in 1994. The population information about data set is given as:

N=69,n=14,Y¯=4514.89,D1=0.7391304,D2=0.5507246,ρyd1=0.3966081,ρyd2=0.538047,ρd1d2=0.6577519,Cd1=0.5984409,Cd2=0.9098277,Cy=1.350

Population 3. (Source from agricultural sector)

For the 47 districts of Pakistan, let y represent the tobacco area production in hectares for the year 2009, d1 represent the percentage of farms with tobacco cultivation areas greater than 500 ha for the year 2007, and d2 represent the percentage of farms with tobacco cultivation areas greater than 800 ha for the year 2008. The population information about data set is given as:

N=47,n=10,Y¯=1004.447,D1=0.4255319,D2=0.3829787,ρyd1=0.4395989,ρyd2=0.4661508,ρd1d2=0.9153857,Cd1=1.174456,Cd2=1.283018,Cy=2.341245

Population 4. (Source from agricultural sector)

Let y represent the amount of cotton produced in hectares in 2009, d1 represent the percentage of farms with cotton cultivation areas greater than 37 ha in 2007, and d2 represent the percentage of farms with cotton cultivation areas greater than 35 ha in 2008 for 52 districts in Pakistan. The population information about data set is given as:

N=52,n=11,Y¯=50.03846,D1=0.3846154,D2=0.4423077,ρyd1=0.7369579,ρyd2=0.6935718,ρd1d2=0.8877181,Cd1=1.277252,Cd2=1.13384,Cy=1.421524

We compute the MSE values of existing estimates and proposed estimator using Eqs. (2), (4), (6), (8), (10), (12), (14), (16), (18) and (24) and these values are shown in Table 1.

Table 1.

MSE values estimators.

Population
Estimator P1 P2 P3 P4
μ^0 2515.170 2,115,180.565 435,363.127 362.664
μ^1 1665.169 1,787,076.238 352,906.039 175.164
μ^2 9050.198 3,274,574.672 736,929.821 1135.734
μ^3 1379.541 1,847,217.179 366,745.882 195.717
μ^4 2536.028 1,295,483.198 279,378.887 338.001
μ^5 1315.997 1,782,466.925 351,230.425 165.699
μ^6 5151.870 2,045,097.266 480,329.895 466.088
μ^7 10,154.721 4,937,999.787 870,994.957 1306.264
μ^8 1275.439 1,496,047.750 340,313.804 163.048
μ^prop 1200.370 1,290,907.767 206,924.055 106.485

Note: Bold number indicate the lowest MSE.

To measure the Percentage Relative Efficiency (PRE), we apply the following formula:

PRE=MSE(μ^0)MSE(t)X100

where t=μ^0,μ^1,μ^2,μ^3,μ^4,μ^5,μ^6,μ^7,μ^8,and μ^prop

The numerical comparison of PRE for existing and proposed estimator is shown in Table 2.

Table 2.

Percentage Relative Efficiency (PRE) of estimators.

Population
Estimator P1 P2 P3 P4
μ^0 100 100 100 100
μ^1 151.046 118.360 123.365 207.043
μ^2 27.791 64.594 59.078 31.932
μ^3 182.319 114.506 118.710 185.300
μ^4 99.178 163.273 155.833 107.297
μ^5 191.123 118.666 123.954 218.870
μ^6 48.821 103.427 90.638 77.810
μ^7 24.768 42.835 49.985 27.763
μ^8 163.048 141.384 127.929 222.428
μ^prop 209.533 163.852 210.398 340.578

Note: Bold number indicate the higest PRE.

The proposed estimator is compared with the eight estimators as shown in Section 2 and results shown numerically and graphically. It is observed from Table 1, that proposed estimator attains the minimum mean square error 1200.370, 1,290,907.767, 206,924.055 and 106.485 for populations (P1,,P4), respectively. While from Table 2, it is revealed that PRE value of the proposed estimator for all four populations (P1,..,P4) is 209.533, 163.852, 210.398 and 340.578, respectively. The PRE of proposed estimator is highest in comparison to the existing estimators as well as all the estimators given in Ahmed et al. [8]. It is observed from Figs. 1 and 2 that proposed estimator is more efficient for population mean estimation in comparison to the existing estimators. In theory of sample surveys, the major objective of any statistician or investigator is to minimize the MSE and maximize the PRE in the estimation to ideally draw an inference about the study population. The proposed estimator is validated with the help of minimum MSE and maximum PRE. It attains the minimum MSE and maximum PRE in comparison of the other existing estimators as shown graphically for all four populations.

Fig. 2.

Fig 2

Graphical representation of PRE of estimatotrs w.r.t to μ^0.

Conclusion

In this article, we proposed an optimum estimator for estimation of population mean in simple random sampling by utilizing the information of the two auxiliary attribute. Mathematical expressions of the proposed estimator i.e. bias, mean squared error (MSE) and minimum mean squared error are derived. Mean square error of proposed estimator is shown in Section 3. The proposed estimator is efficient under the derived conditions discussed in Section 4. The implementation of proposed estimator helps in estimation of précised value of the population mean on the basis of which effective decisions can be made. It is recommended that the proposed estimator may be used in sample surveys in the areas of education, agriculture fisheries and health sciences, etc. Further, the proposed estimator can be extended to utilization of multi auxiliary information under various sampling designs.

CRediT authorship contribution statement

Monika Saini: Conceptualization, Methodology, Validation, Writing – review & editing. Bhatt Ravi Jitendrakumar: Investigation, Resources, Writing – original draft. Ashish Kumar: Formal analysis, Methodology, Writing – review & editing.

Declaration of Competing Interest

There is no conflict of interest among the authors.

Submission Type: Direct Submission

Appendix A

Hypothetical Population: (Source: Singh and Choudhary (20), pp. 177)

Let y be the area under wheat crop (in acres) during the year 1974, x1 be the area under wheat (in acres) 1971 and x2 be the area under wheat (in acres) in 1973.

Hypothetical Population: (Source: Singh and Choudhary [19], pp. 177)
S.N. Area under wheat (in acres)
(1971) (1973) (1974)
(x1) (x2) (y)
1 401 70 50
2 630 163 149
3 1194 320 284
4 1170 440 381
5 1065 250 278
6 827 125 111
7 1737 558 634
8 1060 254 278
9 360 101 112
10 946 359 355
11 4170 109 99
12 1625 481 498
13 827 125 111
14 96 5 6
15 1304 427 339
16 377 78 80
17 259 75 105
18 186 45 27
19 1767 564 515
20 604 238 249
21 700 92 85
22 524 247 221
23 571 134 133
24 962 131 144
25 407 129 103
26 715 190 175
27 845 363 335
28 1016 235 219
29 184 73 62
30 282 62 79
31 194 71 60
32 439 137 100
33 854 196 141
34 820 255 263

The population information about data set is given as:

  • 1 = 1 if farms under the wheat crop which have more than 500 acres land during the year 1971, otherwise ẟ1 = 0.

  • 2 = 1 if farms under wheat crop which have more than 100 acres land during the year 1973, otherwise ẟ2 = 0.

  • D1 be the proportion of farms under the wheat crop which have more than 500 acres land during the year 1971 and D2 be the proportion of farms under wheat crop which have more than 100 acres land during the year 1973.

Descriptive Statistics:

N=34,Y¯=199.4,D1=0.6765,D2=0.7353,ρyd1=0.599,ρyd2=0.599,ρd1d2=0.725,Sd12=0.225490,Sd22=0.200535,Sy2=22,564.6

Data availability

  • Data is available in public domain.

References

  • 1.Cochran W.G. The estimation of the yields of cereal experiments by sampling for the ratio of grain to total produce. J. Agric. Sci. 1940;30(2):262–275. [Google Scholar]
  • 2.Singh R., Chauhan P., Sawan N., Smarandache F. Ratio estimators in simple random sampling using information on auxiliary attribute. Auxiliary Information and a priori Values in Construction of Improved Estimators. Infinite Study. 2007;1:7–17. [Google Scholar]
  • 3.Malik S., Singh R. An improved estimator using two auxiliary attributes. Appl. Math. Comput. 2013;219(23):10983–10986. [Google Scholar]
  • 4.Singh H.P., Kumar S. A regression approach to the estimation of the finite population mean in the presence of non-response. Aust. N. Z. J. Stat. 2008;50(4):395–408. [Google Scholar]
  • 5.Ekpenyong E.J., Enang E.I. Efficient exponential ratio estimator for estimating the population mean in simple random sampling. Hacet. J. Math. Stat. 2015;44(3):689–705. [Google Scholar]
  • 6.Lu J. Efficient estimator of a finite population mean using two auxiliary variables and numerical application in agricultural, biomedical, and power engineering. Math. Probl. Eng. 2017:1–7. doi: 10.1155/2017/8704734. [DOI] [Google Scholar]
  • 7.Zaman T., Kadilar C. Novel family of exponential estimators using information of auxiliary attribute. J. Stat. Manag. Syst. 2019;22(8):1499–1509. [Google Scholar]
  • 8.Ahmad S., Arslan M., Khan A., Shabbir J. A generalized exponential-type estimator for population mean using auxiliary attributes. PLoS ONE. 2021;16(5) doi: 10.1371/journal.pone.0246947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Mahajan J., Banal K., Mahajan S. Estimation of crop production using machine learning techniques: a case study of J&K. Int. J. Inf. Technol. 2021;13(4):1441–1448. [Google Scholar]
  • 10.Kumar A., Saini M. A predictive approach for finite population mean when auxiliary variables are attributes. Thail. Stat. 2022;20(3):575–584. [Google Scholar]
  • 11.Yunusa M.A., Gidado A., Ahijjo Y.M., Danbaba A., Abdulazeez S.A., Audu A. Modified classes of regression-type estimators of population mean in the presences of auxiliary attribute. Asian Res. J. Math. 2022;18(1):65–89. [Google Scholar]
  • 12.Naik V.D., Gupta P.C. A note on estimation of mean with known population proportion of an auxiliary character. J. Ind. Soc. Agric. Stat. 1996;48(2):151–158. [Google Scholar]
  • 13.Kumar S., Bhougal S. Estimation of the population mean in presence of non-response. Commun. Stat. Appl. Methods. 2011;18(4):537–548. [Google Scholar]
  • 14.Singh H.P., Espejo M.R. On linear regression and ratio–product estimation of a finite population mean. J. R. Stat. Soc. Ser. D. 2003;52(1):59–67. (The Statistician) [Google Scholar]
  • 15.Waheeb, R., Wheib, K., Andersen, B., & Alsuhili, R. (2022). The prospective of Artificial Neural Network (ANN's) model application to ameliorate management of post disaster engineering projects. Available at SSRN 4180813. doi: 10.2139/ssrn.4180813. [DOI]
  • 16.Rajak A.A. Emerging technological methods for effective farming by cloud computing and IoT. Emerg. Sci. J. 2022;6(5):1017–1031. [Google Scholar]
  • 17.Jabal Z.K., Khayyun T.S., Alwan I.A. Impact of climate change on crops productivity using MODIS-NDVI time series. Civ. Eng. J. 2022;8(06):1136–1155. [Google Scholar]
  • 18.Wayangkau I.H., Mekiuw Y., Rachmat R., Suwarjono S., Hariyanto H. Utilization of IoT for soil moisture and temperature monitoring system for onion growth. Emerg. Sci. J. 2020;4:102–115. [Google Scholar]
  • 19.Singh D., Chaudhary F.S. John Wiley & Sons; 1986. Theory and Analysis of Sample Survey Designs. [Google Scholar]
  • 20.Cochran W.G. John Wiley & Sons; 2007. Sampling Techniques. [Google Scholar]
  • 21.Koyuncu N., Kadilar C. Family of estimators of population mean using two auxiliary variables in stratified random sampling. Commun. Stat. Theory Methods. 2009;38(14):2398–2417. [Google Scholar]
  • 22.Shabbir J., Gupta S. Estimation of the finite population mean in two phase sampling when auxiliary variables are attributes. Hacet. J. Math. Stat. 2010;39(1):121–129. [Google Scholar]
  • 23.Abd-Elfattah A.M., El-Sherpieny E.A., Mohamed S.M., Abdou O.F. Improvement in estimating the population mean in simple random sampling using information on auxiliary attribute. Appl. Math. Comput. 2010;215(12):4198–4202. [Google Scholar]
  • 24.Rather, K.U.I., Jeelani, M.I., Rizvi, S.E.H., & Sharma, M. (2022). A new exponential ratio type estimator for the population means using information on auxiliary attribute. J. Stat. Appl. Pro. Lett. 9, No. 1, 31-42 (2022).
  • 25.Zaman T., Sagir M., Şahin M. A new exponential estimators for analysis of COVID-19 risk. Concurr. Comput. Pract. Exp. 2022;34(10):e6806. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

  • Data is available in public domain.


Articles from MethodsX are provided here courtesy of Elsevier

RESOURCES