Skip to main content
Wiley - PMC COVID-19 Collection logoLink to Wiley - PMC COVID-19 Collection
. 2022 Oct 30;34(28):e7386. doi: 10.1002/cpe.7386

Evaluating COVID‐19 risk under the estimation of population mean using two attributes

Nadia Mushtaq 1, Iram Saleem 1,2,, Mustansar Aatizaz Amjad 2
PMCID: PMC9874473  PMID: 36714183

Abstract

The virus of COVID‐19 has affected humans physically, mentally, and economically all over the world. Each country's development level, resources, and immunization to have coping capacity against this covid19 differ country‐wise. Thus, understanding socioeconomic vulnerability and coping capacity among countries under different health systems can be crucial. Contrasting most articles on COVID‐19, this article focuses on evaluating and estimating the COVID‐19 risk and lack of coping capacity in 190 countries. This present study suggests an exponential estimator using two auxiliary attributes. Theoretically, the mean square error expressions are derived and compared with some existing estimators. These findings were supported by using the real data of INFORM COVID‐19 risk.

Keywords: auxiliary attributes, COVID‐19 risk, exponential estimator, immunization coverage, lack of coping capacity, mean square error

1. INTRODUCTION

The coronavirus, or COVID‐19, is agitating panic for thus several reasons. Therefore, nobody has the immunity to fight it. Second, it is highly contagious, which means it spreads fast. Third, the scientists do not seem to be entirely sure of how it behaves since they need borderline history. Hoseinpour Dehkordi et al. 1 presented a new procedure based on a regression approach to estimate the COVID‐19 case fatality rate during the pandemic. They have also compared the case history of a few Asian and European countries. Hoang 2 presented the characteristics of COVID‐19 case detection in two phases to overcome the pandemic using the data set from the Ministry of Health of Vietnam. At that time, the situation in Vietnam was more volatile than expected. Overton et al. 3 provided a toolkit to analyze the outbreak and access the interventions using the statistical and mathematical model‐based toolkit. They applied three different COVID‐19 pandemic datasets to test the toolkit and focused on parameter estimation under known biases in the datasets. Wong et al. 4 explained that not only do different countries have different resources for the health system to make. The immunization is strong so that people can cope with the disease, but also have a diversity of economic, political, and social features that the government's effort was unable to cope with disasters like COVID‐19. In an article by Silveira et al. 5 the measures of relevance to psychological vulnerability, resilience, and social cohesion were identified for Germany in a heterogeneous Berlin population ranging from 18 to 65 years of age. Lockett et al. 6 studied that people with higher levels of posttraumatic stress symptoms and COVID‐19‐related functional impairment would display greater anxiety and lower coping self‐efficacy after pandemic reminders. From these studies, one can conclude that the COVID‐19 pandemic devastated the people economically and physically and mentally in many cases.

In survey sampling, we deal with different interdisciplinary research territories such as population studies, agriculture, health issues, engineering, etc. Therefore, the researchers are interested in estimating the average using the proportion of the population that possesses a specified attribute as an auxiliary variable.

Wynn 7 firstly suggested a ratio estimator using the auxiliary attribute when the population is divided into two classes. Singh et al. 8 proposed a ratio of proportion utilizing the attribute as a piece of auxiliary information. Naik and Gupta 9 discussed the point biserial correlation coefficient idea. Solanki and Singh 10 recommended a class of estimators for the population mean of the study variable using the known population proportion of the auxiliary attribute. Malik and Singh 11 proposed the exponential type of estimator using two auxiliary attributes. Zaman 12 suggested a new family of estimators using auxiliary information for simple random sampling when the data are qualitative in nature. Yadav and Zaman 13 presented a family of estimators of population mean utilizing the know parameters of the auxiliary attribute. Zaman, 14 Sajjad et al., 15 Zaman and Kadilar, 16 and Bhushan et al. 17 suggested estimators estimate the population mean using auxiliary attributes. Bhushan et al. 18 introduced a modified class of estimators to estimate the population variance using auxiliary attributes. The auxiliary attributes are the information readily available sometimes to estimate the variable of interest for example: to estimate the proportion of voters favoring a proposition, the proportion of drug usage in both genders, the irrigation status, and yield of the crop, etc.

Regarding the COVID‐19 pandemic, few survey statisticians have participated in evaluating and estimating the dynamics of the evolution of COVID‐19 risk as Shahzad et al. 19 presented some new calibration estimators to estimate population variance of recovery time of COVID‐19 under two‐phase stratified sampling design. Further, Zaman et al. 14 presented two families of multivariate exponential estimators for simple random sampling to measure the proportion of health care and socioeconomic vulnerability during the COVID‐19 pandemic.

This present study estimated the lack of coping capacity among different countries of the world based on their health systems and immunization capability using the INFORM COVID‐19 risk datasets. The INFORM COVID‐19 risk index facilitates us with the devastating effects of the pandemic on the human health system all over the world. This article aims to suggest a general form of the exponential estimator to estimate the population mean for two auxiliary attributes using COVID‐19 risk. The expression of the mean square error (MSE) of the suggested estimator is derived up to the first order of approximation. The outline of the article is organized as follows: In Section 2 the suggested estimator is introduced using the two auxiliary attributes along with the MSE expression. In Section 3, the theoretical comparison of the suggested exponential estimator and some existing estimators using the MSE expressions are reported. A real‐life application using INFORM COVID‐19 risk dataset is discussed in Section 4. The concluding remarks are given in Section 5.

2. METHODOLOGY

In this section properties of the proposed estimator are explained, and datasets are described.

2.1. The suggested estimator and properties

The exponential estimators can be used for the exponential relationship between study and auxiliary attributes Sanaullah et al. 20 The current article aims to propose an exponential estimator to estimate the population mean of the study variable considering the information about the population proportion possessing certain attributes.

2.1.1. Notations

Consider U = (u 1 , u 2 , …, u N ) be a finite population of size N. We draw a sample of size n by SRSWOR from a population U″. Let yi be the study variable and ϕi be the characteristic of the auxiliary attribute such as:

ϕi=1if theithunit possesses the attribute.ϕi=0other.

Let A=i=1Nϕi and a=i=1nϕi be total units in population and sample, respectively.

Let P=A/N and p=a/n be the proportion of units in the population and sample, respectively.

Let Y=i=1NyiN and y=i=1nyin be the population and sample mean, respectively.

Let P1 and P2 be the population proportion and p1 be the sample proportion of the auxiliary attributes.

To obtain MSE the error term and their expectations are given as:

eo=(yY)Y,e1=p1P1P1,e2=p2P2P2bethe error terms.

such that.

Eei=0,(i=0,1,2),Ee02=λCy2,Ee02=λCp12,Ee02=λCp22,
Ee0e1=λρyp1CyCp1,Ee0e2=λρyp2CyCp2,Ee1e2=λρp1p2Cp1Cp2,

where λ=1n1N, ρyp1=Syp1SySp1, ρyp2=Syp2SySp2, ρp1p2=Sp1p2Sp1Sp2,

Cy=SyY,Cp1=Sp1P1,Cp2=Sp2P2,Sy2=i=1NyiY2N1,Sp12=i=1Np1iP12N1
Sp22=i=1Np2iP22N1,Syp1=i=1NyiYp1iP1N1,Syp2=i=1NyiYp2iP2N1
Sp1p2=i=1Np1iP1p2iP2N1.

2.1.2. Properties of the suggested estimator

We propose the following unbiased exponentiation ratio type estimator in estimating the population mean when using two auxiliary attributes as given by.

yM=m=03αmtm(m=0,1,2,3). (1)

Such that αmR, and αm (m=0,1,2,3) denotes suitably chosen constants and R stands for the set of real numbers.

Rewriting (1), we have

yM=α0t0+α1t1+α2t2+α3t3, (2)

where t0=y,t1=yexpP1p1P1+p1, t2=yexpP2p2P2+p2, t3=yexpP1p1P1+p1expP2p2P2+p2.

then yM can be written as:

yM=yα0+α1expP1p1P1+p1+α2expP2p2P2+p2+α3xpP1p1P1+p1expP2p2P2+p2. (3)

Expressing the above equation in terms of error terms to a first‐order approximation, we have

yM=y1+e0α0+α1expe12+e1+α2expe22+e2+α3expe12+e1expe22+e2. (4)

The MSE of the above equation is given as:

MSEyM=Y2Ee0e1α12+α32e2α22+α32+Oe22. (5)

Let

α12+α32=α(say)α22+α32=α*(say), (6)

where α and α* are another constant.

Therefore MSE of yM in the form of α and α* can be written as

MSEyMY2Eeoαe1α*e22. (7)

By expanding and applying expectations, we get

MSEyM=θY2Cy2+α2Cp12+α*2Cp222αρyp1CyCp12α*ρyp2CyCp2+2αα*ρp1p2Cp1Cp2. (8)

Now setting

MSEyMα=0

So ρyp1=ρyp2.

Let Lab=ρabCaCb.

The subscripts a and b show the presence of classes and a ≠ b.

Therefore α=Lyp1α*Lp1p2.

Now ∂MSEyMα*=0, we get

α*=Lyp2αLp1p2,

where Lyp2=ρyp2CyCp2 and Lp1p2=ρp1p2Cp1Cp2.

Now solving the above, we have

α=Lyp1Lyp2Lp1p21Lp1p2Lp2p1=L(say)α*=Lyp2Lyp1Lp1p21Lp1p2Lp2p1=L*(say). (9)

Substituting α′ and α* after solving we get the minimum given by

MSEyMminθY2Cy21ρyp12+ρyp222ρyp1ρyp2ρp1p21ρp1p22.
MSEyMminθY2Cy21Ry.p1p22, (10)

where

Ry.p1p22=ρyp12+ρyp222ρyp1ρyp2ρp1p21ρp1p22.

3. COMPARISON OF THE ESTIMATORS

This section compares the efficiencies of the proposed estimator with other estimators based on their MSE expressions. In the first part, we discussed existing estimators and in the second part, we make the theoretical comparison.

3.1. Existing estimators

  1. The usual mean estimator is
    y0=y. (11)

The var of the yo is given as:

Vary0=θY2Cy2. (12)
  • ii
    The regression type estimator is given as:
    yReg=y+β(Pp) (13)

where β is given as

β=Cov(Y,P)Var(P).
VaryReg=θY2Cy21ρyp2, (14)
  • iii
    The exponential type of estimator is given as:
    yexp=yexpPpP+p, (15)
    MSEyexp=θY2Cy2+14Cp22ρypCyCp. (16)
  • iv
    Kumar and Bhoughal proposed exponential ratio‐product estimator is given as:
    yKB=yαexpP1p1P1+p1+(1α)expp1P1p1+P1. (17)
    MSEyRPmin=θY2Cy21ρyp12. (18)

The optimum value of α is given as:

αopt12+ρyp1CyCp1.
  • v
    Singh and Kumar suggested a ratio‐product estimator for two auxiliary attributes given:
    ySK=yP1p1P2p2. (19)
    MSEySK=θY2Cy2+Cp12+Cp222ρyp1CyCp1+ρyp2CyCp2ρp1p2Cp1Cp2. (20)

3.2. Theoretical comparison of the proposed and existing estimators

  1. From (12) and (10), we have
    VaryMminVary0=Ry.p1p220, (21)

where Ry.p1p22=ρyp12+ρyp222ρyp1ρyp2ρp1p21ρp1p22.

  • ii
    From (14) and (10), we have
    VaryMminVaryReg=Ry.p1p22ρyp120. (22)
  • iii
    From (16) and (10), we have
    VaryMminVaryexp=14Cp2+2ρypCyCp+Cy2Ry.p1p220. (23)
  • iv
    From (18) and (10), we have
    varYMminvarYKB=Ry.p1p22ρyp120. (24)
  • v
    From (20) and (10), we have
    varYMminvarYSK.
    Cp12+Cp222ρyp1CyCp1+ρyp2CyCp2ρp1p2Cp1Cp2+Cy2Ry.p1p220 (25)

The proposed estimators are more efficient than traditional estimators if conditions ((21), (22), (23), (24), (25)) are satisfied.

4. THE REAL‐LIFE EXAMPLE

In this section, the performance of the proposed estimator has been evaluated by using INFORM COVID‐19 risk data.

4.1. The data description

The data concerning the COVID‐19 risk index, access to the socioeconomic vulnerability index, and lack of coping capacity were obtained from the INFORM COVID‐19 risk dataset. 21 In this article, different studies and auxiliary attributes have been chosen. Firstly, the study variable is the lack of coping capacity data scaled for 190 countries. From the same index, the first auxiliary attribute P1 is health system capacity data scaled from 0 to 10 (0 is low capacity, 10 is high capacity). The other auxiliary attribute P2 is immunization coverage based on a scale of 0 to 10 (0 is low coverage, 10 is high coverage).

Secondly, from the socioeconomic vulnerability index, the study variable is the COVID‐19 vulnerability and attribute variables are P1 is awareness, and P2 is demographic and co‐morbidities. The figures for datasets are given as:

According to Figure 1, Canada, Denmark, and Syria have the least COVID‐19 lack coping capacity whereas Colombia is a highly lacking country with coping capacity for COVID‐19. If we observe health system capacity, Belgium and Micronesia have the least proportion as compared to Malta with 10. Similarly, we noticed the immunization proportion coverage among 190 countries with which Egypt and Sri Lanka with 10.1 which means the highest immunization coverage capacity.

FIGURE 1.

CPE-7386-FIG-0001-c

COVID‐19 lack coping capacity with health system capacity and immunization coverage for 190 countries

In Figure 2, comparing 190 countries Peru and Denmark have the highest vulnerability capacity of bearing COVID‐19 risk. If we observe the same figure, Canadian people are more aware as compared to all other countries, and the United States has the least awareness regarding COVID‐19. Burkina Faso has the uppermost demographic and co‐morbidities proportion as compared to the other countries.

FIGURE 2.

CPE-7386-FIG-0002-c

COVID‐19 vulnerability evaluate with awareness and demographic and co‐morbidities

4.2. Results and discussion

The MSE and percent relative efficiency of the proposed and exponential estimator with some existing estimators for N = 190 countries are presented in Tables 1 and 2. Using simple random sampling, we assume n = 60, n = 80, and n = 100 for two datasets.

TABLE 1.

Mean square error (MSE) and PRE for dataset 1

n = 60 n = 80 n = 100
Estimators MSE PRE MSE PRE MSE PRE
y0
0.05455 100 0.03433 100 0.02276 100
yReg
0.05441 100.25 0.03421 100.35 0.02266 100.44
yexp
0.01214 449.34 0.00764 449.36 0.00505 450.69
yM
0.00639 853.67 0.00402 853.98 0.00266 855.63
y¯KB
0.00653 835.37 0.00413 831.23 0.00291 782.13
y¯SK
0.00805 677.63 0.00632 543.19 0.00381 597.37

TABLE 2.

Mean square error (MSE) and PRE for dataset 2

n = 60 n = 80 n = 100
Estimators MSE PRE MSE PRE MSE PRE
y0
0.00059 100 0.00043 100 0.00033 100
yReg
0.00051 115.68 0.00032 134.37 0.00021 157.14
yexp
0.00030 196.67 0.00028 215.00 0.00016 206.25
yM
0.00027 218.51 0.00019 226.31 0.00009 366.67
y¯KB
0.00028 210.71 0.00021 204.76 0.00010 330.00
y¯SK
0.00029 203.44 0.00025 172.00 0.00013 253.84

The relative efficiency values of the proposed estimator concerning traditional estimators are as below.

PRE=MSEy0MSEyi. (26)
i=Reg,exp,M,KB,SK.

As noted earlier in Table 1, Y represents lack of coping capacity, P1 means the proportion of health capacity, and P2 represents the proportion of immunization coverage considered for the present study. The coefficient of correlation between lack of coping capacity and proportion of health capacity is 0.1484. These values indicate that the lack of coping capacity during COVID‐19 and the proportion of health capacity have a very low correlation. In other words, the lack of coping capacity is not attentively related to the proportion of health capacity. Similarly, for the lack of coping capacity and the proportion of immunization coverage correlation coefficient is 0.4731. This correlation coefficient value specifies that the two variables are moderately correlated; in other words, the lack of coping capacity is thoroughly correlated to immunization coverage.

In Table 2, the study variable Y symbolizes COVID‐19 vulnerability, P1 signifies the proportion of awareness about COVID‐19, and P2 represents the proportion of demographic and co‐morbidities during COVID‐19 considered for the present study. The correlation coefficient value between COVID‐19 vulnerability and the proportion of awareness is −0.590. It shows that the variables COVID‐19 vulnerability and the proportion of awareness are strictly contradictory with a negative correlation. On the other hand, the coefficient of correlation between the variables COVID‐19 vulnerability and the proportion of demographic and co‐morbidities is 0.3326. This value indicates that both variables are moderately related to each other.

In this context, a generalized exponential estimator has been proposed to estimate the population mean of study variable having two different situations of COVID‐19 and their attributes such as (i) Y represents lack of coping capacity, P1 represents the proportion of health capacity, and P2 represents the proportion of immunization coverage (ii) Y symbolizes COVID‐19 vulnerability, P1 signifies the proportion of awareness about COVID‐19, P2 represents the proportion of demographic and co‐morbidities during COVID‐19. Using these two datasets, the proposed estimator and some existing estimators are compared numerically and theoretically. Tables 1 and 2 represent the MSE and PRE results of the proposed exponential estimator and some existing estimators. It is noticed from the numerical results that using both datasets the generalized exponential estimator yM performs more efficiently as compared to the competing estimators either for positive or negative correlation among study and auxiliary attributes.

5. CONCLUSION

This study proposes an exponential estimator to estimate the population mean using two auxiliary attributes performing more efficiently as compared to existing estimators mentioned in this article. The statistical properties of the estimator such as the bias and minimum MSE have been derived in the first order of approximation. In the first step, the proposed estimator is theoretically compared with the existing estimators. After that COVID‐19 risk dataset is used in the numerical comparison. The numerical study confirms that the proposed estimator has the minimum MSE and maximum PRE values among compared estimators. Therefore, using the proposed estimators in practice for the present study and such issues is recommended.

CONFLICT OF INTEREST

The authors have no conflicts of interest to declare. All coauthors have seen and agree with the contents of the manuscript and there is no financial interest to report. We certify that the submission is original work and is not under review at any other publication.

Mushtaq N, Saleem I, Amjad MA. Evaluating COVID‐19 risk under the estimation of population mean using two attributes. Concurrency Computat Pract Exper. 2022;34(28):e7386. doi: 10.1002/cpe.7386

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are openly available on https://drmkc.jrc.ec.europa.eu/inform‐index/INFORM‐Covid. The code that supports the findings of this study is available at https://drmkc.jrc.ec.europa.eu/inform‐index.

REFERENCES

  • 1. Hoseinpour Dehkordi A, Alizadeh M, Derakhshan P, Babazadeh P, Jahandideh A. Understanding epidemic data and statistics: a case study of COVID‐19. J Med Virol. 2020;92(7):868‐882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Hoang VM, Hoang HH, Khuong QL, La NQ, Tran TTH. Describing the pattern of the COVID‐19 epidemic in Vietnam. Glob Health Action. 2020;13(1):1776526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Overton CE, Stage HB, Ahmad S, et al. Using statistics and mathematical modelling to understand infectious disease outbreaks: COVID‐19 as an example. Infect Dis Model. 2020;5:409‐441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Wong MC, Teoh JY, Huang J, Wong SH. The potential impact of vulnerability and coping capacity on the pandemic control of COVID‐19. J Infect. 2020;81(5):816‐846. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Silveira S, Hecht M, Matthaeus H, Adli M, Voelkle MC, Singer T. Coping with the COVID‐19 pandemic: perceived changes in psychological vulnerability, resilience and social cohesion before, during and after lockdown. Int J Environ Res Public Health. 2022;19(6):3290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Lockett M, Pyszczynski T, Koole SL. Pandemic reminders as psychological threat: thinking about COVID‐19 lowers coping self‐efficacy among trauma‐exposed adults. Cognit Emot. 2022;36(1):23‐30. [DOI] [PubMed] [Google Scholar]
  • 7. Wynn HP. An unbiased estimator under SRS of a small change in a proportion. Stat. 1976;25:225. doi: 10.2307/2987838 [DOI] [Google Scholar]
  • 8. Singh G, Singh RK, Kaur P. Estimating ratio of proportions using auxiliary information. Biom J. 1986;28:637‐643. doi: 10.1002/bimj.4710280515 [DOI] [Google Scholar]
  • 9. Naik VD, Gupta PC. A note on estimation of mean with known population proportion of an auxiliary character. J Ind Soc Agric Stat. 1996;48:151‐158. [Google Scholar]
  • 10. Solanki RS, Singh HP. Improved estimation of population mean using population proportion of an auxiliary character. Chilean J Stat. 2013;4(1):3‐17. [Google Scholar]
  • 11. Malik S, Singh R. An improved estimator using two auxiliary attributes. Appl Math Comput. 2013;219:10983‐10986. doi: 10.1016/j.amc.2013.05.014 [DOI] [Google Scholar]
  • 12. Zaman T. New family of estimators using two auxiliary attributes. Int J Adv Res I Eng Manag (IJAREM). 2018;4(11):11‐16. [Google Scholar]
  • 13. Yadav SK, Zaman T. A general exponential family of estimators for population mean using auxiliary attribute. J Sci Arts. 2020;20(1):25‐34. [Google Scholar]
  • 14. Zaman T, Sagir M, Şahin M. A new exponential estimators for analysis of COVID‐19 risk. Concurr Comput Pract Exp. 2022;34(10):e6806. [Google Scholar]
  • 15. Sajjad I, Hanif M, Koyuncu N, Shahzad U, Al‐Noor NH. A new family of robust regression estimators utilizing robust regression tools and supplementary attributes. Stat Trans New Ser. 2021;22(1):207‐216. [Google Scholar]
  • 16. Zaman T, Kadilar C. Exponential ratio and product type estimators of the mean in stratified two‐phase sampling. AIMS Math. 2021;6(5):4265‐4279. [Google Scholar]
  • 17. Bhushan S, Kumar A, Kumar S, Singh S. Some modified classes of estimators for population variance using auxiliary attribute. Pak J Stat. 2022a;38(2):235‐252. [Google Scholar]
  • 18. Bhushan S, Kumar A, Tyagi D, Singh S. On some improved classes of estimators under stratified sampling using attribute. J Reliab Stat Stud. 2022b;16:187‐210. [Google Scholar]
  • 19. Shahzad U, Ahmad I, Almanjahie I, Al‐Noor NH. Utilizing L‐Moments and calibration method to estimate the variance based on COVID‐19 data. Fresenius Environ Bull. 2021;30(7A):8988‐8994. [Google Scholar]
  • 20. Sanaullah A, Ali HA, ul Amin MN, Hanif M. Generalized exponential chain ratio estimators under stratified two‐phase random sampling. Appl Math Comput. 2014;226:541‐547. [Google Scholar]
  • 21. INFORM . INFORM COVID‐19 risk index; 2021. https://drmkc.jrc.ec.europa.eu/inform‐index/INFORM‐Covid19#:˜:text=The20INFORM20COVID2D1920Risk,need20for20additional20international20assistanceE2809D.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data that support the findings of this study are openly available on https://drmkc.jrc.ec.europa.eu/inform‐index/INFORM‐Covid. The code that supports the findings of this study is available at https://drmkc.jrc.ec.europa.eu/inform‐index.


Articles from Concurrency and Computation are provided here courtesy of Wiley

RESOURCES