Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Jul 2;15(7):e0235531. doi: 10.1371/journal.pone.0235531

Using social choice theory and acceptability analysis to measure the value of health systems

Hai Shen 1, Yubing Sui 2,*, Yelin Fu 3
Editor: Fausto Cavallaro4
PMCID: PMC7332049  PMID: 32614864

Abstract

The Future Health Index (FHI) is developed by the Royal Philips to help determine the readiness of countries to address global health challenges and build sustainable, fit-for-purpose national health systems. The FHI 2018 presents the Value Measure to measure the value of 16 health systems, which is formulated by taking the arithmetic average of Access, Satisfaction and Efficiency. However, this scheme is not the Pareto optimal and loses association with weights. For these reasons, this paper proposes to apply the social choice theory and Stochastic Multicriteria Acceptability Analysis for group decision making (SMAA-2) to measure the value of health systems, by means of re-constructing the Value Measure. Specifically, we begin with considering all possible individual preferences among Access, Satisfaction and Efficiency, which is mathematically represented by ranked weights of them; the pessimistic and optimistic outcomes under certain individual preference are derived in a closed-form manner, according to which an interval decision matrix is then formulated; the SMAA-2 is then lastly applied to compute the holistic acceptability index, which is considered as a revised Value Measure. An empirical study using the data of 16 health systems is conducted to show the effectiveness and superiority of our method. It is demonstrated that our method always outperforms the Value Measure, by means of comparing the Spearman’s rank correlation coefficients.

Introduction

The challenges of delivering health care in many countries are receiving increasing attentions as costs continue to rise and evidence of uneven quality accumulates [1]. Although most health care reforms have focused on coverage, the far bigger long-term driver of success will originate from restructuring the health care delivering system to a value-based system [2]. The concept of value-based health care suggests a change of model in which the provision of health services does not focus on the quantity of services provided but on the value they generate, understanding value as overall quality of care and health outcomes related to the costs achieving those outcomes. In this sense, as a people-centric approach, value-based health care describes a system with the goal of increasing access to care, improving patient outcomes, and delivering satisfaction to both patients and practitioners at optimum cost. In other words, value-based health care is contextual, geared towards providing the right care in the right place, at the right time and at the right level of cost. Therefore, achieving high value for various stakeholders must become the overwhelming goal of health care delivery. Rigorous, disciplined measurement and improvement of value are the best way to drive system progress [3]. Nevertheless, the value in health remains largely misunderstood and unmeasured.

The Future Health Index (FHI) is a research-based platform designed by the Royal Philips to help determine the readiness of countries to address global health challenges and build sustainable, fit-for-purpose national health systems. The FHI 2016 measures perceptions to produce a snapshot of how health care is experienced on both sides of the patient-professional divide. The FHI 2017 compares these perceptions to the reality of health systems in each country researched. The FHI 2018 builds on the increasing consensus that, with the rise of chronic diseases and health care costs, the value-based care model is the best approach to address these challenges. In addition, the FHI 2018 identifies key challenges that form a barrier to the large-scale adoption of value-based care and improved population access; and assesses where connected care technology—data collection and analytics, and telehealth—can help speed up the health care transformation process. The FHI 2018 measures and assesses the value presented in 16 health systems of developed and developing markets through proposing a broadly applicable composite indicator, namely, Value Measure. The Value Measure combines criteria with respect to value-based health care and access to care, arguably the ultimate goals of modern health care.

The Value Measure consists of three metrics: Access (how universal, and affordable, is access to health care in the designated market?), Satisfaction (to what extent do the general population and practitioners in the designated market see the health care system as trustworthy, and effective?) and Efficiency (does the system in the given market produce outcomes at an optimum cost?). The components of Value Measure are listed in Table 1. Each metric is composed of several sub-metrics, which are normalized to ensure comparability across countries and are scored to fit onto a 0 to 100 scale. The scores for each sub-metric are arithmetically averaged to calculate each metric sore and those scores are then arithmetically averaged to construct the Value Measure. That is,

ValueMeasure=Access+Satisfaction+Efficiency3. (1)

Table 1. Value measure.

Value Measure Access Skilled health professional density (per 10,000 population)
Risk of impoverishment due to surgical care (% of people at risk)
Hospital beds (per 10,000 population)
Satisfaction Trust in health care system (HCPs and general population)
Health care system meets needs (HCPs and general population)
Rating of health care system overall (HCPs)
Efficiency Health care spend as a percentage of GDP
Tuberculosis: incidence and treatment success rates
Life and healthy life expectancy at birth
Probability of dying from key chronic diseases between 30 and 70
Neonatal mortality rate
Maternal mortality rate

The scores for these sub-metrics use a combination of third-party data and survey data. Specifically, the third-party data is sourced from many organizations including the World Health Organization, The Commonwealth Fund, and the World Bank, while the survey data is collected from the countries analyzed using their native language. A combination of face-to-face, online and phone interviewing is employed. The sample from the survey includes 24,654 adults and 3,244 health care professionals.

As shown in (1), the Value Measure assigns equal weights to Access, Satisfaction and Efficiency, this plausible scheme results in substantial information loss [4]. In addition to this, the arithmetical average is significantly affected by the extreme values, not Pareto optimal, and losses association with weights [5]. All these shortcomings inspire scholars and practitioners to develop new methods for improving the calculation of the Value Measure. The contribution of this paper is the development of a new method to modify the Value Measure released by the Royal Philips for measuring the value of health systems, based upon the social choice theory and Stochastic Multicriteria Acceptability Analysis for group decision making (SMAA-2). Social choice is the theory of how one designs or chooses a mechanism to summarize from a set of individual preference orders over alternatives available to a society of those individuals to a collective or social preference order over those same alternatives [6, 7]. Stochastic Multicriteria Acceptability Analysis (SMAA) is a multicriteria decision support method for multiple experts in discrete problems, based on exploring the weight space to describe the valuations that make each alternative the preferred one [8, 9]. SMAA-2 extends SMAA by taking into account information about other ranking positions, therefore identifies good compromise alternatives.

Specifically, we begin with eliminating the equalitarianism assumption to consider all possible individual preferences among three metrics. Certain individual preference is mathematically represented by a set of ranked weights. It seems reasonable that a decision maker should at least rank the metrics, since rankings are normally easier to provide than usually inaccessible precise weights information [4, 10]. In the meanwhile, the decision maker may be unable, unavailable, or even unwilling to obtain sufficiently precise weights [11]. Nevertheless, it is difficult to achieve consensus about exact weights in a problem with multiple decision makers [9]. In this sense, we then calculate the worst and best outcomes under certain individual preference in a closed-form manner, according to which an interval-valued decision matrix is formulated with country-as-row and individual preference-as-column. Lastly, the SMAA-2 is applied to obtain the holistic acceptability index for each country, which is regarded as an improved version of the Value Measure. We compute the Spearman’s rank correlation coefficients to demonstrate the superiority and rationality of the proposed method. This study proposes a new incentive and a feasible direction to measure the value of health systems in an appropriate manner, along with the provision of some academic, managerial and policy-related implications.

The remainder of the paper is organized as follows. We present the method for improving the evaluation of the Value Measure in Section 2, followed by an empirical study for a panel of 16 countries in Section 3. We conclude in Section 4 by discussing the details of our method and suggestions for future research.

2. Method

For the purpose of measuring the value of health systems that are previously aggregated using the arithmetic average, this section proposes a method for the general case with m Decision Making Units (DMUs) and n metrics, which can be easily applied to improve the Value Measure with Access, Satisfaction and Efficiency. xij, i = 1, 2, …, m, j = 1, 2, …, n indicates the performance of DMU i under sub-index j. To adjust values measured on different scales to a notionally common scale, we use the feature scaling (alternative known as min-max normalization) to scale the range in [0, 1]:

zij=xij-mini{xij}maxi{xij}-mini{xij},i=1,2,,m,j=1,2,,n.

The method proposed is two-fold and begins with investigating all possible individual preferences among the n metrics, under which the pessimistic and optimistic outcomes are derived in a closed-form manner; we then employ the SMAA-2 to compute the holistic acceptability index for aggregating the individual preferences into a social choice result.

2.1 Individual preference

This paper takes into account all possible individual preferences among the metrics to deal with the drawbacks associated with the arithmetic average method. In this sense, an individual preference can be represented by an importance order of metrics. For the ease of demonstration, we only investigate one of the individual preferences in this section, the result of which can be easily migrated in other scenarios. We investigate the situation in which w1w2 ≥ ⋯ ≥ wn, and wj, j = 1, 2, …, n is the importance degree of metric j. In this manner, the pessimistic and optimistic results for DMU i can be determined by the following two linear programs:

vip=minj=1nzijwjs.t.w1w2wnj=1nwj=1,wj0. (2)
vio=maxj=1nzijwjs.t.w1w2wnj=1nwj=1,wj0. (3)

For αj ≥ 0, j = 1, 2, …, n, we define the weights as wk=j=knαj. This is consistent with given individual preference among metrics, w1w2 ≥ ⋯ ≥ wn. Let βj = j,

j=1nβj=j=1njαj=j=1m(k=jnαk)=j=1nwj=1. (4)

Moreover, we define sik=1kj=1kzij,k=1,2,,n, then

j=1nzijwj=j=1nk=jnzijαk=j=1nk=jnzij(1kβk)=k=1nβk(1kj=1kzij)=k=1nβksik. (5)

Therefore, the linear program (3) is equivalent to the following model:

vio=maxk=1nβksiks.t.k=1nβk=1,βk0. (6)

Let k^{1,2,,n} satisfies that sik^=maxk{sik}, then the optimal solution to linear program (6) is determined by

βk={1,k=k^;0,otherwise. (7)

Consequently, the optimistic result for DMU i with certain individual preference can be easily determined as the following closed form: vio=maxk{sik}=maxk{1kj=1kzij},k=1,2,,n. This scheme is easy-to-understand and simple-to-implement, and can be readily migrated to other situations. Similarly, the pessimistic result for DMU i with certain individual preference can be derived as vip=mink{sik}=mink{1kj=1kzij},k=1,2,,n.

Taking into account the pessimistic and optimistic outcomes under all possible individual preferences, an interval-valued decision matrix Ωm × n! is formulated as below:

Ωm×n!=[[v11p,v11o][v1tp,v1to][v1n!p,v1n!o][v21p,v21o][v2tp,v2to][v2n!p,v2n!o][vm1p,vm1o][vmtp,vmto][vmn!p,vmn!o]] (8)

As claimed by [12], Ωm × n! represents a stochastic decision problem. SMAA-2 has been accepted as an effective tool to solve this problem [9].

2.2 SMAA-2

Stochastic multicriteria acceptability analysis (SMAA) is a multicriteria decision support method for multiple experts in discrete problems, based on exploring the weight space to describe the valuations that make each alternative the preferred one [8, 9]. SMAA-2 extends SMAA by taking into account information about other ranking positions, therefore identifies good compromise alternatives. This in particular makes sense when some extreme alternatives obtain the best ranking positions through some experts, but reach a very bad ranking position according to others.

We describe the preference structure among different experts that can be represented by a real-valued utility function u(xi, λ), which maps different alternatives xi to utility values

ui(λ)=u(xi,λ), (9)

in terms of a weight vector λ to quantify each specific preference among various decision results. Consider a more general environment in which neither input data nor weights are exactly known. The uncertain or imprecise input data is represented by stochastic variables ζil with estimated joint probability distribution and density function f(ζ) in the space X, while the unknown or partially known preferences are represented by a weight distribution with density function f(λ) in the set of feasible weights Λ defined as

Λ={λRp:λ0,lλl=1}. (10)

The set of feasible weights is therefore a (p − 1) dimensional simplex. The aforementioned utility function is then employed to map stochastic input data and weight distributions into utility distributions u(ζi, λ).

Total loss of knowledge on weights is represented in “Bayesian” manner by a uniform weight distribution in Λ, which has density function

f(λ)=1vol(Λ)=(p-1)!p. (11)

In SMAA, the set of favorable weights for each alternative Λi(ζ) is then defined as:

Λi(ζ)={λΛ:u(ζi,λ)u(ζk,λ),k}. (12)

The ranking position of each alternative is defined as an integer from the best (= 1) to the worst (= m), in terms of a ranking function:

rank(ζi,λ)=1+kϕ(u(ζk,λ)>u(ζi,λ)), (13)

in which φ(ture) = 1 and φ(false) = 0.

In SMAA-2, the set of favorable weights for Λir(ζ) is defined as:

Λir(ζ)={λΛ:rank(ζi,λ)=r}. (14)

A weight λΛir(ζ) assigns utilities for the alternatives in this manner so that alternative xi reaches ranking position r.

The rank acceptability index bir is thereby defined as the expected volume of the set of favorable weights, and regarded as a measure of the variety of different valuations granting alternative xi achieves ranking position r. Meanwhile, the rank acceptability index is calculated as a multidimensional integral over the input data distributions and the favorable rank weights by means of

bir=Xf(ζ)Λir(ζ)f(λ)dλdζ. (15)

The rank acceptabilities can be utilized directly in the evaluation of alternatives. For large-scale problems, we introduce an iterative process, in which the κ best ranking positions (κbr) acceptabilities are analyzed at each iteration κ:

aiκ=r=1κbiκ. (16)

The kbr acceptabilities aiκ is a measure of the variety of different valuations that assign alternative xi any of the κ best ranking positions.

The problem of comparing alternatives through rank acceptabilities motivates us to propose a complementary method that integrates the rank acceptabilities into holistic acceptability indices aih for each alternative as:

aih=rβrbir, (17)

where βr are surrogate weights. The basic requirements for surrogate weights are nonnegative, normalized and nonincreasing when rank increases, namely, β1β2 ≥ ⋯ ≥ βm ≥ 0. The elicitation of surrogate weights have been extensively studied in literature [4, 10, 13].

3. Empirical study

3.1. FHI 2018

Data has been universally regarded as one of the most important resources in modern health care. The collection, sharing and analyzing of data can help identify disease earlier, make hospitals become faster organizations, and transform the patient experience. Value defined in health care are tracked, measured and improved though data. The FHI 2018 analyzes data and conducts interviews with leaders that are making value-based health care happen around the world, to produce practical insights that health care leaders can apply for accelerating their path towards that goal. The fist chapter of FHI 2018 outlines how the Value Measure tool can form the basis of a positive platform for change across the countries it surveys, and reports the value delivered by health systems of 16 countries, which are shown in Table 2 below. We observe that Germany performs best in Access, Singapore has the best performance in Satisfaction and Efficiency. The 16-country average Value Measure is 43.48, and Singapore has the highest Value Measure across the 16 countries surveyed.

Table 2. Value measure by country in the FHI 2018.

Country Access Satisfaction Efficiency Normalized Access Normalized Satisfaction Normalized Efficiency
Australia 65.05 66.85 25.87 0.79 0.97 0.38
Brazil 36.99 21.08 22.06 0.37 0.00 0.28
China 31.50 44.63 38.19 0.29 0.50 0.69
France 67.45 63.77 21.33 0.83 0.90 0.26
Germany 78.72 53.30 20.77 1.00 0.68 0.25
India 12.23 59.67 28.02 0.00 0.82 0.43
Italy 53.11 44.97 27.24 0.61 0.51 0.41
Netherlands 63.57 60.86 22.35 0.77 0.84 0.29
Russia 63.58 31.75 27.38 0.77 0.23 0.42
Saudi Arabia 43.59 62.75 44.17 0.47 0.88 0.85
Singapore 45.46 68.27 50.11 0.50 1.00 1.00
South Africa 29.21 39.53 11.09 0.26 0.39 0.00
Spain 51.43 66.50 27.79 0.59 0.96 0.43
Sweden 62.14 61.05 21.11 0.75 0.85 0.26
United Kingdom 54.38 55.18 26.25 0.63 0.72 0.39
United States 55.15 45.46 12.23 0.65 0.52 0.03

3.2. Result and analysis

We take into account all possible individual preferences among Access, Satisfaction and Efficiency: ASE: access ≽ satisfaction ≽ efficiency, AES: access ≽ efficiency ≽ satisfaction, SAE: satisfaction ≽ access ≽ efficiency, SEA: satisfaction ≽ efficiency ≽ access, EAS: efficiency ≽ access ≽ satisfaction, and ESA: efficiency ≽ satisfaction ≽ access. By means of the closed-form solutions obtained in Section 2.1, the pessimistic and optimistic results are derived to formulate the following interval decision problem as Table 3. As for this stochastic decision problem, we follow [12] to consider both Gaussian and Uniform distributions to implement the SMAA-2. [14] develops a open-source implementation of SMAA methods in java, which can be downloaded at http://smaa.fi/jsmaa/.

Table 3. Interval decision matrix.

Country ASE AES SAE SEA EAS ESA
Australia [0.7144,0.8822] [0.5866,0.7944] [0.7144,0.9699] [0.6743,0.9699] [0.3788,0.7144] [0.3788,0.7144]
Brazil [0.1862,0.3724] [0.2178,0.3724] [0.0000,0.2178] [0.0000,0.2178] [0.2178,0.3268] [0.1406,0.2811]
China [0.2898,0.4945] [0.2898,0.4945] [0.3944,0.4900] [0.4945,0.5968] [0.4922,0.6945] [0.4945,0.6945]
France [0.6659,0.8676] [0.5465,0.8305] [0.6659,0.9046] [0.5835,0.9046] [0.2624,0.6659] [0.2624,0.6659]
Germany [0.6436,1.0000] [0.6240,1.0000] [0.6436,0.8414] [0.4654,0.6828] [0.2481,0.6436] [0.2481,0.6436]
India [0.0000,0.4172] [0.0000,0.4172] [0.4089,0.8178] [0.4172,0.8178] [0.2169,0.4339] [0.4172,0.6258]
Italy [0.5117,0.6148] [0.5117,0.6148] [0.5063,0.5605] [0.4601,0.5117] [0.4139,0.5144] [0.4139,0.5117]
Netherlands [0.6346,0.8076] [0.5304,0.7721] [0.6346,0.8430] [0.5658,0.8430] [0.2886,0.6436] [0.2886,0.6436]
Russia [0.4720,0.7723] [0.4720,0.7723] [0.2261,0.4992] [0.2261,0.4720] [0.4175,0.5949] [0.3218,0.4720]
Saudi Arabia [0.4716,0.7341] [0.4716,0.7341] [0.6773,0.8830] [0.7341,0.8830] [0.6597,0.8478] [0.7341,0.8654]
Singapore [0.4998,0.8333] [0.4998,0.8333] [0.7499,1.0000] [0.8333,1.0000] [0.7499,1.0000] [0.8333,1.0000]
South Africa [0.2154,0.3232] [0.1277,0.2554] [0.2154,0.3910] [0.1955,0.3910] [0.0000,0.2154] [0.0000,0.2154]
Spain [0.5896,0.7760] [0.5088,0.6600] [0.6600,0.9625] [0.6600,0.9625] [0.4280,0.6600] [0.4280,0.6952]
Sweden [0.6181,0.7988] [0.5037,0.7506] [0.6181,0.8470] [0.5519,0.8470] [0.2568,0.6181] [0.2568,0.6181]
United Kingdom [0.5817,0.6783] [0.5112,0.6339] [0.5817,0.7226] [0.5556,0.7226] [0.3885,0.5817] [0.3885,0.5817]
United States [0.3971,0.6455] [0.3374,0.6455] [0.3971,0.5811] [0.2729,0.5166] [0.0292,0.3971] [0.0292,0.3971]

3.3 Gaussian distribution

We consider that the interval-valued data satisfies the Gaussian distribution, the mean and variance are simulated as [12]:

μit=vitp+vito2, (18)
(σ2)it=vito-vitp6. (19)

The rank acceptability indices are easily obtained and vividly illustrated in Table 4 and Fig 1 below. In addition, we use the rank-order centroid approach (ROC) to elicit surrogate weights for constructing the holistic acceptability indices: βr=116t=r161t,r=1,2,,16. It is observed that the first rank support of Singapore is 90.36% of the possibility, while the last rank supports of Brazil and South Africa are 46.23% and 53.22%, respectively. This implies that Singapore is most likely to be ranked at the first, Brazil and South Africa have the similar probability to realize the last rank.

Table 4. Rank acceptability indices and HAI calculated under Gaussian distribution.

Country b1 b2 b3 b4 b5 b6 b7 b8 b9 b10 b11 b12 b13 b14 b15 b16 HAI
Australia 0.0521 0.3051 0.4513 0.1466 0.0342 0.0081 0.0016 0.0006 0.0004 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.1271
Brazil 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0001 0.0264 0.5112 0.4623 0.0063
China 0.0000 0.0000 0.0009 0.0056 0.0155 0.0177 0.0189 0.0262 0.0414 0.1837 0.2191 0.3475 0.1144 0.0091 0.0000 0.0000 0.0297
France 0.0027 0.0253 0.0965 0.2036 0.3265 0.1856 0.0931 0.0390 0.0143 0.0081 0.0029 0.0021 0.0003 0.0000 0.0000 0.0000 0.0829
Germany 0.0398 0.0645 0.0924 0.1285 0.1438 0.1666 0.1418 0.1194 0.0623 0.0266 0.0088 0.0051 0.0004 0.0000 0.0000 0.0000 0.0824
India 0.0000 0.0000 0.0000 0.0001 0.0001 0.0006 0.0017 0.0038 0.0062 0.0643 0.0729 0.1444 0.3188 0.3640 0.0178 0.0053 0.0184
Italy 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0003 0.0014 0.0086 0.4697 0.4145 0.1054 0.0001 0.0000 0.0000 0.0000 0.0307
Netherlands 0.0000 0.0006 0.0054 0.0372 0.1289 0.2595 0.3194 0.1834 0.0414 0.0146 0.0057 0.0037 0.0002 0.0000 0.0000 0.0000 0.0626
Russia 0.0000 0.0000 0.0001 0.0003 0.0003 0.0023 0.0048 0.0123 0.0282 0.1455 0.2457 0.2715 0.2287 0.0602 0.0001 0.0000 0.0251
Saudi Arabia 0.0012 0.5359 0.1884 0.1022 0.0722 0.0413 0.0294 0.0265 0.0023 0.0006 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.1238
Singapore 0.9036 0.0522 0.0245 0.0113 0.0044 0.0018 0.0017 0.0003 0.0001 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.2033
South Africa 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0017 0.4661 0.5322 0.0059
Spain 0.0006 0.0161 0.1385 0.3538 0.2136 0.1514 0.0842 0.0379 0.0032 0.0006 0.0001 0.0000 0.0000 0.0000 0.0000 0.0000 0.0876
Sweden 0.0000 0.0003 0.0020 0.0105 0.0571 0.1440 0.2448 0.3775 0.1036 0.0351 0.0131 0.0113 0.0007 0.0000 0.0000 0.0000 0.0548
United Kingdom 0.0000 0.0000 0.0000 0.0003 0.0034 0.0211 0.0583 0.1717 0.6879 0.0507 0.0056 0.0010 0.0000 0.0000 0.0000 0.0000 0.0440
United States 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0001 0.0004 0.0116 0.1080 0.3363 0.5386 0.0048 0.0002 0.0154

Fig 1. Rank acceptability indices calculated under Gaussian distribution.

Fig 1

3.4. Uniform distribution

Again, we take into account the uniform distribution and apply the open-source decision supporting software to calculate the rank acceptability indices and show them in the following Table 5 and Fig 2. The aforementioned surrogate weights are employed to build the holistic acceptability indices. Similar to that of Gaussian distribution, the first rank support of Singapore is 85.46% of the possibility, while the last rank supports of Brazil and South Africa are 47.38% and 51.16%, respectively.

Table 5. Rank acceptability indices and HAI calculated under Uniform distribution.

Country b1 b2 b3 b4 b5 b6 b7 b8 b9 b10 b11 b12 b13 b14 b15 b16 HAI
Australia 0.0662 0.2542 0.3222 0.1874 0.0988 0.0432 0.0196 0.0060 0.0016 0.0004 0.0003 0.0001 0.0000 0.0000 0.0000 0.0000 0.1203
Brazil 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0005 0.0484 0.4773 0.4738 0.0063
China 0.0000 0.0000 0.0023 0.0071 0.0153 0.0181 0.0276 0.0400 0.0662 0.1557 0.2263 0.2983 0.1235 0.0196 0.0000 0.0000 0.0306
France 0.0133 0.0587 0.1315 0.1633 0.1950 0.1590 0.1196 0.0818 0.0396 0.0181 0.0106 0.0077 0.0016 0.0002 0.0000 0.0000 0.0833
Germany 0.0421 0.0677 0.0944 0.1145 0.1256 0.1414 0.1414 0.1242 0.0770 0.0352 0.0185 0.0145 0.0033 0.0002 0.0000 0.0000 0.0807
India 0.0000 0.0000 0.0002 0.0000 0.0015 0.0025 0.0076 0.0133 0.0222 0.0721 0.0674 0.1448 0.3068 0.3248 0.0243 0.0125 0.0196
Italy 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0032 0.0120 0.0391 0.4191 0.3994 0.1240 0.0032 0.0000 0.0000 0.0000 0.0310
Netherlands 0.0008 0.0072 0.0334 0.0959 0.1542 0.1963 0.2098 0.1707 0.0815 0.0256 0.0142 0.0081 0.0023 0.0000 0.0000 0.0000 0.0659
Russia 0.0000 0.0003 0.0001 0.0012 0.0031 0.0073 0.0141 0.0234 0.0415 0.1368 0.1893 0.2700 0.2304 0.0813 0.0012 0.0000 0.0258
Saudi Arabia 0.0173 0.4838 0.2047 0.1106 0.0682 0.0484 0.0338 0.0239 0.0072 0.0016 0.0005 0.0000 0.0000 0.0000 0.0000 0.0000 0.1228
Singapore 0.8546 0.0783 0.0303 0.0165 0.0082 0.0062 0.0034 0.0016 0.0007 0.0002 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.1988
South Africa 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0076 0.4808 0.5116 0.0060
Spain 0.0051 0.0471 0.1629 0.2433 0.2134 0.1590 0.1012 0.0531 0.0116 0.0030 0.0002 0.0001 0.0000 0.0000 0.0000 0.0000 0.0881
Sweden 0.0006 0.0027 0.0177 0.0579 0.1013 0.1624 0.2008 0.2165 0.1356 0.0476 0.0280 0.0225 0.0060 0.0004 0.0000 0.0000 0.0586
United Kingdom 0.0000 0.0000 0.0003 0.0023 0.0154 0.0562 0.1179 0.2333 0.4757 0.0806 0.0139 0.0044 0.0000 0.0000 0.0000 0.0000 0.0467
United States 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0002 0.0005 0.0040 0.0314 0.1055 0.3224 0.5175 0.0164 0.0021 0.0157

Fig 2. Rank acceptability indices calculated under Uniform distribution.

Fig 2

In what follows, we use the holistic acceptability index under Gaussian and Uniform distributions as the revised metric of Value Measure, then compare these ranks with those according to the Value Measure, as shown in Table 6. It is evident that our method generates sufficiently robust rank among 16 countries. Only Australia and Saudi Arabia are ranked differently with slight difference. Meanwhile, the ranks of Brazil (15), Italy (10), Singapore (1), South Africa (16), Sweden (8) and United Kingdom (9) are significantly reliable because both our method and Value Measure produce the identical outcomes for them.

Table 6. Rank comparisons.

Country Value Measure SMAA-2 with Gaussian distribution SMAA-2 with Uniform distribution
Australia 2 2 3
Brazil 15 15 15
China 12 11 11
France 4 5 5
Germany 3 6 6
India 14 13 13
Italy 10 10 10
Netherlands 6 7 7
Russia 11 12 12
Saudi Arabia 5 3 2
Singapore 1 1 1
South Africa 16 16 16
Spain 7 4 4
Sweden 8 8 8
United Kingdom 9 9 9
United States 13 14 14

In addition, we make the full use of Spearman’s rank correlation coefficient to verify the feasibility and rationality of the proposed method. In statistics, Spearman’s rank correlation coefficient is a nonparametric measure between the rankings of two variables, and evaluates how well the relationship between two variables can be described using a monotonic function. The Spearman’s rank correlation coefficient is capable of reflecting the conflict between ranking orders [15]. The more discordant the rankings of two variables, the smaller the Spearman’s rank correlation coefficient [16]. The formula to compute Spearman’s rank correlation coefficient is

ρs=1-6i=1m(di)2m(m2-1), (20)

where di is the difference between the two ranks of each variable, and m is the number of DMUs [17].

We calculate and compare the average Spearman’s rank correlation coefficients in the following Table 7, which are capable of measuring the strength and direction of association between obtained ranks and variables, and assessing the accuracy of models [18]. The Spearman’s rank correlation coefficients between the Value Measure and Access, Satisfaction, Efficiency are computed as a benchmark for further analysis. Columns 2–4 report the Spearman rank correlation coefficients between the ranks obtained from our method and from Access, Satisfaction, Efficiency, respectively. Relative improvements are reported in the last column. Apparently, our method always outperforms the Value Measure, and the improvement from SMAA-2 with Gaussian and Uniform distributions are 4.15% and 3.25%, respectively.

Table 7. The superiority of our method.

Scenario Access Satisfaction Efficiency Average Improvement
FHI 2018 0.6000 0.8000 0.2294 0.5431 0.00%
SMAA-2 with Uniform distribution 0.3882 0.8706 0.4235 0.5608 3.25%
SMAA-2 with Gaussian distribution 0.4147 0.8794 0.4029 0.5657 4.15%

According to the comparison of average Spearman’s rank correlation coefficients, the proposed method outperforms the original Value Measure in terms of better associations between modified Value Measure and Access, Satisfaction, Efficiency. This indicates that countries can improve the levels of Value Measure in a precise manner.

4. Concluding remarks

The Future Health Index (FHI) 2018 measures and assesses the value presented in 16 health systems of developed and developing markets through proposing a broadly applicable composite indicator, namely, Value Measure, which is constructed in terms of the arithmetic average of Access, Satisfaction and Efficiency. However, the individual preferences among them remain largely unexplored in literature.

This paper proposes to apply the social choice theory and Stochastic Multicriteria Acceptability Analysis for group decision making (SMAA-2) for measuring the value of health systems, by means of re-constructing the Value Measure. Specifically, we begin with considering all possible individual preferences among Access, Satisfaction and Efficiency, which is mathematically represented by ranked weights of them; the pessimistic and optimistic outcomes under certain individual preference are derived in a closed-form manner, according to which an interval decision matrix is then formulated; the SMAA-2 is then applied to compute the holistic acceptability index and is considered as a revised Value Measure. An empirical study using the data of 16 countries is performed to demonstrate the usefulness of our method, in which both Gaussian and Uniform distributions have been taken into account. It is evident that our method is capable of generating sufficient robust and superior results to the Value Measure.

The applicability and feasibility of our method are in particular limited by two aspects of the data set: extreme values and number of metrics. Specifically, it is more meaningful to extensively investigate various individual preferences when the metric values are changed mildly among different metrics. Moreover, the application of our method could be more complicated when there exist more metrics to consider, since the importance orders would dramatically increase as the increase of the number of metrics. Therefore, the proposed method is applicable and feasible when the amount of metrics is considerably small, such as no more than four. For the scenario with over five metrics, future research should develop some statistical techniques, for example, principal component analysis, to select useful orders for implementation. In addition, future research should consider other statistical distributions (e.g., lognormal distribution, gamma distribution) of the stochastic parameters. A wide spectrum of methods should also be determined to select meaningful individual preferences for further analysis.

Acknowledgments

The authors thank the editor and two anonymous reviewers for their helpful comments on our paper.

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

This research is financially supported by Humanities and Social Science Research Projects of Ministry of Education of the People’s Republic of China (Funding No.18YJC790142), Humanities and Social Science Research Projects of Guangdong Province (Funding No. GD18CGL06), Humanities and Social Science Research Projects of Shenzhen (Funding No. SZ2018C012).

References

  • 1. Porter M. E. (2008). Value-based health care delivery. Annals of Surgery, 248, 503–509. [DOI] [PubMed] [Google Scholar]
  • 2. Porter M. E. (2009). A strategy for health care reform—toward a value-based system. New England Journal of Medicine, 361, 109–112. 10.1056/NEJMp0904131 [DOI] [PubMed] [Google Scholar]
  • 3. Porter M. E. (2010). What is value in health care? New England Journal of Medicine, 363, 2477–2481. 10.1056/NEJMp1011024 [DOI] [PubMed] [Google Scholar]
  • 4. Danielson M., Ekenberg L., … He Y. (2014). Augmenting ordinal methods of attribute weight approximation. Decision Analysis, 11, 21–26. 10.1287/deca.2013.0289 [DOI] [Google Scholar]
  • 5. Ding Y., Fu Y., Lai K. K., … Leung W. J. (2018). Using ranked weights and acceptability analysis to construct composite indicators: A case study of regional sustainable society index. Social Indicators Research, 139, 871–885. 10.1007/s11205-017-1765-3 [DOI] [Google Scholar]
  • 6. Sen A. (1977). Social choice theory: A re-examination. Econometrica, 45, 53–88. 10.2307/1913287 [DOI] [Google Scholar]
  • 7. Sen A. (1999). The possibility of social choice. American Economic Review, 89, 349–378. 10.1257/aer.89.3.349 [DOI] [Google Scholar]
  • 8. Lahdelma R., Hokkanen J., … Salminen P. (1998). SMAA-stochastic multiobjective acceptability analysis. European Journal of Operational Research, 106, 137–143. 10.1016/S0377-2217(97)00163-X [DOI] [Google Scholar]
  • 9. Lahdelma R., … Salminen P. (2001). SMAA-2: Stochastic multicriteria acceptability analysis for group decision making. Operations Research, 49, 444–454. 10.1287/opre.49.3.444.11220 [DOI] [Google Scholar]
  • 10. Barron F. H., … Barrett B. E. (1996). Decision quality using ranked attribute weights. Management Science, 42, 1515–1523. 10.1287/mnsc.42.11.1515 [DOI] [Google Scholar]
  • 11. Kirkwood C. W., … Sarin R. K. (1985). Ranking with partial information: A method and an application. Operations Research, 33, 38–48. 10.1287/opre.33.1.38 [DOI] [Google Scholar]
  • 12. Yang F., Ang S., Xia Q., … Yang C. (2012). Ranking DMUs by using interval DEA cross efficiency matrix with acceptability analysis. European Journal of Operational Research, 223, 483–488. 10.1016/j.ejor.2012.07.001 [DOI] [Google Scholar]
  • 13. Danielson M., … Ekenberg L. (2017). A robustness study of state-of-the-art surrogate weights for MCDM. Group Decision and Negotiation, 26, 677–691. 10.1007/s10726-016-9494-6 [DOI] [Google Scholar]
  • 14. Tervonen T. (2014). JSMAA: open source software for SMAA computations. International Journal of Systems Science, 45, 69–81. 10.1080/00207721.2012.659706 [DOI] [Google Scholar]
  • 15. Zhou P., Ang B., … Poh K. (2006). Comparing aggregating methods for constructing the composite environmental index: An objective measure. Ecological Economics, 59, 305–311. 10.1016/j.ecolecon.2005.10.018 [DOI] [Google Scholar]
  • 16. Diakoulaki D., Mavrotas G., … Papayannakis L. (1995). Determining objective weights in multiple criteria problems: The CRITIC method. Computers … Operations Research, 22, 763–770. 10.1016/0305-0548(94)00059-H [DOI] [Google Scholar]
  • 17. Fieller E. C., Hartley H. O., … Pearson E. S. (1957). Tests for rank correlation coefficients. i. Biometrika, 44, 470–481. 10.2307/2332878 [DOI] [Google Scholar]
  • 18. Lyerly S. B. (1952). The average Spearman rank correlation coefficient. Psy-chometrika, 17, 421–428. [Google Scholar]

Decision Letter 0

Fausto Cavallaro

23 Dec 2019

PONE-D-19-26948

Using social choice theory and acceptability analysis to measure the value of health systems

PLOS ONE

Dear Dr. Fu,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by Feb 04 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Fausto Cavallaro, PhD

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We noticed you have some minor occurrence of overlapping text with the following previous publication(s), which needs to be addressed:

https://www.philips.com/a-w/about/news/future-health-index/about.html

The text that needs to be addressed involves the last paragraph of the Introduction.

In your revision ensure you cite all your sources (including your own works), and quote or rephrase any duplicated text outside the methods section. Further consideration is dependent on these concerns being addressed.

3. Thank you for stating the following financial disclosure:

"The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

a. Please provide an amended Funding Statement that declares *all* the funding or sources of support received during this specific study (whether external or internal to your organization) as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now

b. Please state what role the funders took in the study.  If any authors received a salary from any of your funders, please state which authors and which funder. If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

c. Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: I Don't Know

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The underlying concepts need to be defined more precisely. Geographic access, financial access, access to specialists appropriate for treating specific conditions? How is efficiency to be measured? What are the output and cost measures? In general it is not clear where the data come from? If these are guesstimates by experts, who are the experts? This lack of detail will greatly limit reader interest.

Reviewer #2: Positive comments:

The paper covers an interesting topic with improving a method to appropriately address the several factors that are relevant for the value of different health systems. Following an approach that takes rankings of these factors into account instead of weighting them equally seems a reasonable approach.

The overall logic of the paper is understandable. The problem with the existing methodology (computing the average overall) and the aimed contribution is pointed out. In general, the development of the methodology in the empirical part (section 2) is reasonable. Moreover, it is clearly a plus, that the author includes detailed information about the used software (e.g., for the SMAA-2) and the tested data set. This makes the method also applicable for researchers and practitioners in other contexts for similar problems.

The general demands from the journal are met, in that the study presents the results of original research, the methodology is overall described in a reasonable manner, conclusions seem to be supported by the data (at least partly, although there are some issue with this method outlined below) and standards of data and methodological transparency are obtained.

However, there is some potential to improve the paper, as pointed out below.

Methodological Development (Section 2)

The method of using ranked weights to improve the value measurement seems to be solid and bases on an established approach that is motivated from former research. However, some open questions and issues remain.

First, it is not clear, whether or how it is accounted for the several sub-metrics that build the metrics Satisfaction, Access and Efficiency. Are they still aggregated using the average method or are ranked weights used for them as well? If they are still aggregated, there would still be the issue of information loss within these metrics. Is there an approach or argument to deal with this issue?

Second, it remains unclear, how feasible this methodology is for a large set of indicators with many possible preference rankings. It appears to require a high amount of computational power and might get complex if many indicators are present. This limitation raises concerns about its applicability for a broader range of (more complex) problems. It would be helpful to add a statement of anticipated generalizability of the method.

Moreover, the argumentation and description of the methodological development (section 2) should be improved. Some decisions in the development are (apparently) made arbitrarily in that they are not justified convincingly. For the following points, the author should outline their reasons leading to these methodological decisions more clear:

1. Why is it necessary to normalize values, since the original metrics in the FHI are already normalized (on a scale from 0-100)?

2. Why is a uniform and a Gaussian distribution used?

3. The introduction of surrogate weights is very short and the mathematical calculation of them appears 2 pages after their introduction. This disrupts comprehension. It could also contribute the quick comprehension of the methodology, if the author could add a short explanation when introducing some new parameters (e.g., what k stands for (p. 7)).

Results of Empirical Analysis (Section 3)

The final comparison between the developed and the previous method to evaluate the value of health systems bases on Spearman’s Rank correlations. However, there are several issues with the presentation of this analysis:

1. It is not designated if the Spearman rank correlations are statistically significant.

2. The author provides a proportional improvement of the summed correlation coefficients, but does not provide a statistical index of whether this improvement (in %) is a significant improvement. The argumentation would be more convincing if an objective index would be added that proves a significant improvement in comparison to the previous method of averaging over metrics.

3. It is questionable whether the correlation with ranks based on the averaged metrics is appropriate, since it is argued in the paper before that averaging over indices has some methodological issues. Taking the ranks developed by the averaged metrics then as a comparison standard seems inconsistent with that former argumentation.

As such, it remains unclear, how to evaluate the improvement of the revised value measure in comparison to the former approach. For these reasons the soundness of conclusions based on data can be only confirmed partly.

Tables and Figures

The Tables and Figures are helpful and improve understanding in general.

For tables, however, information about the depicted measure could help to intuitive understanding them. Although mentioned in the text, this information should be included directly in the table or the title of the table or added as a footnote (see for example Table 4, 5, 7). In Table 7 significance of correlation coefficients should be added (see also comment above).

It would be nice, if information from Figure 1 and 2 could be joined allowing a direct comparison between both distributions.

Language

The author could improve wording and language use at some points to make her points really clear:

For example the sentence “Such an investigation sheds much-needed light on potential incentives and directions for academic, managerial and policy-related implications.” (p. 5) does not point out which implications are derived. The sentence “For the purpose of measuring the performance previously aggregated using the arithmetic average” (p. 5) is misleading, as it suggests that the former performance of the value measure is investigated, which is not the aim nor the outcome of the paper (instead, it compares the correlations of the sub-metrics with the old and the new values measure.

Furthermore, when describing the metrics of the FHI, the terms “sub-metric” and “metric” (introduced on p. 3) are used, but later on in the methodological part the term “sub-indices” refers to the metrics Access, Satisfaction and Efficiency, which can be confusing. It should be stated clearly once, which concepts from the application case in the empirical part refer to which concepts in the developed methodology.

Another issue is that some sentences in the introduction are nearly adopted literally, but are not marked as verbatim quotes. Concretely these are the following sentences:

“The challenges of delivering health care in many countries are receiving increasing attentions as costs continue to rise and evidence of uneven quality accumulates” (p. 2, first sentence, adopted from Porter, 2008), and “The Future Health Index (FHI)1 is a research-based platform designed by the Royal Philips to help determine the readiness of countries to address global health challenges and build sustainable...” (adopted from the FHI website). Both should be marked as verbatim quotes.

There is an (important) typo in Table 1 (p. 4). It is “healthy life expectancy” instead of “health life expectancy”.

Structure

In general the structure and the development of the paper makes sense. However, maybe the author could avoid repeating information and foster a fluent reading when slightly restructuring the paper.

In the current draft, FHI is described in detail already in the introduction, then the general methodological approach is developed and then the empirical section is introduced again with a description of the FHI. The detailed description of the FHI could also be positioned as an own section before the empirical section or included in the empirical section.

In the introduction, the objective and contribution of the study is mentioned relatively late (after introduction of the FHI). This lets the reader in the dark about the purpose of the study and disrupts a proper understanding from the very beginning.

Outline of Contribution

Although the paper apparently focuses on the development of a methodological approach, the motivation of the paper addresses several content-wise issues that are not really addressed later on in the paper. The author should become more clear what exactly the problem is they aim to address and to solve with their research (otherwise it appears like overstating the contribution of the paper).

As such, it remains unclear how applicable the findings are and which exact practical implications can be derived from them. It would contribute to the value of the paper from a broader perspective if the author would answer the following questions when shortly discussing the results:

1. How do the findings and the improvement of measurement contribute to achieving high value for various stakeholders?

2. Which (if any) content-wise and/or practical implications can be derived from the improved value measurement?

Concluding Remarks

Taking together, the paper covers an important topic with an immense societal relevance. It uses a convincing methodological approach that is theoretically reasoned to improve the value of health systems.

Major issues in the current draft and important implications for the further development of the paper will be to improve the scientific proof of empirically testing the improvement of the revised measure of the FHI in comparison to the former approach of averaging the conducted metrics. Moreover, the practical implications and methodological applicability of the results should be worked out more clearly.

Some concerns are also raised through the use of nearly verbatim quotes from other sources without designating them as such.

I wish the author good luck and look forward to the further development of the paper.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 1

Fausto Cavallaro

28 Apr 2020

PONE-D-19-26948R1

Using social choice theory and acceptability analysis to measure the value of health systems

PLOS ONE

Dear Dr. Fu,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by Jun 12 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Fausto Cavallaro, PhD

Academic Editor

PLOS ONE

 Editor Comments: minor revision

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: (No Response)

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Partly

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: No

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: One major issue of the previous submission is still not addressed sufficiently: The authors argue that averaging over sub-metrics to identify the value of health systems entails several problems (see p. 4). However, their whole analyses bases on averaged values (sub-metrics) which definitely weakens the soundness of their findings and the applicability of the developed method to the given data base. This substantial limitation of their findings due to the given data set needs to be mentioned, e.g., in the conclusion section. The authors should also address the question of how serious this concern might be in comparable data sets (i.e., how accessible are other than averaged results). So far it is not clear how feasible their method is to address the initially stated shortcoming of existing approaches to calculate value of health systems.

This issue becomes even more significant given the potential problems that their methodology conveys when faced with a larger number of metrics that could or should be used when calculating a value measure. Again, the authors should point out more clearly, when and where exactly their developed method is applicable.

The claim that the proposed method states a reasonable improvement to the former calculation method remains untested and is therefore still questionable. This is particularly relevant given the fact that the proposed methods requires a much higher level of effort and computational power and might not be applicable to similar problems with a higher number of included measures as basis for the value calculation.

Contribution

The contribution, goal and applicability of the paper’s finding should be pointed out more clearly. The added statement on page 5 “to measure the value of health systems” does not clarify the issue and is rather confusing. What exactly is meant with “sheds much-needed light on potential incentives and directions for academic, managerial and policy-related implications…”? This is not really a sentence at all. Please clarify. Note: This could also be a language issue.

Minor issues

Figures and tables

Again: Table 4, 5, and figures 1 and 2 do not contain information, which values are depicted. They are as such not intuitive. The main title or an added sub-title or footnote should directly provide information about what is depicted. (e.g. title of table 5 could easily be “Rank acceptability indices calculated under uniform distribution” or similar); the tables should be comprehensible on their own.

Reviewer #3: This communication presents an interesting novel method for measuring the value of health systems.

As per reviewers' suggestions, the authors have revised the paper successfully, but some grammatical errors are still present in the revised manuscript. Improve the errors.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Jul 2;15(7):e0235531. doi: 10.1371/journal.pone.0235531.r004

Author response to Decision Letter 1


28 May 2020

Many thanks for your valuable comments on our manuscript. We make the revisions and highlight them in the revised version using RED.

Reviewer #2: One major issue of the previous submission is still not addressed sufficiently: The authors argue that averaging over sub-metrics to identify the value of health systems entails several problems (see p. 4). However, their whole analyses based on averaged values (sub-metrics) which definitely weakens the soundness of their findings and the applicability of the developed method to the given data base. This substantial limitation of their findings due to the given data set needs to be mentioned, e.g., in the conclusion section. The authors should also address the question of how serious this concern might be in comparable data sets (i.e., how accessible are other than averaged results). So far it is not clear how feasible their method is to address the initially stated shortcoming of existing approaches to calculate value of health systems.

This issue becomes even more significant given the potential problems that their methodology conveys when faced with a larger number of metrics that could or should be used when calculating a value measure. Again, the authors should point out more clearly, when and where exactly their developed method is applicable.

Many thanks. The applicability and feasibility of our method are in particular limited by two aspects of the data set: extreme values and number of metrics. Specifically, it is more meaningful to extensively investigate various individual preferences when the metric values are changed mildly among different metrics. Moreover, the application of our method could be more complicated when there exist more metrics to consider, since the importance orders would dramatically increase as the increase of the number of metrics. Therefore, the proposed method is applicable and feasible when the amount of metrics is considerably small, such as no more than four. For the scenario with over five metrics, future research should develop some statistical techniques, for example, principal component analysis, to select useful orders for implementation.

The claim that the proposed method states a reasonable improvement to the former calculation method remains untested and is therefore still questionable. This is particularly relevant given the fact that the proposed method requires a much higher level of effort and computational power and might not be applicable to similar problems with a higher number of included measures as basis for the value calculation.

Many thanks. Spearman's rank correlation coefficients have been extensively utilized to justify the superiority of different ranking results, by means of computing and comparing the strength and direction of association between obtained ranks and variables. The results can be easily obtained by using Excel and do not need high level of effort and large computational power, even the number of metrics is large.

Contribution

The contribution, goal and applicability of the paper’s finding should be pointed out more clearly. The added statement on page 5 “to measure the value of health systems” does not clarify the issue and is rather confusing. What exactly is meant with “sheds much-needed light on potential incentives and directions for academic, managerial and policy-related implications…”? This is not really a sentence at all. Please clarify. Note: This could also be a language issue.

Many thanks. We re-clarify the contribution on page 4: The contribution of this paper is the development of a new method to modify the Value Measure released by Philips for measuring the value of health systems, based upon the social choice theory and Stochastic Multicriteria Acceptability Analysis for group decision making (SMAA-2). The mentioned sentence on page 5 is rephrased as: This study proposes a new incentive and a feasible direction to measure the value of health systems in an appropriate manner, along with the provision of some academic, managerial and policy-related implications.

Minor issues

Figures and tables

Again: Table 4, 5, and figures 1 and 2 do not contain information, which values are depicted. They are as such not intuitive. The main title or an added sub-title or footnote should directly provide information about what is depicted. (e.g. title of table 5 could easily be “Rank acceptability indices calculated under uniform distribution” or similar); the tables should be comprehensible on their own.

Many thanks. The title of Tables 4&5, and Figures 1&2 are refined in the manuscript.

Many thanks for your valuable comments on our manuscript. We make the revisions and highlight them in the revised version using BLUE.

Reviewer #3: This communication presents an interesting novel method for measuring the value of health systems.

As per reviewers' suggestions, the authors have revised the paper successfully, but some grammatical errors are still present in the revised manuscript. Improve the errors.

Attachment

Submitted filename: Response to reviewer 3.docx

Decision Letter 2

Fausto Cavallaro

18 Jun 2020

Using social choice theory and acceptability analysis to measure the value of health systems

PONE-D-19-26948R2

Dear Dr. Fu,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Fausto Cavallaro, PhD

Academic Editor

PLOS ONE

Additional Editor Comments:

Dear Authors,

The reviewer affirms that in the revised version, all comments have been addressed successfully. However, some language / grammar issues remain and should be edited before final publication. Another minor point would be the labeling of table 1; to support comprehension you could improve the title of table 1. It would be more appropriate if this is called “Factors underlying the Value Measure” or similar.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: (No Response)

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: In the revised version, all comments have been addressed successfully. The revised contribution statement and limitation section contribute to the overall clarity and suggest some interesting potential directions for future research.

However, some language / grammar issues remain and should be edited before final publication. Another minor point would be the labeling of table 1; to support comprehension you could improve the title of table 1. It would be more appropriate if this is called “Factors underlying the Value Measure” or similar.

Otherwise, I congratulate the authors for the efforts they made for this paper.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Acceptance letter

Fausto Cavallaro

24 Jun 2020

PONE-D-19-26948R2

Using social choice theory and acceptability analysis to measure the value of health systems

Dear Dr. Fu:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Fausto Cavallaro

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: Response to reviewer 2.docx

    Attachment

    Submitted filename: Response to reviewer 3.docx

    Data Availability Statement

    All relevant data are within the paper and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES