Skip to main content
PLOS One logoLink to PLOS One
. 2020 Jul 9;15(7):e0235666. doi: 10.1371/journal.pone.0235666

Value judgment of new medical treatments: Societal and patient perspectives to inform priority setting in The Netherlands

Anna Nicolet 1,*, Antoinette D I van Asselt 1, Karin M Vermeulen 1, Paul F M Krabbe 1
Editor: Jan Ostermann2
PMCID: PMC7347112  PMID: 32645035

Abstract

Background

In many countries, medical interventions are reimbursed on the basis of recommendations made by advisory boards and committees that apply multiple criteria in their assessment procedures. Given the diversity of these criteria, it is difficult to find common ground to determine what information is required for setting priorities.

Objective

To investigate whether society and patients share the same interests and views concerning healthcare priorities.

Methods

We applied a framework of discrete choice models in which respondents were presented with judgmental tasks to elicit their preferences. They were asked to choose between two hypothetical scenarios of patients receiving a new treatment. The scenarios graphically presented treatment outcomes and patient characteristics. Responses were collected through an online survey administered among respondents from the general population (N = 1,253) and patients (N = 1,389) and were analyzed using conditional logit and mixed logit models.

Results

The respondents’ preferences regarding new medical treatments revealed that they attached the most relative importance to additional survival years, age at treatment, initial health condition, and the cause of disease. Minor differences in the relative importance assigned to three criteria: age at treatment, initial health, and cause of disease were found between the general population and patient samples. Health scenarios in which patients had higher initial health-related quality of life (i.e., a lower burden of disease) were favored over those in which patients’ initial health-related quality of life was lower.

Conclusions

Overall, respondents within the general population expressed preferences that were similar to those of the patients. Therefore, priority-setting studies that are based on the perspectives of the general population may be useful for informing decisions on reimbursement and other types of priority-setting processes in health care. Incorporating the preferences of the general population may simultaneously increase public acceptance of these decisions.

1. Background

Governments in many Western countries rely on the recommendations of advisory boards and committees when making decisions on the reimbursement of new drugs. Decisions to reimburse non-pharmaceutical treatments are also increasingly based on assessments of the available evidence by these independent parties to determine whether the technology in question offers added value to patients and society at large. In the USA, the emphasis is on comparative effectiveness research, which entails direct comparisons of healthcare interventions to determine which ones work best for which patients and which ones pose beneficial or harmful outcomes [1]. In England and Wales, the National Institute for Health and Care Excellence advises the National Health Service on the clinical relevance and cost-effectiveness of treatments, whose health effects are expressed in quality-adjusted life years (QALYs) [2]. Elsewhere in Western Europe, assessment procedures used for this purpose are diverse. In this paper, we will elaborate on the assessment procedure used in one Western European country, the Netherlands [35].

In the Netherlands, the Appraisal Committee (Advies Commissie Pakket) of the National Health Care Institute (Zorginstituut Nederland) provides advice on whether or not to include certain care services or treatments in the basic insurance package. The main criteria used to assess the therapeutic and societal values of drugs and other health interventions are necessity, efficacy, cost-effectiveness, and feasibility. However, some of the sub-criteria can also have significant impacts on the final decision regarding the reimbursement of health interventions [6]. For example, despite low cost-effectiveness, reimbursement may be considered when no other treatment is available, when an orphan disease is in question, or when the burden of disease is substantial. The Appraisal Committee also takes account of societal value judgments in their deliberations. These judgments are not based solely on clinical relevance and cost-effectiveness; equity issues and moral values are also considered. For instance, the fair allocation and distribution of medical care is considered an important aspect of social and distributive justice.

Given this diversity of criteria, it is difficult to find common ground for determining what information is required for setting priorities. While a number of studies have explored the views of the general public on the principles that should guide priority setting [79], others have investigated the criteria that are actually used in priority setting [10, 11]. Tanios et al. [11] found a convergence among decision makers regarding the relevance of the criteria that were considered. However, in general, there seems to be a substantial degree of plurality of opinions regarding the identified criteria. Accordingly, Van Exel et al. [7] concluded that it is unlikely that a single decision rule could satisfactorily cover all of the relevant equity principles and viewpoints.

Another concern relates to the question of whose values should be applied for priority setting: those of the experts, the general public, or the patients. Most advisory boards and committees tasked with decision making on reimbursement use criteria that are derived from experts’ opinions. When health effects are expressed in QALYs, this usually implies that the applied values are derived from representative community samples [12]. As taxpayers, members of the general public are assumed to adhere to principles of justice and equity as opposed to self-interest. However, it is also often argued that patients are the best judges of their own health situations, as they are likely to be better informed than healthy people or more adept at visualizing certain health states [1317]. To the best of our knowledge, head-to-head studies aimed at eliciting views on priority setting within general populations as opposed to patients’ views have not been conducted so far.

In summary, a clear set of criteria to guide priority setting is lacking for the Netherlands, and the views of the general public or patients are not explicitly considered in the decision-making process. One of the questions emerging from this situation is whether societal interests and views concerning healthcare priorities accord with those of patients, who are most affected by them. Therefore, the aim of this study was to investigate whether the societal perspective on healthcare priorities reflects the interests and views of patients.

2. Methods

2.1 The discrete choice methodology and criteria selection

Discrete choice experiments (DCEs) are widely used to elicit personal and societal preferences in health valuation studies [18, 19]. DCEs have also been applied to inform a wide range of health policy, planning, and resource allocation decisions within various healthcare settings. The statistical literature classifies DCEs within the framework of probabilistic discrete choice models [2025]. All DCE models establish the relative merit of a phenomenon based on its relative attractiveness. This method requires participants to choose from among two or more scenarios (choice tasks) described using specific attributes with distinct levels. The respondents were required to make complex priority choices for the study’s aim to be achieved. Accordingly, we decided to use the DCE method, as it simulates the choice-making process in a real-world setting and can therefore be considered less cognitively demanding. However, when using this method, the selection and identification of the most important and informative attributes (and their levels) must be performed carefully to enable respondents to process these attributes without becoming fatigued. In the present study, the attributes included in the paired scenarios represented criteria. They were carefully selected to ensure that the essential aspects of the decision-making process were captured.

In light of our review of the available studies, we decided to present the relevant health outcomes in a way that enabled them to be easily understood. The conventional QALY concept served as a starting point for developing this approach. We created a graphical representation of a QALY (including levels and durations of particular health states) that was extended to incorporate other relevant and important criteria. To ensure that our selection of criteria (attributes in the DCE) was appropriate and well-informed, we performed an extensive review of the literature (publications from 2014 onward) using the Medline and Embase databases. The review was conducted with the aim of extracting a set of criteria that reflected societal concerns about treatment effectiveness and equity considerations [26].

Presenting all of the criteria (n = 25) within a single choice task would have made the task too demanding for the respondents, and many of the criteria were only applicable in very specific circumstances. Therefore, each of the 25 criteria was individually assessed by two authors and one senior researcher within the research team, and differences in their assessments were discussed to reach a consensus. Finally, a combination of essential and prominent criteria that are receiving attention within the Dutch (and international) social value appraisal system were selected. Apart from reflecting the relevant treatment outcomes, the criteria also reflected the characteristics of the potential recipients of the new treatment to enable an assessment of its necessity according to the burden and cause of a disease. The scenarios were constructed on the basis of the following factors:

  1. Relevant outcomes of new and standard treatments: any changes in a patient’s health-related quality of life (HRQoL) after undergoing a new treatment or after a standard treatment (if it exists and is accessible) and any gains in life years for a patient after undergoing a new treatment or a standard treatment (if it exists and is accessible).

  2. Relevant characteristics of the patient: age, the initial HRQoL of a patient burdened by a disease or by an exacerbated health condition, and the cause of an acute onset (deterioration of the current HRQoL) associated with an accident, genetics, or an unhealthy lifestyle. The definition of disease or exacerbated health condition includes injury; illness; a handicap; genetic deficit; or another physical, mental, or nervous condition, disorder or ailment.

We decided to focus on acute onsets rather than on chronic or reversible conditions. The distinction of levels of the attributes in a DCE necessitate step-wise changes in HRQoL, whereas chronic conditions are commonly characterized by a trajectory of slow deterioration, often without any clear episodes. Therefore, acute onsets are the only conceivable cause of sudden and dramatic differences in HRQoL. We applied the term “acute” to denote the sudden onset of a new disease, the sudden deterioration of an existing one, or the occurrence of an accident.

2.2 Scenarios

Each scenario covered a health condition that existed prior to the acute onset, the effect of the standard treatment, and the effect of a new treatment. There have been numerous debates regarding the added benefits of using graphical representations to enhance task comprehension in DCEs [2731]. Specifically, it has been suggested that diagrams and other graphical representations could facilitate respondents in making complex choices among health states in DCEs conducted online [31]. Although not all attributes can be presented graphically, the most important attributes (length of life, quality of health/HRQoL, and additional gains or losses) can be depicted in this way. Therefore, instead of using conventional textual descriptions in the DCE, we opted for graphical representations of the scenarios, which were expected to enhance respondents’ comprehension. To reduce possible framing bias relating to the use of graphs and icons to describe hypothetical states, we inserted written explanations and notes to support the graphical presentation. This combination of graphic and textual material was used because the findings of previous studies indicate that graphical representation helps respondents to understand tasks better, whereas written explanations facilitate their judgment [30]. The DCE was designed to ensure that none of the scenarios were implausible or unrealistic. Face-to-face pilot testing of the survey was performed at the University of Groningen as well as online in April 2015. There was no time limitation set for the pilot testing, although the mean duration of the face-to-face version, including instructions and feedback, was estimated to be 20 minutes. The online version was typically shorter.

During the pilot phase, the following questions were verbally posed to the respondents (n = 8): Were the tasks easy? Was the presentation attractive and comprehensible? Were you able to make a choice? The respondents indicated that the task was not difficult to understand, and they confirmed that the graphic design facilitated comprehension of the task. The respondents also provided some suggestions for improving the instructions provided with the task. Values and assumptions derived from the existing body of literature [3237] were used for portraying the initial health state (which could include an underlying disease, or not), the cause of the acute onset, health gains after the patient commenced the new treatment, and health gains after the patient commenced the standard treatment (Table 1). A detailed explanation of the values and assumptions used for the level selection is presented in S1 Appendix and S1 Fig.

Table 1. Attributes and their corresponding levels used to describe criteria in the scenarios.

Attributes Levels
Initial health state
Age at onset 25, 50, 75
Initial HRQoL 0.5, 0.7, 0.9
Effects of new treatment
Change in HRQoL -0.2, -0.1, 0.0
Gain in LY 2, 10, 20
Effects of standard treatment
Change in HRQoL -0.2
Gain in LY 0*, 2, 10, 20
Cause of acute onset unhealthy lifestyle (1), accident/genetics (0)

*In case a standard treatment is not available

Because the task of comparing options on the basis of several criteria is a demanding one, we restricted the number of criteria used to construct the scenarios. Only the most relevant ones were included so as not to overload the respondents with information.

2.3 Choice tasks

The respondents were provided with an explanation of the paired scenarios along with instructions on how to proceed with the task. Combinations of criteria were presented in three steps: the patient’s initial health state, followed by the effects of the new treatment, and lastly a comparison of the effects of the new and standard treatments. The patient’s age and HRQoL before the acute onset were respectively depicted on the x- and y-axis of the graph, and the benefits of the new and standard treatments were color-coded and depicted as shaded areas on a plane. The cause of the acute onset was depicted by two icons, one of which represented unhealthy lifestyle elements, such as smoking/being overweight, and the other represented external factors, such as a genetic predisposition or an accident. To reduce the cognitive burden on respondents, a three-step approach was implemented that facilitated their understanding of the distinct attributes in the scenarios. As they progressed through the three consecutive steps, the respondents gradually gained familiarity with each attribute before they finally encountered the actual choice task. The first step (see Fig 1A) presented a hypothetical patient’s age and initial health state, depicting this individual’s HRQoL before the occurrence of the acute event. The second step introduced new information, namely gains from the new treatment in terms of additional life years and post-treatment HRQoL (Fig 1B). Only when they reached the third step (Fig 1C) did the respondents encounter the final scenario comprising the patient’s health state before the onset of the acute event, the benefits of the new treatment, and the benefits of the standard treatment, if this information was available. The respondents then had to decide which of the two scenarios they preferred for the hypothetical patients according to the information available to them. The question associated with the task was framed as follows: “Which patient should receive the new treatment?” Additionally, arrows and balloons were introduced to explain where the gains from the standard treatment or the new treatment could be found on the screen.

Fig 1. Example of the three-step discrete choice response task.

Fig 1

a) Step 1. Health before the onset of the acute event. b) Step 2. Health after the onset of the acute event and the administration of the new treatment. c) Step 3. Actual choice task. Health after the acute event and the administration of the new treatment and standard treatment and the cause of the acute onset.

2.4 Study design

We created a total of 200 paired scenarios. To diminish the burden on respondents and to avoid fatigue, this set was subdivided into 20 blocks, each comprising 10 choice tasks. The tasks were selected from a range of possible combinations of paired comparisons using an efficient design (the Ngene software program, the MNL model, and null priors) [18]. Additional constraints were applied (see Appendix A) to impose the new treatment’s superiority, which essentially implied that compared with the standard treatment, the new treatment would always be associated with an equal or higher HRQoL. The survey was designed as a web-based experiment. The respondents were randomly assigned to one of the 20 blocks so that each respondent completed only 10 response tasks.

2.5 Respondents

A sample drawn from the Dutch population and from a panel of patients who were at least 18 years old were contacted by a market research agency Survey Sampling International [SSI] based in Rotterdam, the Netherlands. Individuals from the patient panel were recruited and requested to self-report their diagnosis. We defined patients as individuals with current diseases or serious complaints. Therefore, we did not consider individuals with past experience of a disease. The rewards for participation, which are typically vouchers or electronic gift cards, were arranged vis-à-vis internal agreements between the participants and SSI. The patients were asked to indicate whether they had any of the following disease types: diabetes, neck and back problems, heart disease, hearing or vision loss, asthma/COPD, eczema, mental health problems, stroke, rheumatism, cancer, epilepsy, lung disease, or gastrointestinal disease. Patients who had more than one diagnosed disease were allowed to report multiple diseases. The sampling design for the general population did not enable us to verify whether potential respondents had been diagnosed with a disease. The Medical Ethics Review Committee at the University Medical Center Groningen issued a waiver for this study, stating that the pertinent Dutch legislation (the Medical Research Involving Human Subjects Act) was not applicable to this non-interventional survey study (METc 2014.181).

2.6 Analysis

The data were analyzed using the McFadden conditional logit model [38, 39] with dummy-coded variables representing the levels of the attribute (Stata, clogit routine). The probit model (Stata, asmprobit routine) relaxing the independence of irrelevant alternatives has been executed but was outperformed by the conditional logit model (goodness-of-fit results are not presented). A mixed logit model (mixlogit) that accounted for random variations in preferences across respondents (heterogeneity) was also considered. However, we determined that the basic conditional logit model was sufficient for meeting the aims of the present experimental study, which focuses on overall preferences.

A large number of respondents were required to obtain precise estimates for the (paired) scenarios, According to Lancsar and Louviere [40], 20 respondents per survey block (20 blocks in this study) are sufficient to ensure that a model is reliable, but a bigger sample is required to conduct a significant post-hoc analysis. The minimal number of respondents for the present study was 400 patients and 400 members of the general population, but a larger sample was recruited to allow the possibility of conducting a further analysis.

We conducted three comparative analyses of the general population and the patients. First, we graphically depicted the regression coefficients for the criteria and their levels to convey relative positive or negative preferences. Second, we applied the range method to compare the relative importance assigned to the criteria for each sample [41]. We calculated the range between the coefficients for the individual levels, which were then converted into proportions. We applied this method to calculate and compare the difference in the preference weights for the best and worst levels of a criterion. This difference yielded an estimate of the relative importance of that criterion over the range of levels.

Wattribute(i)=maxCiminCij(maxCjminCj)

Third, we conducted an analysis using the asclogit routine in Stata that included second-order interactions between respondent types and all of the criteria in the joint sample. The statistical significance of an interaction indicated that the general population and the patients had different preferences regarding the criterion of interest.

3. Results

3.1 Respondents

Data were collected between July 2015 and January 2016. A total of 1,986 respondents from the general population and 2,256 patients were invited to participate in the study. Sociodemographic information provided by SSI indicated that with reference to age and sex, both of the recruited samples (from the general population and the list of patients) were representative of the Dutch population. Because of privacy restrictions, the background characteristics were collected only at the stage the participants were invited, and were subsequently aggregated. Therefore, this information was not available for the subset of respondents who were included in the analysis after completing the survey. Some patients had been diagnosed with more than one disease, and the most common diagnoses were neck and back pain, diabetes, and asthma/COPD. Out of all of the individuals who registered for the survey (Table 2), the numbers of respondents who completed the survey, and whose answers were included into the analysis, were 1,253 (63% of invited persons) for the general population sample and 1,389 (62% of invited persons) for the patient sample. We performed a quality check of the responses by identifying respondents who consistently chose either the left or right options as their responses. Only 36 respondents (0.01%) demonstrated this response pattern. Therefore, we decided to retain these respondents in the main analysis, as such a modest proportion would not bias the main results. Additionally, because the study design did not include any clearly dominated scenarios, we could not rule out any kind of pattern as being invalid.

Table 2. Characteristics of the participants in two sub-samples used in the study.

Characteristics General population Overall registered
N = 1,986
Patients Overall registered N = 2,256
Female, N (%) 1104 (56) 1239 (55)
Age, mean (SD) 46.6 (14.4) 47.8 (14.0)
Age group, N (%)
    18–24 286 (14) 244 (11)
    25–34 194 (10) 223 (10)
    35–44 240 (13) 281 (12)
    45–54 485 (24) 580 (26)
    Older 55 781 (39) 928 (41)
Diagnosed with*, N (%)
    Neck and back pain - 995 (44)
    Diabetes - 736 (33)
    Asthma/COPD - 418 (19)
    Mental health problems - 383 (17)
    Hearing or vision loss - 370 (16)
    Eczema - 352 (16)
    Rheumatism - 335 (15)
    Heart disease - 302 (13)
    Gastrointestinal disease - 168 (7)
    Cancer - 153 (7)
    Lung disease - 86 (4)
    Stroke - 80 (4)
    Epilepsy - 53 (2)

*The total frequencies exceeded 2,256 because some patients were diagnosed with more than one disease.

3.2 The relative importance of the criteria

Table 3 shows the parameter estimates for the conditional logit model, and Fig 2 shows the attribute weights. The results indicating the relative importance of the attributes and their weights obtained using the alternative mixed logit model did not differ substantially from those obtained using the conditional logit model. These results are presented in S1 Table. For example, respondents from both the general population and patient samples indicated that a gain in life years following the new treatment had the most significant effect (weights of 0.31 vs. 0.35 in the conditional logit model and 0.3 vs. 0.34 in the mixed logit model). The criteria ranked second for the general population respondents were age and cause of disease (similarly, 0.22). Respondents strongly favored treating a 25-year-old over a 50-year-old, and this effect was even stronger for a 75-year-old patient. Respondents in the patient group rated the cause of the disease as the second most important criterion (0.23), which was only slightly higher than their ranking of age at acute onset (0.22). The initial HRQoL (0.14 for the general population sample vs. 0.12 for the patient sample) and the treatment’s effect of maintaining the HRQoL at the same level as it was prior to the acute onset (0.08 for the general population sample vs. 0.07 for the patient sample) were the least important criteria within both groups. The outcomes of the standard treatment had no significant effect (0.03 for the general population sample vs. 0.02 for the patient sample).

Table 3. Parameter estimates (clogit) of the six criteria for the two sub-samples (based on completed surveys).

General population (SE) Patients (SE)
N = 1,253 N = 1,389
Obs = 47,756 Obs = 51,932
SCENARIO CRITERIA
Patient characteristics
Age
    Age 25 (reference) - -
    Age 50 -0.17 (0.03)* -0.10 (0.03)*
    Age 75 -0.67 (0.04)* -0.52 (0.04)*
Initial Health-Related Quality of Life (HRQoL)
    HRQoL 0.5 (reference) - -
    HRQoL 0.7 0.28 (0.03)* 0.24 (0.03)*
    HRQoL 0.9 0.42 (0.03)* 0.28 (0.03)*
Cause of acute onset
    Accident, genetics (reference) - -
    Unhealthy lifestyle -0.65 (0.03)* -0.55 (0.03)*
New treatment outcomes
HRQoL change after new treatment (ΔHRQoL)
    ΔHRQoL -0.2 (reference) - -
    ΔHRQoL -0.1 0.16 (0.03)* 0.05 (0.03)
    ΔHRQoL 0 0.25 (0.03)* 0.17 (0.03)*
Life years gained after new treatment (LYnew)
    LYnew 2(reference) - -
    LYnew 10 0.64 (0.04)* 0.55 (0.04)*
    LYnew 20 0.94(0.04)* 0.84 (0.04)*
Standard treatment outcomes***
Life years gained after standard treatment (LYstandard)
    Standard treatment unavailable (reference) - -
    LYstandard 2 -0.04 (0.03) 0.00 (0.03)
    LYstandard 10 -0.04 (0.03) 0.02 (0.03)
    LYstandard 20 -0.10 (0.05)** 0.01 (0.04)
Goodness-of-fit -14679 -16445

*P < 0.01

**P < 0.05

*** the change in Health-Related Quality of Life (HRQoL) after administering the standard treatment had no variance, as it was a fixed attribute with one possible level of -0.2 (the 7th criterion)

Fig 2. Attribute weights for the general population group and the patient group.

Fig 2

3.3 Differences between the general population and patient samples

Overall, the results for the relative importance of the criteria were almost identical for the two samples. However, there were statistically significant differences relating to preferences for specific criteria within the total sample. This analysis revealed statistically significant second-order interactions for combinations of the sample type with the following criteria: age, initial HRQoL (0.9), cause of acute onset, change in HRQoL, and life years gained by the respondent after undergoing the new treatment (S2 Table). The significance of the two criteria (age and cause of the disease), as revealed in this analysis, supported the results of the analyses conducted for each individual sample (Table 3). Finally, although the interactions of the sample type with the change in HRQoL and the initial HRQoL were statistically significant, they did not lead to any modification in our interpretation of the relative importance of the attribute, as these criteria remained among those that were ranked the lowest by respondents in both samples.

4. Discussion

This study, which was aimed at investigating whether society and patients share the same interests and views concerning healthcare priorities, was operationalized using two samples of respondents who expressed their preferences for a specific treatment in a specific situation, defined by particular criteria. These criteria, comprising the age of the patient and the effectiveness of the treatment, were expected to capture societal concerns, especially those relating to treatment effectiveness and equity considerations.

The results of the analysis revealed the relative importance of additional survival years attributed to the new treatment. Similar results were reported in a recent study [37], which showed that respondents tend to favor scenarios in which QALY gains are significant. The results of the present study showed that preferences related to new treatments also depend on patient characteristics (such as the individual’s age or initial health state). In a German study, criteria for prioritizing health care based on patients’ personal characteristics were examined. The findings of this study indicated that disease severity and patients’ HRQoL are the most important attributes, whereas an unhealthy lifestyle was ranked lowest [42]. However, these researchers did not associate an unhealthy lifestyle with the cause of a disease, as we did in the present study. Our findings as well as those of Skedgel et al. [37] suggest that preferences do not strictly and exclusively follow QALY-maximizing decision rules; rather, they incorporate the characteristics of both the patient and the treatment. The findings of another study [43] also indicate that age (i.e., children or a younger population) and saving lives, even when HRQoL values are low, are deemed important. The findings of the present study partly support those of the latter study, which emphasized the importance of saving younger patients. The availability and effect of a standard treatment appeared to have no effect on the appraisal of new treatments. This finding concurs with that of Green et al. [19], who found that the availability of “other treatments” was the least important attribute.

Our results can be considered in the context of Williams’ “fair innings” argument [44]. According to this argument, every individual is entitled to a lifespan that is considered reasonable or “fair.” The “fair innings” argument takes account of the characteristics of both the patient and the treatment (in terms of health gains) [45, 46] over this individual’s entire life span. Thus it takes account not only of the patients’ past health and actual ages but also of their future health. According to Williams [44], life years gained for individuals who have not had a “fair innings” should be valued above life years gained for people whose “innings” have been fair. However, our findings did not endorse this argument. We observed that age and a better initial health status (i.e., health in the past and actual age) were accorded a high level of societal importance along with health gains experienced after the new treatment (i.e., future health). The findings of two studies conducted, respectively, in Canada [37] and Sheffield (in the UK) [47] also indicated a strong preference for providing treatment to younger individuals. However, in a study conducted in the UK, Lancsar [48] found that while preference weights for the patient’s age at acute onset were small, weights for the patient’s age at death were larger. Evidently, the importance of age is not solely attributable to the fair innings principle; other reasons for favoring younger patients could relate to their economic or social productivity.

The argument concerning the “burden” or “severity of disease” has been widely applied within the literature on societal preferences [8, 46, 49, 50]. Shah [49], who conducted a literature review, observed that an individual’s pre-treatment health state is the most widely used criterion for defining the burden of disease. We therefore incorporated this definition as initial HRQoL into our study. There is considerable heterogeneity in existing definitions of the burden of disease and in the associated study methods (personal trade-off, DCE, and social welfare function), which may influence a study’s outcome. Nevertheless, Shah pointed out that in most studies, respondents were, on the whole, inclined to prioritize the severely ill. Skedgel et al. [47] also found that severely ill patients were favored over those who were less severely ill. This finding does not accord with our own findings, which showed that those patients whose initial HRQoL levels were higher were favored over those whose initial HRQoL levels were lower. There are two possible explanations for this finding. The first explanation could be that the respondents were inclined to favor younger and generally healthier individuals because they are economically and demographically productive. This strategy would therefore contribute the most to overall economic well-being within society. A second possible explanation relates to the HRQoL loss of 0.2 after the acute event that was depicted in the graph and how it was visually perceived. A level of HRQoL that is initially high would still appear high, even after a loss of 0.2 or 0.1 (in a large area of the graph), compared with an initially low HRQoL, which could worsen or remain at a low level (in a smaller area of the graph). Similar results were reported by Wetering et al. [51], demonstrating higher preferences for treating individuals who are already in a relatively good state of health before undergoing treatment. In that study, the researchers used graphical representations of the scenarios in which specific areas showing losses were depicted.

Our findings also demonstrated the importance of a lifestyle-related cause; a criterion that is rarely taken into consideration. The argument about individuals who are responsible for the causes of their diseases was raised in an earlier study by Singh et al. [52], which emphasized that the public accorded higher priority to interventions for diseases in which the patient has no control over the cause of the disease. Conversely, they accorded lower priority to programs for treating illnesses that were “self-inflicted.” In another study focusing on the prioritization of investments in health service innovations [53], the researchers reported that the respondents were negatively inclined toward funding innovations that targeted individuals with drug addiction and obesity issues. Although the respondents in our study tended to choose the alternatives associated with more survival years after the new treatment had been administered, they prioritized younger patients, those whose acute onset stemmed from an accident or genetic cause, and those with a higher initial HRQoL. This finding accords with those of other studies conducted in various countries that focused on societal preferences [37, 54, 55]. For example, the findings of Luyten et al. [54], who conducted a study in Belgium, indicate a stronger preference for individuals who did not cause their own illnesses. The findings of Gu et al. [8] suggest that in general, the young are favored over the old, the more severely ill are favored over the less severely ill, and individuals with self-induced illnesses tend to receive lower priority than those whose illnesses are not self-induced. In studies in which health gains were considered, larger gains were universally preferred. These findings trigger the interest to explore the interactions between lifestyles and various other criteria.

Our findings demonstrated that patients and members of the general population are able to compare different health outcomes and to express their preferences for one particular outcome. In most countries, including the Netherlands, the task of prioritizing healthcare interventions is mainly delegated to experts. To strengthen public acceptance and consensus on how healthcare interventions should be prioritized, a nation-wide assessment of public preferences, such as the one reported in our study, could be considered. Our finding that differences in the preferences of patients and non-patients are modest is an important one, as it suggests that assessments could be harmonized relatively easily.

Within the literature, there is no consensus regarding the issue of whether values relating to healthcare priorities differ between patients and members of general population. For instance, dissimilarities in the values for health states between these samples were reported in a number of previous studies [30, 5558]. Although we did not find any substantial differences between the general population and patients, the analysis of specific interaction terms did reveal a difference between the two samples relating to the relative importance of criteria (the cause of the acute onset was more important to patients than to members of the general population). The patients prioritized individuals whose diseases were induced by accidents or by genetic predispositions, indicating that patients may attach more importance to individuals who take responsibility for their own health. However, this observation could be attributed to the icon format used to depict the cause of the disease.

In the present study, we assumed that a graphical format would add value because it is attractive and relatively easy to comprehend. We also assumed that the use of supplementary visual aids, such as pop-up notes and balloons, would improve the respondents’ understanding. However, a graphical representation could plausibly influence the respondents to some extent. For example, the design could differentially influence the decisions of respondents who focus on the sizes of the graph areas compared with those who focus more on instructions. Moreover, the icons could be perceived differently when presented together with a graph or a diagram. As colorful and easy-to-understand elements, icons may initially capture the attention of respondents. Consequently, respondents could pay more attention to an attribute that is presented as an icon than to the same attribute presented in a conventional textual format. Therefore, it would be useful to provide respondents with step-by-step visual guidance and verbal or even video-recorded instructions that effectively discourage them from ignoring any of the elements on the screen.

During the pilot-testing phase, we confirmed the ease of comprehension and the appeal of our study design. However, providing the respondents with an additional opportunity to leave their feedback about the survey design and the challenges they faced could enhance the quality of the responses. These additional inputs would contribute to a deeper understanding of how the respondents executed the choice tasks and the decision strategies that they used.

This study had a few limitations that should be noted. First, although the two samples were representative of the Dutch population in terms of sex and age, the respondents from the general population were, on average, slightly younger than the patients. Moreover, the number of individuals below the age of 45 years within the general population sample was higher than the number of such individuals within the patient sample. It is plausible that younger respondents within the general population sample would favor scenarios in which younger patients were presented.

Second, the sampling design for the general population did not allow for verification of whether potential respondents had been diagnosed with a disease. Although we acknowledge that the general population includes patients and that some of the individuals may have had a disease at the time of their participation in the survey, our aim was not to investigate representative proportions of healthy and non-healthy individuals within the general population. Therefore, the general population was considered to comprise a mix of individuals with and without diseases and not just healthy individuals. However, access to information on the health states of respondents from the general population would enable the performance of subgroup analysis that would yield additional information about the heterogeneity of preferences for new treatments in the Netherlands. In general, these results should be interpreted with caution, as the representativeness of the samples in terms of the respondents’ health states, education levels, and socioeconomic statuses were not considered. Only age and sex were taken into account when determining the representativeness of the samples.

Third, our decision to focus on the acute onset of non-reversible deterioration of health conditions limits the generalizability of the results. Although many diseases are characterized by non-reversible deterioration of the quality of a patient’s health, for some diseases or individuals, there could be opportunities for patients to return (partially) to their initial health levels. Our DCE design limited the representation of such outcomes and thus neglected a proportion of actual health trajectories that can be observed in practice. Finally, we acknowledge the importance of qualitative research in which members of the general population or patients are involved in the selection of attributes and their levels, although the current study did not include this type of research. As the current study implies a high level of awareness and knowledge of priority assessment processes in the Netherlands, patients and the general population were not involved in this conceptual stage of criteria selection. However, members of the general population were involved in a subsequent pilot testing phase and were able to express their opinions regarding the comprehensibility of the tasks.

In conclusion, we found that five out of the six presented criteria affected respondents’ preferences for specific treatment scenarios, constituting a multifaceted system of distinctive and even conflicting arguments and criteria. Nevertheless, our study demonstrated that both patients and members of the general population were able to make choices regarding the prioritization of healthcare interventions. Moreover, patients and members of the general population demonstrated similar patterns of preferences, indicating that the preferences of the general population could be representative of the views of patients in the area of priority setting relating to health care. The assessment process may be enhanced through the involvement of members of the general population in priority assessments, in addition to experts, thereby strengthening public acceptance of reimbursement and other decisions relating to healthcare priority setting.

Supporting information

S1 Fig. Graphical explanation.

Explanation of various options in the scenarios: (A) Possible health states before onset; (B) Example of new treatment for 25-year-old patient with 0.9 HRQoL; (C) Standard treatment for 25-year-old patient with 0.9 HRQoL (if accessible and exist).

(TIF)

S1 Table. Parameter estimates (mixed logit) of the 6 criteria for the two sub-samples (based on completed surveys).

(DOCX)

S2 Table. Parameter estimates of the 6 criteria with interaction terms (sample type × criteria) for the combined sample of general population and patients.

(DOCX)

S1 Appendix

(DOCX)

S2 Appendix

(PDF)

Data Availability

All data (Stata format) is available at the public repository Figshare, under doi 10.6084/m9.figshare.10000556.

Funding Statement

Financial support for this study was provided entirely by ZonMw (The Netherlands Organization for Health Research and Development; grant number 152002053). The funding agreement ensured the authors’ independence in designing the study, interpreting the data, writing, and publishing the report.

References

  • 1.Brixner DI, Watkins JB. Can CER be an effective tool for change in the development and assessment of new drugs and technologies? J Manag Care Pharm. 2012; 18(5): S6–S11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.McCabe C, Claxton K, Culyer AJ. The NICE cost-effectiveness threshold: what it is and what that means. Pharmacoecon. 2008; 26(9): 733–44. [DOI] [PubMed] [Google Scholar]
  • 3.Pakketbeheer in de praktijk. CVZ, Diemen; 2006.
  • 4.Rechtvaardige en duurzame zorg. Advies uitgebracht door de Raad voor de Volksgezondheid en Zorg aan de minister van Volksgezondheid, Welzijn en Sport Zoetermeer; 2007.
  • 5.Zinnige en Duurzame zorg. Advies uitgebracht door de Raad voor de Volksgezondheid en Zorg aan de minister van Volksgezondheid, Welzijn en Sport, Zoetermeer; 2006.
  • 6.Brouwer WBF, Culyer AJ, van Exel NJA, Rutten FFH. Welfarism vs. extra-welfarism. J Health Econ. 2008; 27: 325–338. 10.1016/j.jhealeco.2007.07.003 [DOI] [PubMed] [Google Scholar]
  • 7.Van Exel J, Baker R, Mason H, Donaldson C, Brouwer W, EuroVaQ Team. Public views on principles for health care priority setting: Findings of a European cross-country study using Q methodology. Soc Sci & Med. 2015; 126: 128–137. [DOI] [PubMed] [Google Scholar]
  • 8.Gu Y, Lancsar E, Ghijben P, Butler JRG, Donaldson C. Attributes and weights in health care priority setting: A systematic review of what counts and to what extent. Soc Sci & Med. 2015;146: 41–52 [DOI] [PubMed] [Google Scholar]
  • 9.Kaplan RM. Value judgement in the Oregon Medicaid Experiment. Med Care.1994; 32(10): 975–88 10.1097/00005650-199410000-00001 [DOI] [PubMed] [Google Scholar]
  • 10.Fischer KE. A systematic review of coverage decision-making on health technologies-evidence from the real world. Health Policy. 2012; 107(2–3): 218–230. 10.1016/j.healthpol.2012.07.005 [DOI] [PubMed] [Google Scholar]
  • 11.Tanios N, Wagner M, Tony M, Baltussen R, van Til J, Rindress D, et al. ; International Task Force on Decision Criteria. Which criteria are considered in healthcare decisions? Insights from an international survey of policy and clinical decision makers. Int J Technol Assess Health Care. 2013; 29(4): 456–65. 10.1017/S0266462313000573 [DOI] [PubMed] [Google Scholar]
  • 12.Drummond MF, Sculpher MJ, Claxton K, Stoddart GL, O'Brien BJ, Torrance GW. Methods for the economic evaluation of health care programmes. Fourth ed Oxford University Press; 2015. [Google Scholar]
  • 13.Krabbe PFM. The Measurement of Health and Health Status: Concepts, Methods and Applications from a Multidisciplinary Perspective. San Diego: Elsevier/Academic Press; 2016. [Google Scholar]
  • 14.Gandjour A. Theoretical foundation of patient v. population preferences in calculating QALYs. Med Decis Making. 2010; 30 (4): 57–63. [DOI] [PubMed] [Google Scholar]
  • 15.Dolan P, Kahneman D. The interpretation of utility and their implications for the valuation of health. Econ J. 2008; 118: 215–234. [Google Scholar]
  • 16.De Wit GA, Busschbach JJV, De Charro F. Sensitivity and perspective in the valuation of health status: whose values count? Health Econ. 2000; 9(2): 109–126. [DOI] [PubMed] [Google Scholar]
  • 17.Krabbe PFM, Tromp N, Ruers TJM, van Riel PLCM. Are patients’ judgments of health status really different from the general population? Health Qual Life Outcomes. 2011; 9:31 10.1186/1477-7525-9-31 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Stolk EA, Oppe M, Scalone L, Krabbe PFM. Discrete choice modeling for the quantification of health states: The case of the EQ-5D. Value Health. 2010; 13:1005–1013. 10.1111/j.1524-4733.2010.00783.x [DOI] [PubMed] [Google Scholar]
  • 19.Green C, Gerard K. Exploring the social value of health-care interventions: a stated preference discrete choice experiment. Health Econ. 2009, 18(8): 951–976. 10.1002/hec.1414 [DOI] [PubMed] [Google Scholar]
  • 20.Arons MMA, Krabbe PFM. Probabilistic choice models in health-state valuation research: Background, theories, assumptions and applications. Expert Rev Pharmacoecon Outcomes Res. 2013; 13(1): 93–108. 10.1586/erp.12.85 [DOI] [PubMed] [Google Scholar]
  • 21.Krabbe PFM. Thurstone scaling as a measurement method to quantify subjective health outcomes. Med Care. 2008; 46(4): 357–365. 10.1097/MLR.0b013e31815ceca9 [DOI] [PubMed] [Google Scholar]
  • 22.Louviere JJ, Lancsar E. Choice experiments in health: the good, the bad, the ugly and toward a brighter future. Health Econ Policy Law. 2009; 4: 527–546. 10.1017/S1744133109990193 [DOI] [PubMed] [Google Scholar]
  • 23.Louviere LL, Woodworth G. Design and analysis of simulated consumer choice or allocation experiments: An approach based on aggregate data. J Mark Res.1983; 20(4): 350–367. [Google Scholar]
  • 24.Marley AAJ, Louviere JJ. Some probabilistic models of best, worst, and best-worst choices. J Math Psychol. 2005; 49: 464–480. [Google Scholar]
  • 25.Thurstone LL. A Law of Comparative Judgment. Psychol Rev. 1927; 4: 273–286. [Google Scholar]
  • 26.Vermeulen KM, Krabbe PFM. Value judgment of health interventions from different perspectives: arguments and criteria. Cost Eff Resour Alloc. 2018; 16:16 10.1186/s12962-018-0099-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Gamliel E, Kreiner H. Is a picture worth a thousand words? The interaction of visual display and attribute representation in attenuating framing bias. Judgm Decis Mak. 2013; 8(4): 482–491. [Google Scholar]
  • 28.Mueller S, Lockshin L, Louviere JJ. What you see may not be what you get: Asking consumers what matters may not reflect what they choose. Mark Lett.2010; 21:335–350. [Google Scholar]
  • 29.Veldwijk J, Lambooij MS, van Til JA, Groothuis-Oudshoorn CGM, Smit HA, de Wit GA. Words or graphics to present a Discrete Choice Experiment: Does it matter? Patient Educ Couns. 2015; 98(11):1376–1384. 10.1016/j.pec.2015.06.002 [DOI] [PubMed] [Google Scholar]
  • 30.Vriens M, Loosschilder GH, Rosbergen E, Wittink DR. Verbal versus Realistic Pictorial Representations in Conjoint Analysis with Design Attributes. J. Prod. Innovat. Manag. 1998; 15: 455–467. [Google Scholar]
  • 31.Lancsar E, Wildman J, Donaldson C, Ryan M, Baker R. Deriving distributional weights for QALYs through discrete choice experiments. J. Health Econ.2011; 30(2): 466–478. 10.1016/j.jhealeco.2011.01.003 [DOI] [PubMed] [Google Scholar]
  • 32.Skirbekk V. Age and Individual Productivity: A Literature Survey. Vienna Yearbook of Population Research. 2004; 2: 133–153. [Google Scholar]
  • 33.Smith JP. Unraveling the SES: Health Connection. Population and Development Review. 30, Supplement: Aging, Health, and Public Policy. 2004: 108–132.
  • 34.Matthews RJ, Smith LK, Hancock RM, Jagger C, Spiers NA. Socioeconomic factors associated with the onset of disability in older age: a longitudinal study of people aged 75 years and over. Soc Sci & Med. 2005; 61(7): 1567–1575. [DOI] [PubMed] [Google Scholar]
  • 35.Torrance GW. Utility approach to measuring health-related quality of life. J Chronic Dis. 1987; 40(6): 593–600. 10.1016/0021-9681(87)90019-1 [DOI] [PubMed] [Google Scholar]
  • 36.Megari K: Quality of Life in Chronic Disease Patients. Health Psychol Res. 2013; 1(3): 27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Skedgel C, Wailoo A, Akehurst R, Hon. Societal Preferences for Distributive Justice in the Allocation of Health Care Resources: A Latent Class Discrete Choice Experiment. Med Decis Making. 2015; 94–105. 10.1177/0272989X14547915 [DOI] [PubMed] [Google Scholar]
  • 38.McFadden D. Conditional logit analysis of qualitative choice behavior. In: Zarembka P. (ed.), Frontiers in Econometrics New York: Academic Press; 1974. [Google Scholar]
  • 39.McFadden D. Economic choices. Am Econ Rev. 2001; 91(3): 351–378. [Google Scholar]
  • 40.Lancsar E, Louviere J. Conducting discrete choice experiments to inform healthcare decision making: a user’s guide. Pharmacoecon. 2008; 26 (8): 661–77. [DOI] [PubMed] [Google Scholar]
  • 41.Orme B. Interpreting the Results of Conjoint Analysis. In: Getting Started with Conjoint Analysis: Strategies for Product Design and Pricing Research. Second Edition, Madison, Wis.: Research Publishers LLC; 2010. [Google Scholar]
  • 42.Diederich A, Swait J, Wirsik N. Citizen Participation in Patient Prioritization Policy Decisions: An Empirical and Experimental Study on Patients’ Characteristics. PLoS ONE. 2012; 7(5): e36824 10.1371/journal.pone.0036824 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Baker R, Wildman J, Mason H, Donaldson C. Q-ing for health–a new approach to eliciting the public’s views on healthcare resource allocation. J Health Econ. 2014; 23: 283–297. [DOI] [PubMed] [Google Scholar]
  • 44.Williams A. Intergenerational equity: an exploration of the ‘fair innings’ argument. J Health Econ. 1997; 6: 117–132. [DOI] [PubMed] [Google Scholar]
  • 45.Stolk EA, Pickee S, Ament AHJA, Van Bussbach JJ. Equity in health care prioritization: An empirical inquiry into social value. Health Policy. 2005; 74: 343–355. [DOI] [PubMed] [Google Scholar]
  • 46.Nord E. Concerns for the worse off: fair innings versus severity. Soc Sci & Med. 2005; 60: 257–263. [DOI] [PubMed] [Google Scholar]
  • 47.Skedgel CD, Wailoo AJ, Akehurst RL. Choosing vs. allocating: discrete choice experiments and constant-sum paired comparisons for the elicitation of societal preferences. Health Expect. 2015; 18(5): 1227–1240. 10.1111/hex.12098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Lancsar E, Wildman J, Donaldson C, Ryan M, Baker R. Deriving distributional weights for QALYs through discrete choice experiments. J Health Econ. 2011; 30(2): 466–78. 10.1016/j.jhealeco.2011.01.003 [DOI] [PubMed] [Google Scholar]
  • 49.Shah KK. Severity of illness and priority setting in healthcare: a review of the literature. Health Policy. 2009; 93(2–3): 77–84. 10.1016/j.healthpol.2009.08.005 [DOI] [PubMed] [Google Scholar]
  • 50.Whitty JA, Lancsar E, Rixon K, Golenko X, Ratcliffe J. A systematic review of stated preference studies reporting public preferences for healthcare priority setting. Patient. 2014; 7(4): 365–86. 10.1007/s40271-014-0063-2 [DOI] [PubMed] [Google Scholar]
  • 51.van de Wetering L, van Exel J, Bobinac A, Brouwer WBF. Valuing QALYs in Relation to Equity Considerations Using a Discrete Choice Experiment. Pharmacoecon. 2015; 33(12): 1289–1300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Singh J, Lord J, Longworth L, Orr S, McGarry T, Sheldon R, et al. Does Responsibility affect the public's valuation of health care interventions? A Relative Valuation Approach to Health Care Safety. Value Health. 2012; 15: 690–698. 10.1016/j.jval.2012.02.005 [DOI] [PubMed] [Google Scholar]
  • 53.Erdem S, Thompson C. Prioritising health service innovation investments using public preferences: a discrete choice experiment. BMC Health Serv Res. 2014; 14(1). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Luyten J, Kessels R, Goos P, Beutels P. Public Preferences for Prioritizing Preventive and Curative Health Care Interventions: A Discrete Choice Experiment. Value Health. 2015; 18: 224–233. 10.1016/j.jval.2014.12.007 [DOI] [PubMed] [Google Scholar]
  • 55.Ubel PA, Loewenstein G, Jepson C. Whose quality of life? A commentary exploring discrepancies between health state evaluations of patients and the general public. Qual Life Res. 2003; 12: 599–607. 10.1023/a:1025119931010 [DOI] [PubMed] [Google Scholar]
  • 56.Mann R, Brazier J, Tsuchiya A. A comparison of patient and general population weightings of EQ-5D dimensions. J Health Econ. 2009; 18(3): 363–372. [DOI] [PubMed] [Google Scholar]
  • 57.Little MHR, Reitmeir P, Peters A, Leidl R. The Impact of Differences between Patient and General Population EQ-5D-3L Values on the Mean Tariff Scores of Different Patient Groups. Value Health. 2009; 17: 364–371. [DOI] [PubMed] [Google Scholar]
  • 58.Rowen D, Mulhern B, Banerjee S, Tait R, Watchurst C, Smith SC, et al. Comparison of General Population, Patient, and Carer Utility Values for Dementia Health States. Med Decis Making. 2015; 35(1): 68–80. 10.1177/0272989X14557178 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Jan Ostermann

20 Feb 2020

PONE-D-19-29090

VALUE JUDGMENT OF NEW MEDICAL TREATMENTS: SOCIETAL AND PATIENT PERSPECTIVES

PLOS ONE

Dear Dr Nicolet,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Addressing the 3 reviewers' comments will likely require substantial revisions to the manuscript.  Please ensure that adequate information about the development of the DCE is included in the revised manuscript to be able to assess the validity and reliability of the approach. If necessary, consider including an appendix with additional details. Please ensure that the study data are available as per journal policy.  

We would appreciate receiving your revised manuscript by Apr 05 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Jan Ostermann

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ

3. Please amend your list of authors on the manuscript to ensure that each author is linked to an affiliation. Authors’ affiliations should reflect the institution where the work was done (if authors moved subsequently, you can also list the new affiliation stating “current affiliation:….” as necessary).

4. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: I Don't Know

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This article deals with a very important topic and challenging question, namely how reimbursement procedures could be improved by incorporating societal and/or patient needs more explicitly in reimbursement criteria. The answer to this question needs to be addressed via conducting research such as the study here presented. However, the following points of critique should be discussed/incorporated:

" The title, methods of abstract and methods section needs to mention more explicitly that the study was conducted in the Netherlands/with Dutch participants. The discussion should include a discussion of how these findings (do not) translate to other countries in Europe/US, in view of the nationally organized reimbursement system as pointed out in the introduction

" Title: consider adding: VALUE JUDGMENT OF NEW MEDICAL TREATMENTS: SOCIETAL AND PATIENT PERSPECTIVES to inform priority setting in the Netherlands

" Abstract methods: include how many participants (patients and general population separately), type of survey (online?), type of analysis

" Abstract conclusion should include a stronger recommendation; what is the key take-away for priority setting that could be recommended based on this research?

" Consider changing "discrete choice model" to "discrete choice experiment" (DCE), as this term is more commonly used

" Methods section in abstract and paper: ensure this is written in past tense, eg in methods of abstract "The scenarios graphically represented treatment outcomes and patient characteristics"

" Methods section:

o More information necessary about the selection/development of criteria/attributes; who and how was this done exactly? Qualitative research (e.g. interviews, focus groups such as the nominal group technique) with the stakeholder group eventually answering the DCE survey is recommended for the selection of DCE attributes. Was qualitative research performed? Were patients/participants of the general public involved for the selection? To what extent was this driven by currently used criteria for reimbursement (i.e. top down approach) vs bottom-up (e.g. via patients/general public)? Another method for limiting the criteria would have been via grouping them. These aspects (rationale, importance of qualitative research) should be discussed/reflected on both in the methods section (why did you choose a certain approach) and any limitations as well as reflections on the different ways for attribute selection and the impact of these different ways on the final results should be discussed in the discussion section.

o Why DCE and not another elicitation method such as swing weighting, best worst scaling? The rationale for DCE should be in the methods and any potential impact of this choice on the results in the discussion

o Why a focus on acute onsets?

o P.5: delete "rigorous". Instead more information on the methods for this literature review should be included (databases consulted, year of literature review)

o P. 7 delete "representative", as this was only tested for the sex and age of participants

o Clarify what "SSI" means

o What types of rewards were given to participants?

o Pilot: how long did it take for participants to fill it in?

" Results:

o More detail necessary what is meant with "substantially"

o was there open comment field/question where participants could give feedback on the survey; how easy/difficult, areas for improvement/relevance of the characteristics/whether the characteristics captured the most important factors influencing their preference? If so this should be discussed in the results. If not, this should be discussed in the discussion

" Discussion:

o More reflection on methodological choices and potential limitations:

" See above for comment regarding attribute selection

" Sample has several potential limitations, representativeness was only tested for sex and age as other characteristics were not captured. This should be discussed in the limitations

" From the general population there is no certainty that they weren't ill at the time of participation since this information was not gathered; this limitation should be clearly discussed in the discussion section and it should be clearly stated in the methods that the general population may have included ill participants

" "our decision to focus on acute onset of non-reversible deterioration of health conditions limits the generalizability of the results": explain in more detail in what way the generalizability is limited

o Discuss more extensively: What does this research learn us moving forward for the priority setting procedure in the Netherlands (and potentially beyond) Should patients or citizens be more actively involved eg via DCE surveys? Any critique on the current process and how this should be approached moving forward, based upon this study? You identified the most influential criteria, how should this result be incorporated in priority setting?

" Language: the article could benefit from a review by a native English speaker as there are some typos, e.g. "A Germany study" should be "a study in Germany"

Reviewer #2: This manuscript describes an original research, designed to investigate whether society and patients have the same interests and views concerning healthcare priorities.

Both the study process and the rationale behind each detail is very well described and explained in detail. The method chosen by the researchers, in which hypothetical scenarios were presented to the participants graphically with written explanations and notes is remarkable.

Overall, the article is very interesting and written extremely well.

A few points should be noted regarding the criteria and results:

1. The meaning of the notion "initial HRQoL" in the study is not clear. On p. 5 of the manuscript, the authors explain that the criterion of initial HRQoL had been chosen to reflect burden of disease. However, from the explanation in S1 Appendix, it is understood that this notion relates to the patient's state before the acute onset, while the burden of disease can be reflected only through the gap between the patient's state before and after the acute onset?! It may be assumed that the preference of patients with higher initial HRQoL was affected by this failure.

2. No wonder that the outcomes of the standard treatment had no significant effect; the relevant effect for the decision to choose a new treatment is the difference between the gains of the new treatment and the standard one. So, the outcomes of the standard treatment would be expected to have an effect on the participants' choice only as long as they are significantly worse than the outcomes of the new treatment. This argument is somewhat reflected in the results, through the opposite correlation between the gains of the standard treatment and the effect related to it by both groups of participants.

Reviewer #3: Review Comments to the Author

Major Comments

My main quibble with this paper is the way that the scenarios and choice tasks are presented in the paper almost incidentally – i.e. without proper justification (for readers), explanation of how the approach works (given it is novel) and consideration of how participants found using the approach.

To me at least, the graphical, symbolic presentation of the scenarios is quite unorthodox – and potentially useful … How was this new approach developed? What theoretical and practical considerations and trade-offs were confronted in order to settle on the final design? How did participants find using the approach? Did any of them experience difficulties? Did others like the approach? Overall, is the method valid and reliable? What improvements (refinements) might be made for future uses of the method? (Should the method be used again?)

The best we are told in terms of justification for the approach is: “Instead of conventional descriptions written in text, the scenarios consisted of graphical representations, which are presumably easier to comprehend and may reduce certain types of bias [27]. To reduce possible framing bias by using graphs and icons to describe hypothetical states, we added written explanations and notes to support the graphical presentation. Such a combination was used because previous research suggested that graphical representation helps the respondent to understand the task better, while written explanations facilitate judgment [64].”

“PRESUMABLY easier to comprehend and MAY reduce certain types of bias”?! Based on just one reference? And where is reference 64?

Are there any drawbacks of using graphs and icons? (It seems obvious to me that there are!) Are there any concerns about participants’ interpretations (surely, meanings and interpretations of images/icons is a notoriously tricky area)? We are not told how you settled on the designs you used … Did you test alternative presentations?

In a nutshell, my concerns in this regard are twofold: (1) Does your graphical and icon-based approach work for participants (such that their preference data are valid and reliable)?; and (2) Will readers be able to understand the approach as you have explained it in the paper?

With respect to question #1, I could find little evidence – reassurance – in the paper that participants understood the approach. (Even if some/many of them did not, that would still be an interesting results, especially what you concluded from such a finding.) The survey did not seem to ask participants what I would have thought are two obvious questions: “How easy / difficult did you find answering the choice questions?” “How confident are you in your answers?” The answers to these questions would be interesting. They might also shed some light on your results with respect to the relative importance of the attributes.

There seems to be no ‘data quality controls’ at all – such as the time taken to answer the questions, consistency checks (repeated questions; or the inclusion of dominated ones), counts of participants clicking the same answer (left or right patient) every time. Overall, how do you know that you have ‘good quality’ data? (Does it matter?)

When I looked at Figure 1, I was initially bamboozled as to what I was seeing – and so I would not have been able to express a valid choice between the two patients. (Are the images in the software dynamic? On page 7 we are told that “three steps” were involved. Maybe the approach is obvious when working on a computer screen?) After re-reading section 2.2 and 2.3 several times and puzzling over Figure 1, I finally got it! (Phew!) However, I have to say that, when I finally understood what I was being asked, I would have no confidence in my answers to questions like that (but perhaps I am unusually stupid – it’s possible :) ).

(Is it possible that requiring participants to, first, notice and understand the six criteria represented in Figure 1, and then cognitively process them and the associated trade-offs, is asking too much of most people? Might the cognitive burdenbe too much? Apart from the eight pilot-testers, how would you know?)

In passing, in the Discussion, you muse, “However, it can be hypothesized that the graphical representation might influence the respondents to some extent. For example, the design might affect the decisions of respondents who focus on the sizes of graph areas differently than respondents who focus more on instructions.” But that seems to be it!

With regards to question #2, you need to explain to your readers how your approach works, in particular, how to interpret Figure 1. (To be honest, I am a bit surprised you have not received this feedback earlier – even from yourselves as readers – and made improvements.) It seems to me – and you can tell by how much I am discussing it here! – that your approach for eliciting people’s preferences is central to the paper. And so if readers can’t understand the approach, then the paper pretty much fails, in my opinion. (Sorry!)

Other Comments

In the second-last paragraph of page 13 you mention a surprising result: “This is not in line with the findings of our study, which revealed that the patients with higher initial HRQoL were favored over those with lower initial HRQoL.” Any ideas why?

At the top of page 14, you say, “respondents did not prefer innovations targeting people with ‘drug addiction’ and ‘obesity’”. Do you mean this? Or instead do you mean, “respondents preferred that innovations did not target people with ‘drug addiction’ and ‘obesity”? (I don’t think these two statements are the same.)

Typos and Stylistic Errors

P. 2: “collected from general population” should be “collected from the general population”.

P. 2: “Preferences were strongly and significantly affected by …” is too vague. Do you mean “the relative importance of the attributes …”?

P. 2: “in the assessment of prioritizing new medical interventions” does not read well.

P. 3: “which consist of direct comparison of health care interventions to determine which work best for which patients and which pose” should be “which consists of direct comparison of health care interventions to determine which works best for which patients and which poses”.

P. 3: “value judgements into their consideration” should be “value judgements in their consideration”.

P. 4: “Summarizing” would be better as “In summary”.

P. 4: Add “for the Netherlands” to the end of the phrase, “Summarizing [In summary], a clear set of criteria to guide priority setting is lacking”.

P. 5: “among two (paired)” is inelegant.

P. 10: “Of the total amount of respondents” is also inelegant. How about just, “Of the respondents”?

P. 11: Perhaps “the importance of the criteria” should be “the relative importance of the criteria”? (And elsewhere.)

P. 13: “gained life years” should be “life years gained”.

P. 14: “These finding triggers” should probably be “These findings trigger”.

P. 14: This sentence does not make sense: “In the present study we assumed that a benefit due to the relative easiness and attractiveness of the graphical format and improving it by visual aids, such as notes and balloons, popping up as additional help.”

Best wishes

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Rosanne Janssens

Reviewer #2: Yes: Ofra G. Golan, LLD

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Jul 9;15(7):e0235666. doi: 10.1371/journal.pone.0235666.r002

Author response to Decision Letter 0


30 Apr 2020

We would like to thank you for your thorough review of our article and for pointing out areas that required improvement. We have considered all of your comments and are providing our responses in the documents attached (Response to Reviewers)

Attachment

Submitted filename: 46414b_corr_Response letter to Reviewer 3.docx

Decision Letter 1

Jan Ostermann

22 Jun 2020

VALUE JUDGMENT OF NEW MEDICAL TREATMENTS:

SOCIETAL AND PATIENT PERSPECTIVES TO INFORM PRIORITY SETTING IN THE NETHERLANDS

PONE-D-19-29090R1

Dear Dr. Nicolet,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Jan Ostermann

Academic Editor

PLOS ONE

Additional Editor Comments:

Congratulations. Please consider making the minor change recommended by Reviewer 1.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: (No Response)

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: I Don't Know

Reviewer #3: I Don't Know

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Congratulations on this revised version and thank you for implementing my comments. Great article. Your findings and conclusions now come out more clearly and you carefully acknowledge the limitations. The conclusion is interesting and important for both policy makers and researchers. Just one minor additional comment: in the abstract methods it is still mentioned "discrete choice model"; consider changing to: "We applied a discrete choice experiment in which respondents were presented with judgmental tasks to elicit their preferences."

Reviewer #2: (No Response)

Reviewer #3: Your manuscript is greatly improved, including the quality of your writing. The paper reads well. Well done.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Rosanne Janssens

Reviewer #2: Yes: Ofra G. Golan, LLD

Reviewer #3: Yes: Paul Hansen

Acceptance letter

Jan Ostermann

26 Jun 2020

PONE-D-19-29090R1

VALUE JUDGMENT OF NEW MEDICAL TREATMENTS: SOCIETAL AND PATIENT PERSPECTIVES TO INFORM PRIORITY SETTING IN THE NETHERLANDS

Dear Dr. Nicolet:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Jan Ostermann

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Graphical explanation.

    Explanation of various options in the scenarios: (A) Possible health states before onset; (B) Example of new treatment for 25-year-old patient with 0.9 HRQoL; (C) Standard treatment for 25-year-old patient with 0.9 HRQoL (if accessible and exist).

    (TIF)

    S1 Table. Parameter estimates (mixed logit) of the 6 criteria for the two sub-samples (based on completed surveys).

    (DOCX)

    S2 Table. Parameter estimates of the 6 criteria with interaction terms (sample type × criteria) for the combined sample of general population and patients.

    (DOCX)

    S1 Appendix

    (DOCX)

    S2 Appendix

    (PDF)

    Attachment

    Submitted filename: 46414b_corr_Response letter to Reviewer 3.docx

    Data Availability Statement

    All data (Stata format) is available at the public repository Figshare, under doi 10.6084/m9.figshare.10000556.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES