Abstract
The research process started in the first week of August 10, 2014 and ended early February 2015, obtaining qualified data for 330 patients from many hospitals in northern Vietnam. Its expansion was performed for an enlarged dataset through May 2015, containing 900 records. This article exemplifies the attempt to examine the likelihood of destitution among Vietnamese patients due to insufficient insurance coverage, cost of treatment and patient׳s status of residency during a curative hospital stay. The result suggests that the patients, who are poor and come from rural areas, face serious obstacles in accessing health care services. This data article presents attributes and values of the data set used in the article provided at DOI: http://dx.doi.org/10.1186/s40064-015-1279-x Vuong (2015) [4].
Specifications Table
Subject area | Medical |
More specific subject area | Patient׳s financial burdens occurring during a curative hospital stay |
Type of data | Table, graph, figure |
How data was acquired | Survey |
Data format | Raw, filtered, and partially analyzed |
Experimental factors | Raw data obtained from a survey patients at many hospitals in Northern Vietnam |
Experimental features | The experiment focuses on examining financial issues, illness, insurance, end result of treatment, health care costs, length of stay, ‘envelope OOP’ and the probability of post-treatment destitution, for different groups of patients |
Data source locations | Viet Duc, Bach Mai, Vietnam-Japan, Hai Duong Polyclinic, Thai Binh Polyclinic Hospitals, Vietnam (and others) |
Data accessibility | Both original datasets for 330 and 900 survey records are provided with this article, and deposited at DOI:http://doi.org/10.17632/rw28hh58y4.1. |
Value of the data
-
•
The data potentially offer a specific insight on financial destitution due to costs of treatment.
-
•
The data can help reveal the likelihood of patient׳s recovery after treatment.
-
•
The data help to understand the effect of health insurance on reducing the likelihood of falling into destitution.
-
•
The data shed light on the risk of “extra thank-you money” becoming an obligatory part of treatment costs for patients, low-income patients in particular.
-
•
The data set structure can be enriched for deeper analysis providing insights and implications for bettering policy making in health insurance and poverty reduction.
1. Data
The first data set, which will be used for computation examples provided in this article, contains 900 records obtained from a survey on Vietnamese inpatients concerning family status, patient׳s income level, patient׳s extra expenses to doctors and hospital׳s staff, and their loans to finance treatment. Fig. 1 shows the distribution of patients hospitalization lengths.
Continuous (numerical) and discrete (categorical) variables are measured and reported in the survey data set. Table 1 presents numerical variables of the data set. Fig. 1a and b present medical expenditures and average daily costs in relation to health status of patients when hospitalized. They are not significantly different from those provided in [4] (Table 2).
Table 1.
Coded name | Explanation | Unit |
---|---|---|
Spent | Total money spent during his/her stay in hospital in millions of Vietnamese Dong. According to official exchange rates at the time of survey, VND 1 million was equivalent to $47.2. | Million of VND |
Dcost | Average daily cost the patient had to pay during the entire treatment period | Million of VND |
Income | Annual income | Million of VND |
Days | Number of days the patient spent in the hospital | Day |
Pins; Pinc; Pchar; Ploan | Portions of finance from sources: insurance reimbursement, income, charity funds from civil organizations or employers, or borrowings | Percent |
Streat, Srel, Senv | Percentage of funds used for the purpose of main treatments, for covering costs of relatives coming to help the patient, or paying “extra thank-you money” or bribing doctors/staff | Percent |
Table 2.
Coded name | Explanation | Values |
---|---|---|
Res | Whether the patient originally resides in the region where the hospital is located | Yes, no |
Stay | A dummy variable to define the patient׳s hospitalization by length of stay | Less than 10 days (S), 10 days or longer (L) |
Insured | Whether a patient has a valid health insurance | Yes, no |
SES | Socio-economic status | High, medium, low |
Based on average IncRank of working members in the family who are able and willing to support the patient if so required | ||
Illness | Severity of illness or injury when hospitalized | Emergency, bad, ill, light |
IncRank | Rankings of income of a patient | High (>180), middle (48–180), low (<48) |
Excel: | ||
=IF([cell]>180,"Hi",IF([cell]>48,"Mid","Lo")) | ||
Burden | Patient׳s and family׳s self-evaluation of their financial position after paying health care costs | strong, no adverse affect at all (A); affected but not the worrying level (B); seriously affected or destitute/bankrupt (C) |
End | Patient׳s health status after treatment | complete recovery (A), partial recovery, needing post-treatment follow-ups (B), stopped whilst being treated (C), or quit early (D) |
AvgCost | Average daily cost the patient had to pay during treatment period | ≤1.5 (Low), 1.5 to 5.4 (Med), and >5.4 (Hi) |
2. Experimental design, materials and methods
The dataset was constructed using information from questionnaires in order to enable the modeling of baseline-category logits (BCL). In addition, the computing of empirical probabilities upon events of hypothetical influence is performed. The logic for designing the experiment and thus data sets are similar to what is described in [1], for data groups in categories of as multinomial with corresponding sets of probabilities , and with the multinomial probability mass function: The data set has been created to enable an BCL analysis to simultaneously model effects of on logits such that the estimating of equations enables the computing of the remaining logits. Therefore, Pearson-type likelihood ratio test statistics or goodness-of-fit, following a multivariate GLM estimations become appropriate for hypothesis testing. For practically estimating multinomial logistic models, consult with Refs. [2], [3]. Practicality of survey data uses is also provided in Ref. [4].
Some possible questions and hypotheses worth testing of, using the data set analyzed by [4], is in Table 3.
Table 3.
Do the residency status of patients and insurance coverage determine the probability of patients falling into debts? The specific factor of residency status is important in Vietnam because society has for long been skeptical about provincial healthcare, leading patients to travel to large hospitals in major cities such as Hanoi, Hai Phong, or HCMC. Doing so not only necessitates accompanying and caretaking of family members but also entails travel costs and informational asymmetry on drug prices, treatment schedules, the best hospital to visit and even the ‘right amount’ of “extra thank-you money” (a kind of out-of-pocket expense; or OOP). |
As for two most important factors to Vietnamese patients/households, i.e. treatment costs and illness, is there evidence to support this view and if yes, whose influence better explains the possibility of end results of treatment, empirically? |
Can the likelihood of paying too little or too much out-of-pocket “extra thank-you money” be determined by the severity of illness and/or income of patients? This OOP amount may be significant but if a patient appreciates the value of service, he/she would be willing to pay depending on his/her availability of finance, before or after the course of treatment. |
The following short R commands help create the data set provided in the file named “table1.csv”
>med=read.csv(“E:/…/Med2015/Data/P330.csv,header=T) |
>attach(med) |
>table1=xtabs(~Res+Insured+Burden) |
>ftable(table1) |
The data set in file name “table1.csv” presents a distribution of patients following by residency status, insurance participation, and extent to which patients fall into destitution due to financial hardships after treatment. Its modeling using the BCL method is performed by R commands as follows:
> burden1=read.csv(“E:/…/Med2015/Data/table1.csv,header=T) |
> attach(burden1) |
> contrasts(burden1$Res)=contr.treatment(levels(burden1$Res),base=2) |
> contrasts(burden1$Insured)=contr.treatment(levels(burden1$Insured),base=2) |
> fit.burden1=vglm(cbind(C,B,A)~Res+Insured,family=multinomial,data=burden1) |
> summary(fit.burden1) |
The above estimation yields coefficients and associated statistics that are used in [4] for estimating empirical probabilities, using the 330-observation dataset. In addition, Fig. 2 is drawn for the 900-observation dataset.
The probabilistic trends for patient׳s financial burdens are in line with [5]. Examples of R commands for creating a specific dataset and corresponding BCL estimations are provided in Appendixes A and B, Appendixes A and B.
In the same vein, Fig. 3 shows declining trends for becoming destitute if a patients is either resident or insured, or both.
Acknowledgements
We are grateful for logistic and research supports from Hanoi-based Khang Foundation and Vuong & Associates, especially Dam Thu Ha, Nghiem Phu Kien Cuong, Do Thu Hang and Vuong Thu Trang.
Footnotes
Transparency data associated with this article can be found in the online version at doi:10.1016/j.dib.2016.09.040.
Supplementary data associated with this article can be found in the online version at doi:10.1016/j.dib.2016.09.040.
Contributor Information
Quan-Hoang Vuong, Email: hoangvq@fsb.edu.vn.
Trong-Khang Nguyen, Email: khangnt@mkgroup.com.vn.
Appendix A.
R commands help create the data set provided in the file named “table2.csv”
>med=read.csv(“E:/…/P330.csv,header=T) |
>attach(med) |
> table2=xtabs(~AvgCost+Insured+Burden) |
> ftable(table2) |
Appendix B.
R commands using data in “table2.csv” estimated model follow as:
> burden2=read.csv(“E:/…/table2.csv,header=T) |
> attach(burden2) |
> contrasts(burden2$AvgCost)=contr.treatment(levels(burden2$AvgCost),base=2) |
> contrasts(burden2$Insured)=contr.treatment(levels(burden2$Insured),base=2) |
> fit.burden2=vglm(cbind(C,B,A)~AvgCost+Insured,family=multinomial,data=burden2) |
> summary(fit.burden2) |
Transparency document. Supplementary material
.
Appendix C. Supplementary material
.
References
- 1.Vuong Q.H. Data on Vietnamese patients׳ behavior in using information sources, perceived data sufficiency and (non)optimal choice of healthcare provider. Data Brief. 2016;7:1687–1695. doi: 10.1016/j.dib.2016.04.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Agresti A. Categorical Data Analysis. Wiley; Hoboken, NJ: 2002. p. 774. [Google Scholar]
- 3.PennState Science, Analysis of Discrete Data (STAT 504), 〈https://onlinecourses.science.psu.edu/stat504/〉 (accessed 16.10.15).
- 4.Vuong Q.H. SpringerPlus, Springer Nature, Berlin, Germany; 2015. Be Rich or Don׳t be Sick: estimating Vietnamese Patients׳ Risk Falling into Destitution; p. 4. Article #529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.G. Bloom, Primary Health Care Meets the Market: Lessons from China and Vietnam. IDS Working Paper 53, University of Sussex, UK, 1997.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.