Skip to main content
Data in Brief logoLink to Data in Brief
. 2018 Dec 7;22:204–206. doi: 10.1016/j.dib.2018.12.009

Korean public and hospital data for estimating LDL-cholesterol

Taesic Lee a, Juwon Kim b, Young Uh b,, Hyunju Lee a,c,⁎⁎
PMCID: PMC6301972  PMID: 30581927

Abstract

The data is composed of 14,812 subjects, and was obtained from the Korean National Health and Nutritional Examination Survey (KNHANES) from 2009 to 2015. The KNHANES data is publicly available at https://knhanes.cdc.go.kr/knhanes/eng/index.do. By using random annotated ID, all subjects can be anonymous.

The dataset consisting of 4520 participants was obtained from Wonju Severance Christian Hospital (WSCH) in South Korea. To protect the patient׳s personal information, we removed ID and gender of patients. Detail information of the data presented in the present article is available in the research article “Deep neural network for estimating low density lipoprotein cholesterol” (Lee et al., 2018).


Specifications table

Subject area Biology
More specific subject area Medical informatics, Lipids, Lipoproteins
Type of data Online data
How data was acquired KNHANES (Publicly available)
– Total cholesterol, triglyceride, HDL-C, and LDL-C levels were measured by the Hitachi analyzer 7600 (Hitachi, Tokyo, Japan)
WSCH (extracting from hospital database)
– The total cholesterol, triglyceride, HDL-C, and LDL-C for the testing dataset were analyzed using the modular DPE system (Roche Diagnostics, Basel, Switzerland). The lipoprotein subfraction test was performed using the Lipoprint LDL System (Quantimetrix, Redondo Beach, CA, USA).
Data format Raw
Experimental factors Public data including 14,812 subjects with information about age, gender, and levels of lipid profiles.
Korea hospital data including 4520 patients with information about levels of lipid profiles.
Experimental features Medians of total cholesterol, HDL cholesterol, and triglyceride in KNHANES were 186, 47, 120, respectively. Medians of total cholesterol, HDL cholesterol, and triglyceride in Korea hospital data were 165, 37, 103, respectively.
Data source location KNHANES (Seoul, South Korea)
WSCH (Wonju, South Korea)
Data accessibility The KNHANES data is publicly available at https://knhanes.cdc.go.kr/knhanes/eng/index.do.
Related research article Taesic Lee, Juwon Kim, Young Uh, Hyunju Lee, Deep neural network for estimating low density lipoprotein cholesterol (Clin Chim Acta; in press) [1].

Value of the data

  • Representative for Korea: The KNHANES study was carried out by Korean Centers for Disease Control and Prevention (K-CDC), which is nation-wide and population-based data. Participants are selected by using cluster-based random sampling according to demographic features to make the KNHANES representative of Korea.

  • Clinical application: After constructing the final model using the KNHANES, the model was applied to the local hospital dataset. The present framework suggests that it is immediately applicable to a clinical setting.

  • Potential to construct more accurate model: The data has potential to be used for constructing more accurate and feasible LDL cholesterol estimating model

1. Data

Subjects with lipid profiles in 2009–15 of KNHANES and in 2008–13 of a Korean hospital were included in the healthy person-based and the patient-based data, respectively. Note that LDL cholesterol of both data was directly measured. The KNHANES data is publicly available at https://knhanes.cdc.go.kr/knhanes/eng/index.do.

2. Experimental design, materials and methods

We obtained a dataset of 14,812 anonymized laboratory test records from 2009–15 KNHANES. Subjects with any missing results of total cholesterol, triglyceride, HDL cholesterol, and LDL cholesterol were excluded. Participants with dyslipidemia medication or higher triglyceride concentration of more than 400 mg/dL [2] were not excluded. All lipid profiles were tested on fasting state more than 12 h.

For the Korean hospital data, lipoprotein subfraction test results were obtained from September 2008 to March 2013 recorded at WSCH. All subjects fasted overnight for 12 h before blood collection, and the levels of four lipid profiles were measured on the day of blood collection. If a patient record contained multiple test results of the collection during the study period, only the first result was chosen in the dataset. The total cholesterol, triglyceride, HDL-C, and LDL-C for the Korea hospital dataset were analyzed using the modular DPE system (Roche Diagnostics, Basel, Switzerland) [1]. The lipoprotein subfraction test was performed by using the Lipoprint LDL System (Quantimetrix, Redondo Beach, CA, USA) [1]. The Lipoprint system utilizes polyacrylamide gel electrophoresis to separate the various lipoprotein subfractions on the basis of particle size as VLDL band, three mid-bands [MID-C = VLDL remnants, MID-B = large intermediate density lipoprotein (IDL), and MID-A = small IDL], and seven LDL bands and an HDL band. LDL-1 and -2 are defined as large LDL subfractions whereas LDL-3, 4, 5, 6, and -7 are defined as small LDL subfractions [1], [3], [4].

Acknowledgements

This research was supported by a National Research Foundation of South Korea (NRF) grant funded by Ministry of Science and Information and Communication Technologies (ICT) (NRF-2016R1A2B2013855) and (NRF-2018M3C7A1054932).

Footnotes

Transparency document

Transparency data associated with this article can be found in the online version at https://doi.org/10.1016/j.dib.2018.12.009.

Contributor Information

Taesic Lee, Email: ddasic123@gist.ac.kr.

Juwon Kim, Email: juwon76@yonsei.ac.kr.

Young Uh, Email: u931018@yonsei.ac.kr.

Hyunju Lee, Email: hyunjulee@gist.ac.kr.

Transparency document. Supplementary material

Supplementary material.

mmc1.docx (13.2KB, docx)

References

  • 1.Lee T.S., Kim J.W., Uh Y., Lee H.J. Deep neural network for estimating low density lipoprotein cholesterol. Clin. Chim. Acta. 2018 doi: 10.1016/j.cca.2018.11.022. [DOI] [PubMed] [Google Scholar]
  • 2.Martin S.S., Blaha M.J., Elshazly M.B., Toth P.P., Kwiterovich P.O., Blumenthal R.S. Comparison of a novel method vs the Friedewald equation for estimating low-density lipoprotein cholesterol levels from the standard lipid profile. JAMA. 2013;310:2061–2068. doi: 10.1001/jama.2013.280532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Arai H., Kokubo Y., Watanabe M., Sawamura T., Ito Y., Minagawa A. Small dense low-density lipoproteins cholesterol can predict incident cardiovascular disease in an urban Japanese cohort: the Suita study. J. Atheroscler. Thromb. 2013;20:195–203. doi: 10.5551/jat.14936. [DOI] [PubMed] [Google Scholar]
  • 4.Hoogeveen R.C., Gaubatz J.W., Sun W., Dodge R.C., Crosby J.R., Jiang J. Small dense low-density lipoprotein-cholesterol concentrations predict risk for coronary heart disease: the Atherosclerosis Risk In Communities (ARIC) study. Arterioscler. Thromb. Vasc. Biol. 2014;34:1069–1077. doi: 10.1161/ATVBAHA.114.303284. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material.

mmc1.docx (13.2KB, docx)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES