MIMIC-SBDH
Dataset containing 7,025 discharge summary notes from the MIMIC III dataset annotated for 7 SBDHs
MIMIC-SBDH is a data set containing 7,025 discharge summary notes randomly selected from the MIMIC III dataset. The notes are annotated for the patient’s status of the following Social and Behavioral Determinants of Health (SBDHs):
- Community
- Education
- Economics
- Environment
- Alcohol Use
- Tobacco Use
- Drug Use.
In addition, we mark SBDH-related keywords are also marked in the notes. The folder contains two files:
MIMIC-SBDH.csv: This file contains the labels for the seven SBDHs:
- Community-Present and Community-Absent (0: False, 1: True)
- Education (0: False, 1: True)
- Economics (0: None, 1: True, 2: False)
- Environment (0: None, 1: True, 2: False)
- Alcohol Use (0: None, 1: Present, 2: Past, 3: Never, 4: Unsure)
- Tobacco Use (0: None, 1: Present, 2: Past, 3: Never, 4: Unsure)
- Drug Use (0: None, 1: Present, 2: Past, 3: Never, 4: Unsure)
MIMIC-SBDH-keywords.csv: This file contains the start and end indices of the SBDH-related keywords.