Abstract
Keywords: MRI, Imaging Sequences, Ultrasound, Mammography, CT, Angiography, Conventional Radiography
Published under a CC BY 4.0 license.
See also the commentary by Whitman and Vining in this issue.
Keywords: MRI, Imaging Sequences, Ultrasound, Mammography, CT, Angiography, Conventional Radiography
Summary
The Scottish Medical Imaging (SMI) Archive is a collection of population-based, medical radiology images from real patient records for use in health care research and the development or validation of artificial intelligence algorithms within the Scottish National Safe Haven. The images are linkable to other routinely collected electronic health care records (such as hospital, prescribing, birth, and death data).
Key Points
■ The Scottish Medical Imaging Archive is a collection of population-based, routinely collected medical radiology images available to researchers for use within the nationwide Scottish safe.
■ This archive provides access to “analytics-ready” extracts for images between January 1, 2010, and August 31, 2018, which can be used for health care research and the development or validation of artificial intelligence algorithms.
■ This archive is fully compliant with the U.K. Data Protection Act and preserves patient confidentiality using pseudonymization techniques.
■ An archive of 57.3 million radiology studies linked to their medical records from the whole Scottish population has been made available for research within a trusted research environment on a cost recovery basis.
■ The end-to-end service to access the data is provided through the electronic Data Research and Innovation Service team (https://www.isdscotland.org/Products-and-Services/eDRIS/Scottish-Medical-Imaging-Service/).
■ An open-source software platform hosts and manages the data, providing the capability to extract relevant images based on complex cohort definitions and to capture research annotations and ground truth.
Introduction
Using clinical images for research and linking them to other routinely collected clinical data is challenging because of the following reasons:
A clinical picture archiving and communication system (PACS) is (quite rightly) designed and optimized for clinical care rather than research. A clinical PACS makes it easy to find all images for a particular patient, but it is not designed to facilitate searching for images with particular characteristics, such as section thickness, scanning protocol, contrast agent, and patient medication.
Reuse of clinical images for research requires de-identification, yet identifiable data can be present in many areas of the associated Digital Imaging and Communications in Medicine (DICOM) file metadata and/or may be present within the pixel data itself and therefore “burned onto” the actual image.
Reuse can require approval from multiple data controllers, and the complexity of de-identification increases the risk of rejection of applications for research given the amount of work the data controller may have to do to ensure that no identifiable data are released.
Ground truth or reference standard data developed by one research project are not easily shared with other research projects because of both the technical and data governance challenges of doing so when data are sensitive.
We have addressed these four challenges by collecting routinely captured radiology images from the Scottish population, linking these data to longitudinal electronic health records, and building a software platform and end-to-end service to support researchers in accessing de-identifiable cohort-specific subsets of the data for research. There is a single data governance application process for accessing the resource (although the data were captured by multiple data custodians), and the software platform can store and share ground truth and annotations from one research project with another (provisional on consent to share).
Materials and Methods
Ethics
Public Health Scotland has a generic ethics approval for a research database. This covers the collation, linkage, and secondary use of data held by Public Health Scotland and analyzed within the secure environment of the National Safe Haven (NSH). No ethical approval was required to build this database. The Health and Social Care Public Benefit and Privacy Panel (PBPP) approved the application requesting data access to bring the data over from the national PACS into a research database to be held in the NSH.
Data Resource Collection
Access to “analytics-ready” extracts of the archive is through the Scottish Medical Imaging (SMI) Service, part of the electronic Data Research and Innovation Service (eDRIS) team (1) within Public Health Scotland.
The SMI Archive holds some 2.47 billion DICOM images in 94.9 million series across 57.3 million studies and essentially mirrors the national PACS data archive for the entire country. It covers 36 imaging modalities, including CT, MRI, PET, structured reports, radiography, and US. These are images collected through routine clinical care across all the 14 National Health Service (NHS) health boards within Scotland, hence providing wide coverage of the Scottish population. The CT, MRI, PET, and structured report modalities (approximately 75% of the studies within the archive) have been curated and are available for researchers to access (Table 1). The remaining modalities are being curated. Detailed descriptions of the SMI fields are available from the SMI Research Dataset (2).
Table 1:
The initial dataset spans from January 1, 2010, to August 31, 2018 (process to update the archive is ongoing). Figure 1 shows the total number of DICOM studies by modality and year. Because the data are available for only a fraction of 2018, the volume of studies for the year is less than the previous year.
There are 2 182 123 female patients (13 167 604 unique study identifiers) and 2 081 040 male patients (11 265 313 unique study identifiers) recorded in the SMI Archive. Because the population of Scotland is approximately 5.5 million, this means that about 1.2 million people have never undergone a radiologic examination (approximately 22% of the population). Patient years of birth range from 1900 to 2020, with a median year of birth of 1960–1970 (aggregated data). Information on patients’ ethnicities is shown in Table 2.
Table 2:
Of patients with scans, 94.2% have associated longitudinal health care records. As might be expected, the percentage of patients who appear in different longitudinal health care records differs depending on the type of record (eg, 16.8% of patients have linked cancer records, 22.3% of patients have linked maternity-related records, and 92.4% of patients have linked hospital outpatient records).
Resulting Dataset
Data Resource Use: Study Dataset Extraction and Cohort Assembly
We have worked for the past 7 years to build a suite of software tools for hosting and managing more than 3 petabytes of data from different sources (both imaging data and longitudinal electronic health records) (3). An end-to-end service to extract researcher-specified, relevant cohorts of data from the large data resources and share these with researchers within a secure environment is provided by eDRIS. Data extracts can be requested for a given study based on DICOM tag data, routinely collected clinical data (eg, prescribing, hospital admissions), the researcher's existing cohort, and/or the results of natural language processing classification of structured reports. Extraction is performed by an eDRIS analyst using tools specially developed by PICTURES (4) to derive the study population from clinical data (5) (eg, age >40 years with any lung cancer diagnosis). The study cohort can be further refined by the analyst (eg, patients with CT chest images showing lung nodules) and can include concepts derived from the processing of any structured reports associated with the images in the study. We are developing support for cohort assembly from pixel data directly (eg, by running an image classification algorithm).
Nind et al (3) provided an architectural description of the open-source software tools (5,6) supporting data management and cohort extraction. In summary, there are three zones within the architecture, as shown in Figure 2.
Data Linkage
SMI image extracts can be linked at an individual level to other health care datasets available within the NSH using the Scottish community health index number, a unique numeric identifier allocated to each patient on first registration with the Scottish health care system (5). Such health care datasets include hospital records (Scottish Morbidity Records [SMR]00 and SMR01), maternal and neonatal records (SMR02), the Scottish cancer registry (SMR06), prescribed and dispensed drugs (6), and mortality records; a full list is available from the Scottish National Data Catalogue (7). Other externally produced datasets from clinical trials, local health boards, disease registries, and nonhealth administrative data can also be linked.
Data Resource Access
Researchers can request pseudonymized, linkable SMI data by contacting the eDRIS team. Each research proposal is assigned an experienced research coordinator, who will guide researchers through the selection of appropriate data items and any limitations applicable to the proposed study, as well as guidance on the completion of the PBPP application process for approval to access NHS Scotland data.
For an application requesting images linked to other data to the PBPP, the mean time from submission to approval is 66 days. When the time taken for applicants to respond to queries from the panel is deducted, the mean approval time is 21 days. Submission to PBPP occurs when the eDRIS research coordinator, based on their knowledge and experience, agrees with the applicant that their application is of a suitable standard.
Once the PBPP approves the research study, data are extracted and provided to researchers through individual project accounts within the NSH, Scotland's national trusted research environment. Researchers can build their own software stack and import this into the NSH to be linked to the data. The NSH is a locked-down environment adhering to the five safe principles of trusted research environments: safe data, projects, people, settings, and outputs (8,9). Access to the internet is controlled, and disclosure controls are applied to any data to be exported.
To preserve patient confidentiality and ensure that all research with SMI linked data are fully compliant with the U.K. Data Protection Act (9), the eDRIS team applies pseudonymization techniques on data extracts for researchers’ use. All personally identifiable information is removed from linked datasets, and patient identifiers are replaced with unique pseudoidentifiers. SMI data are available to researchers affiliated with U.K. public sector organizations, and requests can be made to provide access to those outside the United Kingdom. For more information regarding application and costs, contact eDRIS (1). Details of the metadata can be found at the SMI Research Dataset (2).
The dataset is “open” because, given a successful PBPP application, access to the data will be provided. The PBPP application process is an independent process that reviews whether the project is in the public interest and ensures that patient confidentiality is maintained. There is no requirement to collaborate with the specific research group who developed the platform or to pay for access via a commercial company, as might be expected for a “closed” dataset. If a group wishes to develop a commercial artificial intelligence (AI) product, providing the product at a discounted price to NHS Scotland may be required because the product will have been trained on data from this population.
The dataset has been created through routine health care provision without the consent of the participants for their health data to be shared for research purposes. Because of linkage to longitudinal health care records, in many cases it is not possible to ensure that the images and associated data are fully anonymous. In such cases, it is not possible within the U.K. laws of common duty of confidentiality and the U.K. Data Protection Act (9) to make these data available for direct download; therefore, access is provided within the NSH environment. The NSH environment allows analysis and AI development, but the data itself cannot be exported. Only a specific subset of the dataset is provided to answer the specific research question (eg, foot radiographs are not provided for a study investigating or training on lung MRI examinations). The environment and data are provided on a cost recovery basis (ie, the NHS data controllers do not charge for access to the dataset, but there are associated costs to provide the services needed for the research project).
Example Studies
The resource is live and available for access, and it is being used by many research groups. Current approved studies include the development and validation of the CT clock tool for estimating the time of ischemic stroke onset (10), whole-population automated reading of brain imaging reports in linked electronic health records (11), making retinal images from chain retail optometrists’ research ready and linkable to other data with the Scottish Collaborative Optometry–Ophthalmology Network E-research (SCONe) (12), and linking brain imaging to the cohort of professional rugby and football players (Football's InfluencE on Lifelong Health and Dementia risk: Late Outcomes and NeuroradioloGy) (13). Chest radiographs, MR images, and CT scans of hospitalized COVID-positive and COVID-negative patients were provisioned to the U.K. National COVID-19 Chest Image Database (14).
Discussion
The SMI data resource is a population-scale, heterogeneous dataset of radiology images linked to longitudinal health care records. An end-to-end service helps researchers access subsets of the data relevant to their particular study within a trusted research environment.
There are many strengths of this resource:
The Scottish population is a relatively stable population with comparatively little immigration or emigration (the non-UK born population is 9.7%) (15).
The population is relatively unhealthy compared to other Western European countries, for example, there are no Western European countries whose females have a lower life expectancy (16).
The NHS provides excellent longitudinal health care records (linked by the community health index for decades). Images have been collected from different health boards using different scanner types and processes, thus providing highly heterogeneous real-world data, well suited for training robust, generalizable AI algorithms.
As the images are population-wide, the bias in the data is minimal, as the number of scans is proportional to the scans requested to provision routine health care.
The ability to link the clinical imaging data relatively easily to other datasets is highly advantageous.
There are also several limitations of this data resource:
The Scottish population is not ethnically diverse (approximately 96% White) (16).
AI models might accidentally encode personal data, which may later be susceptible to external hacking (eg, reverse engineering training data). We are researching effective disclosure control methods for AI models (17) to support extraction from the NSH, but this process is not yet implemented in production.
At present, no automated methods prevent three-dimensional reconstruction of a face from images of the head (governance rules prohibit this), nor are there restrictions on the extraction of images of people who have unique conditions (or uniquely identifying features like tattoos) that could allow reidentification. Cases of this nature still require substantial manual inspection by eDRIS analysts before such data can be released.
Stratifying images by sequence type (eg, T1-weighted images, T2-weighted images) and/or body part is a challenge with relying solely on clinically gathered DICOM tag data. We are developing methods to improve this (eg, algorithmic cross-referencing with pixel and/or structured report data).
We are working to enhance the resource with future developments including the following:
Additional years of data: At the time of publication, the SMI Archive is a single snapshot of all radiology images taken in Scotland between January 1, 2010, and August 31, 2018. However, we are bringing the archive up to date and setting up a regular feed to update the resource from the Scottish national clinical PACS.
Additional modalities of data: As researcher demand dictates and resource allows, we will process and make available additional DICOM modalities. The first addition will cover the radiography modalities (computed radiography, digital radiography, and panoramic radiography).
Additional DICOM tags: As researcher demand dictates and resources allow, we intend to make additional DICOM tags available as variables that can be selected as part of research cohort construction. The review process for tag promotion will consider aspects such as overall demand, future use and benefits for PHS, and future research requests.
Annotation and enrichment of data: Research and development work within the PICTURES program is exploring capturing annotations and ground truth from researchers. We will use this feedback to augment and enrich the archive, either as additional information for cohort creation or as additional derived data variables that can be offered to future researchers, so the annotations available for the resource increase over time. In addition, several ongoing projects are focused on classifying image subsets (eg, body part, scan type, compression detection).
In conclusion, the SMI data resource is available for international researchers and innovators to perform health care research and for the development or validation of AI algorithms. We welcome the opportunity to collaborate to enhance the capability of the resource and will respond to new user requests.
The Scottish Medical Imaging (SMI) Archive is a single snapshot of all radiology images taken in Scotland From the central National Picture Archiving and Communication System which is publicly funded by the National Health Service in Scotland. SMI itself is hosted within the Scottish National Safe Haven, hosted at and operated by The University of Edinburgh for Public Health Scotland. The development and operation of SMI are a collaboration between electronic Data Research and Innovation Service (eDRIS), EPCC at the University of Edinburgh, and the Health Informatics Centre (HIC) at the University of Dundee. This work was supported by the Medical Research Council (MRC) grant no. MR/M501633/1 and the Wellcome Trust grant no. WT086113 through the Scottish Health Informatics Programme (SHIP); the PICTURES 5-year program grant From the MRC and Engineering and Physical Sciences Research Council (EPSRC) (grant no. MR/S010351/1); the Chief Scientist Office of the Scottish Government Health and Social Care Directorates via a leverage grant made to Farr Scotland; the Scottish government through the “Imaging AI” grant award; Health Data Research (HDR) UK, which receives its funding from HDR UK Ltd (HDR-5012) funded by the UK MRC, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation (BHF), and the Wellcome Trust; and the Edinburgh and South-East Scotland City Region Deal; the University of Edinburgh; and UK charities.
Disclosures of conflicts of interest: R.B. The Medical Research Council (MRC) grant no. MR/M501633/1 and the Wellcome Trust grant no. WT086113 through the Scottish Health Informatics Programme (SHIP); the PICTURES 5-year program grant from the MRC and EPSRC (grant no. MR/S010351/1); the Chief Scientist Office of the Scottish Government Health and Social Care Directorates via a leverage grant made to Farr Scotland; The Scottish Government through the “Imaging AI” grant award; Health Data Research (HDR) UK, which receives its funding from HDR UK Ltd (HDR-5012) funded by the UK MRC, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation (BHF), and the Wellcome Trust; The Edinburgh and South-East Scotland City Region Deal, the University of Edinburgh, and UK charities. T.N. No relevant relationships. J.S. The Medical Research Council (MRC) grant no. MR/M501633/1 and the Wellcome Trust grant no. WT086113 through the Scottish Health Informatics Programme (SHIP); the PICTURES 5-year programme grant from the MRC and EPSRC (grant no. MR/S010351/1); Chief Scientist Office of the Scottish Government Health and Social Care Directorates via a leverage grant made to Farr Scotland; the Scottish Government through the “Imaging AI” grant award; Health Data Research (HDR) UK, which receives its funding from HDR UK Ltd (HDR-5012) funded by the UK MRC, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation (BHF), and the Wellcome Trust; the Edinburgh and South-East Scotland City Region Deal, the University of Edinburgh, and UK charities. (Grants received by author's employer, the University of Dundee, and/or partners the University of Edinburgh). G.M. No relevant relationships. D.H. Support from University of Dundee. A.H. No relevant relationships. R.M. The PICTURES 5-year program grant from the MRC and EPSRC (grant no. MR/S010351/1) to institution. J.C. No relevant relationships. S.K. UK Research and Innovation, Medical Research Council (MRC) grant number MR/S010351/1 - under which The Scottish Medical Imaging Archive was established (the topic of this manuscript). L.T. No relevant relationships. R.T. No relevant relationships. K.G. No relevant relationships. D.S. No relevant relationships. I.B. No relevant relationships. A.B. University of Edinburgh (employer). B.P. The Medical Research Council (MRC) grant no. MR/M501633/1. W.K. No relevant relationships. D.S.M. Employee of University of Edinburgh EPCC at time of writing. J.F.R.H. The Farr Institute (Scotland). E.J.R.v.B. No relevant relationships. P.S.R. No relevant relationships. S.R. No relevant relationships. E.M.B. No relevant relationships. R.M. Salary from University of Dundee Public Health Scotland, support from University of Dundee for travel. D.S. MRC (MR/S010351/1), £4.4M Program Grant, ‘MICA: Interdisciplinary Collaboration for efficient and effective use of clinical images in big heath care research: PICTURES’. A.D. No relevant relationships. E.T. No relevant relationships. C.M. No relevant relationships. R.W. No relevant relationships. A.M. No relevant relationships. M.P. No relevant relationships. E.J. MRC, EPSRC, CSO, Aidence (company - leveraged), Scottish Government – Programme for Government: supporting Artificial Intelligence (AI) research using clinical images at scale. 2019-2020. Funder: Scottish Government – Programme for Government. PI. Total award: £80,000; interdisciplinary Collaboration for efficient and effective Use of clinical images in big data health care Research: PICTURES. 2019-2024. Funder: MRC and EPSRC Programme Grant. PI. Total award: £2.9M + £900k leveraged funds from industry; Scottish Research Images Repository. Informatics research to develop a software platform for processing, anonymizing, and provision in a Safe Haven environment all of the routinely collected images stored in the National PACS system. 2013–2018. Funder: MRC (The Scottish eHealth Informatics Research Centre (E-HIRCs)/Farr) & CSO (several separate grants). Responsible for delivery rather than Co-I as obtained before author's academic contract. Award: ~£540k; NHS Education for Scotland, Health Foundation, SODOH, CSO, MRC, GSK, NHS Fife Endowment Fund, HDR UK - Research Councils, EU, DataLab, Health Foundation, NIHR, EPSRC, EHDEN, NHSX, Tenovus: Guidelines and Resources for AI Model Access from TrusTEd Research environments (GRAIMATTER). 2022-2022 (sprint). Funder: MRC – DARE. PI (£315,488); Trusted Research Environment and Enclave for Hosting Open Original Science Exploration (TREEHOOSE). 2022-2022 (sprint). Funder: MRC – DARE. Co-I (£202,664); Alleviate: Hub for Pain (Pain Research Data Hub – UKRI and Versus Arthritis Strategic Priority Fund (SPF) Advanced Pain Discovery Platform). 2021-2024. Funder: MRC. Co-PI. Total Award: £2,000,000; CO-CONNECT: COVID - Curated and open analysis and research platform. 2020-2022. Funder: MRC. Co-PI. Total Award: £4,091,229; Linking and Mapping Primary Care Data for Studying COVID-19 in the Community (COVID-19 call). 2020-2021. Funder: Tenovus. Co-PI. Awarded amount: £20,000; National COVID-19 Chest Imaging Database (NCCID). 2020-2021. Funder: NHSX. Collaborator. Award to Dundee £30,000; World Class Labs (Multiomics) Capital Award. 2020-2021. Funder: HDR UK. PI. Total award: £400,000; creating a national platform for powerful molecular studies of multiple conditions: the HDRUK multiomics consortium. 2020-2022. Funder: MRC/HDR UK. Co-I and Workstream lead (Responsible for £73,860). Total Award: £1,088,605; The Safe Haven Metadata (SHEMA) Project. 2020-2021. Funder: Chief Scientist Office (CSO). PI. Total award: £20,000 (initial phase, next phase £100k); data standardization using the OMOP common data model. 2020-2022. Funder: European Health Data & Evidence Network (EHDEN). Co-PI. Total award: £86,489; Cambridge Mathematics of Information in Healthcare (CMIH). 2020-2023. Funder: EPSRC. Co-I (Responsible for £56,000). Total Award: £1,275,504; Centre for Antimicrobial Resistance. 2019-2022. Funder: NIHR. Co-I and Workstream lead (Responsible for £260,369). Total Award: £2,253,124; Primary Care – Improved Data for Improved Outcomes. 2019-2021. Funder: Health Foundation. PI (Responsible for £69,336). Total Award: £190,009; Building the Knowledge Graph for UK Healthcare Data Science. 2019-2020. Funder: HDR UK. Co-I (Responsible for £64,449). Total award: £272,657; Multimorbidity and clinical guidelines: using epidemiology to quantify the applicability of trial evidence to inform guideline development. 2019-2020. Funder: Chief Scientist Office (CSO) Scotland. Co-I and Workstream lead (Responsible for £143,218). Total award: £356,667; Learning healthcare system for stroke precision medicine pathway (P4Me). 2019-2022. Funder: DataLab. Co-I (Responsible for £79,933). Total award: £94,690; Integration of Knowledge and Biobank Resources in Comprehensive Translational Approach for Personalized Prevention and Treatment of Metabolic Disorders (INTEGROMED). 2019-2021. Funder: EU. Co-I (Responsible for £9,257). Total award: £124,778; Graph Based Data Federation for Healthcare Data Science. 2019-2020. Funder: HDR UK. Co-I and workstream lead (Responsible for (£76,000)). Total Award: £272,657; Defining & Redefining Disease Using Multimodal Data on a National Scale: the HDR UK Phenomics Resource. 2019-2022. Funder: MRC/HDR UK. Co-I and Workstream lead (Responsible for £119,466). Total Award: £1,087,168; Health Data Research UK Scottish Substantive Site. 2018-2023. Funder: Research Councils. Co-I (Responsible for £366,000). Total award: £5,151,421; Enabling Learning NHS Care Systems utilising Electronic Medical Records (ELectra) in Fife. Innovative and opportunities to support NHS Fife Clinical Strategy. 2018-2019. Funder: NHS Fife Endowment Fund. Co-I and Workstream lead (Responsible for: £91,505). Total award: £174,636; Omic-Based Strategies for Improved Diagnosis and Treatment of Endocrine Hypertension (ENSAT-HT). 2015-2022. Funder: EU-H2020 Societal Challenges SC1 - Health and Wellbeing. Co-I and Workstream lead (Responsible for: £857,423). Total award: £7,418,982; Utilising Routinely Collected Electronic Medical Records to Predict Dementia. 2015. Funder: CSO. PI. Award: £36,877; Discovery of Novel Disease Mechanisms through Advanced Biomedical Informatics. 2015-2019. Funder: MRC Case PhD Studentship in collaboration with GSK. PI. Award: £127,774; Scottish Improvement Science Research, Development and Knowledge Translation Collaborating Centre (SISCC). 2014-2021. Funders: NHS Education for Scotland, Health Foundation, SODOH-CSO. Co-I (Responsible for £600,000). Award: £2,764,551; consulting fees from Wellcome Trust, Pain Data Prize (grant scoping, £3000); payment/honoraria from Pfizer for lunch and learn (£1125 Overcoming the Challenges of Providing Access to Population Scale, Routinely Collected Health and Imaging Data for AI Development whilst Protecting Patient Confidentiality); support for travel for MRC panels (quarterly meeting for two panels - £80 per meeting).
Abbreviations:
- AI
- artificial intelligence
- DICOM
- Digital Imaging and Communications in Medicine
- eDRIS
- electronic Data Research and Innovation Service
- NHS
- National Health Service
- NSH
- Scottish National Safe Haven
- PACS
- picture archiving and communication system
- PBPP
- National Health Service Scotland Public Benefit and Privacy Panel
- SMI
- Scottish Medical Imaging
References
- 1. ISD Services . Electronic Data Research and Innovation Service (eDRIS) . ISD Scotland . https://www.isdscotland.org/Products-and-Services/eDRIS/. Accessed November 9, 2022.
- 2. Scottish Medical Imaging (SMI) Research Dataset . https://web.www.healthdatagateway.org/dataset/1c49a822-6432-468b-8ba5-6aab534654b9. Accessed November 9, 2022.
- 3. Nind T , Sutherland J , McAllister G , et al . An extensible big data software architecture managing a research resource of real-world clinical radiology data linked to other health data from the whole Scottish population . Gigascience 2020. ; 9 ( 10 ): giaa095 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. PICTURES . Supporting the use of data for health care research - Image on a Mission. Pictures . https://www.imageonamission.ac.uk/. Accessed November 9, 2022.
- 5. Research Data Management Platform . HicServices . https://github.com/HicServices/RDMP. Published 2023. Accessed May 9, 2023.
- 6. The Scottish Medical Imaging Project (SMI) . GitHub . https://github.com/SMI. Accessed November 9, 2022.
- 7. ISD Services . National Data Catalogue . ISD Scotland . https://www.ndc.scot.nhs.uk/. Accessed November 9, 2022.
- 8. UK Health Data Research Alliance . NHSX . Building Trusted Research Environments - Principles and Best Practices; Towards TRE ecosystems . Zenodo; . Published December 8, 2021. Accessed May 21, 2022. [Google Scholar]
- 9. Scottish Government . Charter for Safe Havens in Scotland: Handling Unconsented Data from National Health Service Patient Records to Support Research and Statistics . http://www.gov.scot/publications/charter-safe-havens-scotland-handling-unconsented-data-national-health-service-patient-records-support-research-statistics/. Accessed May 21, 2022.
- 10. Mair G , Alzahrani A , Lindley RI , Sandercock PAG , Wardlaw JM . Feasibility and diagnostic accuracy of using brain attenuation changes on CT to estimate time of ischemic stroke onset . Neuroradiology 2021. ; 63 ( 6 ): 869 – 878 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Whole population automated reading of brain imaging reports in linked electronic health records (WARBLER) . The University of Edinburgh . https://www.ed.ac.uk/usher/clinical-natural-language-processing/our-research/whole-population-automated-reading-brain-imaging. Published 2020. Accessed May 18, 2023.
- 12. SCONe . The University of Edinburgh . https://www.ed.ac.uk/clinical-sciences/ophthalmology/scone. Published 2023. Accessed May 18, 2023.
- 13. Russell ER , Stewart K , Mackay DF , MacLean J , Pell JP , Stewart W . Football's InfluencE on Lifelong health and Dementia risk (FIELD): protocol for a retrospective cohort study of former professional footballers . BMJ Open 2019. ; 9 ( 5 ): e028654 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. National COVID-19 Chest Imaging Database (NCCID) . NHS Transformation Directorate . https://transform.england.nhs.uk/covid-19-response/data-and-covid-19/national-covid-19-chest-imaging-database-nccid/. Accessed May 18, 2023.
- 15. Population by Country of Birth and Nationality. Scotland, July 2020 to June 2021 . National Records of Scotland . National Records of Scotland . https://www.nrscotland.gov.uk/statistics-and-data/statistics/statistics-by-theme/population/population-estimates/population-by-country-of-birth-and-nationality/jul-20-jun-21. Published 2022. Accessed November 9, 2022. [Google Scholar]
- 16. Tackling Health Inequalities – an NHS Response . https://www.sehd.scot.nhs.uk/nationalframework/documents/tackling%20healthinequalities240505.pdf. Accessed November 24, 2022.
- 17. Jefferson E , Liley J , Malone M , et al . GRAIMATTER Green Paper: Recommendations for disclosure control of trained Machine Learning (ML) models from Trusted Research Environments (TREs) . Zenodo . Published September 21, 2022. Accessed November 9, 2022. [Google Scholar]