Skip to main content
. 2021 May 10;8:587768. doi: 10.3389/fcvm.2021.587768

Table 1.

The 15 largest databases found using methodology stated in the Methods section.

Database name Recruitment year Sample size Longitudinal Genome Methylome Transcriptome Metabolome Proteome Phenome Microbiome Intended Speciality Link
Registre Gironí del Cor (REGICOR) 1978 700,000 Y Y Y Y Y Y Y N General https://www.revespcardiol.org/es-regicor-35-years-of-excellence-articulo-S1885585713002739?redirect=true
UK BioBank 2006 500,000 Y Y Y Y Y Y Y Y General https://www.ukbiobank.ac.uk/
Netherlands Twin Registry 2004 240,000 Y Y Y Y Y Y Y Y General http://www.tweelingenregister.org
LifeLines 2006 167,729 Y Y Y Y Y Y Y Y General http://www.lifelines.nl
Nord-Trøndelag Health Study (The HUNT Study) 1984 120,000 Y Y Y Y Y Y N N General https://www.ntnu.edu/hunt/hunt-samples
FINRISK 1972 101,451 Y Y Y Y Y Y Y Y General https://thl.fi/en/web/thlfi-en/research-and-expertwork/population-studies/the-national-finrisk-study
UK Household Longitudinal Study 2009 100,000 Y Y Y Y N N Y N Societal https://www.understandingsociety.ac.uk/
The Tromsø Study 1974 93,287 Y Y N N N Y N N General https://en.uit.no/om/enhet/artikkel?p_document_id=80172&p_dimension_id=88111
100,000 Genomes Project 2012 70,000 Y Y Y Y Y Y Y Y Rare Disease https://www.genomicsengland.co.uk/about-genomics-england/the-100000-genomes-project/
Estonian Biobank of the Estonian Genome Center, University of Tartu 1999 52,000 Y Y Y Y Y Y Y N General http://www.biobank.ee
INTERVAL 2012 50,000 N Y Y N N Y N N Blood Donation https://www.nature.com/articles/s41586-018-0175-2
National Health and Nutrition Examination Survey (NHANES) 1960 31,126 Y Y N N Y Y Y N Nutrition https://www.cdc.gov/nchs/nhanes/index.htm
EPIC-Norfolk Study 1993 30,000 Y Y Y Y Y Y Y N Oncology http://www.mrc-epid.cam.ac.uk/research/studies/epic-norfolk/
Rotterdam Study (Charge) 1990 19,000 Y Y Y Y Y Y Y Y General http://www.epib.nl/research/ergo.htm
Cooperative Health Research in the Region of Augsburg, Southern Germany (KORA) 1984 18,000 Y Y Y Y Y Y Y N General http://epi.helmholtz-muenchen.de/kora-gen/index_e.php
Multiethnic Cohort (MEC) Study 199 3 215,000 Y Y Y Y Y N Y Y Oncology https://www.uhcancercenter.org/mec
The Singapore Multi-Ethnic Cohort (MEC) study 2004 14,465 Y Y Y Y Y N Y Y General https://pubmed.ncbi.nlm.nih.gov/29452397/
NIHR Cambridge BioResource 2005 17,300 Y Y Y Y N N Y N General https://www.sciencedirect.com/science/article/pii/S0092867416314465
Atherosclerosis Risk in Communities Study (ARIC) (CHARGE) 1987 15,792 Y Y Y Y Y Y Y N Cardio http://www.cscc.unc.edu/aric/
Framingham (CHARGE) 1948 15,447 Y Y Y Y Y Y Y Y Cardio https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159698/
UK Adult Twin Registry (TwinsUK) 1992 14,274 Y Y Y Y Y Y Y Y General Paediatric http://www.twinsuk.co.uk/
Avon Longitudinal Study of Parents and Children (ALSPAC) 1991 13,988 Y Y Y Y Y Y Y Y Paediatric https://academic.oup.com/ije/article/42/1/97/694445
Fenland Study 2015 12,435 Y Y Y Y Y N Y N Endocrine http://www.mrc-epid.cam.ac.uk/Research/Studies/Fenland/index.html
Northern Finland Birth Cohort 1966 1966 12,058 Y Y Y N N Y Y Y General https://jmg.bmj.com/content/56/9/607
Pain-OMICS 2013 12,000 Y Y Y Y Y N N N Pain https://cordis.europa.eu/project/rcn/110070/factsheet/en
A Large-Scale Schizophrenia Association Study in Sweden 2005 11,850 Y Y Y Y N N N N Psychiatry https://www.nature.com/articles/ng.2742
Metabolic Syndrome in Men (METSIM) 2005 10,197 Y Y Y Y Y N Y Y General https://academic.oup.com/hmg/article/27/10/1830/4939377#118176243
Global Genomics Group (G3) GLOBAL Study 2012 10,000 Y Y Y Y Y Y Y N General https://www.g3therapeutics.com/
COPDGene 2008 10,000 Y Y Y Y Y Y N N COPD http://www.copdgene.org/
Oxford BioBank 1999 8,000 Y Y Y Y N N Y N General https://www.oxfordbiobank.org.uk/
Ontario Familial Colon Cancer Registry (OFCCR) 1998 7,377 Y Y Y N N N N N Oncology https://www.zanecohencentre.com/gi-cancers/ofccr
Multi-Ethnic Study of Atherosclerosis (MESA) 2000 6,814 Y Y Y Y Y Y Y N Cardio https://www.mesa-nhlbi.org/Publications.aspx
National Institute on Aging (NIA) SardiNIA Study 2001 6,148 Y Y Y Y N Y N Y Geriatric http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000338.v1.p1
Corogene 2006 5,809 Y Y Y Y N N Y N Cardio http://ije.oxfordjournals.org/content/early/2011/06/02/ije.dyr090.extract
Age, Gene/Environment Susceptibility-Reykjavik Study (AGES) 2002 5,764 Y Y Y Y Y Y Y N Geriatric http://www.hjarta.is/english/ages
Cardiovascular Risk in Young Finns Study 1980 4,320 Y Y Y Y Y Y Y N Cardio http://youngfinnsstudy.utu.fi/index.html
Study of Health in Pomerania (SHIP) 1997 4,308 Y Y Y Y Y Y N Y General https://pubmed.ncbi.nlm.nih.gov/22736157/
Environment And Genetics in Lung cancer Etiology (EAGLE) 2002 4,000 Y Y Y N N N N N Oncology https://eagle.cancer.gov/background.html
Accessible Resource For Integrated Genomics (ARIES) 2012 3,948 Y Y Y Y N N Y N General http://www.ariesepigenomics.org.uk/
IMT-Progression as Predictors of Vascular Events in a High-Risk European Population (IMPROVE) 2004 3,711 Y Y N Y N Y N N Cardio https://link.springer.com/article/10.1007%2Fs00125-014-3215-y#Sec2
Subpopulations and Intermediate Outcome Measures in COPD (SPIROMICS) 2010 2,981 Y Y N N N Y N N COPD https://www.spiromics.org/spiromics/
Athero-Express Biobank Studies 2002 2,500 Y Y Y Y Y Y N N Cardio https://www.atheroexpress.nl/
Leiden Longievity Study 2002 2,415 N Y Y Y Y Y Y N Geriatric https://www.nature.com/articles/5201508#Sec2
TRAILS (Tracking Adolescents' Individual Lives Survey) 2000 2,230 Y Y Y N N N Y N Paediatric https://www.trails.nl/en
The Orkney Complex Disease Study (ORCADES) (EUROSPAN) 2005 2,080 Y Y Y Y N N Y N General https://www.ed.ac.uk/viking/about-us/orcades
Helsinki Birth Cohort Study 2001 2,003 Y Y Y N Y N N N Geriatrics http://www.ktl.fi/portal/english/research_people_programs/health_promotion_and_chronic_disease_prevention/units/diabetes_unit/idefix_study/
Lothian Birth Cohort 1921 & 1936 1999 1,641 N Y Y Y Y Y Y N Cognitive Ageing https://www.lothianbirthcohort.ed.ac.uk/content/scottish-mental-survey-1947
Conditions Affecting Neurocognitive Development andLearning in Early Childhood Study (CANDLE) 2006 1,503 Y Y Y Y N N Y Y Neuro-Paediatric https://candlestudy.uthsc.edu/
InCHIANTI 1998 1,453 Y Y Y Y N Y Y N Geriatric http://inchiantistudy.net/wp/
The Study Of Colorectal Cancer in Scotland (SOCCS) 1999 1,298 Y Y Y Y Y N Y N Oncology https://www.ed.ac.uk/usher/molecular-epidemiology/our-studies/the-study-colorectal-cancer
Cardiovascular Health Study (CHARGE) 1989 1,250 N Y Y Y N N N N Cardio https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000287.v7.p1
Growing Up in Singapore Towards healthy Outcomes (GUSTO) 2009 1,176 Y Y Y Y Y Y Y Y Paediatric Metabolism https://academic.oup.com/ije/article/43/5/1401/695117
Northern Sweden Population Health Study (EUROSPAN) 2006 1,069 Y Y Y Y Y Y Y N General http://eurospan.gen-info.hr/partners.html
HELMi (Health and Early Life Microbiota) 2016 1,055 Y Y N N N N N Y Microbiome & Paediatrics https://bmjopen.bmj.com/content/9/6/e028500.long
Prospective Investigation of the Vasculature in Uppsala Seniors (PIVUS) 2001 1,016 Y Y Y N N N Y N Cardio https://bmcmedgenomics.biomedcentral.com/articles/10.1186/s12920-016-0235-0
VIS (part of EUROSPAN) 2003 1,008 Y Y Y Y N N Y N General http://eurospan.gen-info.hr/partners.html
Milieu Intérieur cohort 2012 1,000 N Y Y Y Y Y Y Y Immunology https://www.nature.com/articles/s41590-018-0049-7
GOLDN study 968 Y Y Y Y N N N N Cardio https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2952572/
Brisbane systems genetics study (BSGS) 962 Y Y Y Y N N Y N Complex Disease https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0035430
KORCULA (Part of EUROSPAN) 1999 944 N Y Y Y Y N N N Cardio https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657564/
Diet, Obesity, and Genes (DIOGenes) 2005 932 N Y Y Y N Y N N Obesity https://www.nature.com/articles/s41467-017-02182-z
Center for the Health Assessment of Mothers and Children of Salinas (CHAMACOS) cohort 1999 800 Y Y Y Y Y N Y Y Farm exposure eg pesticides https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6444381/
Alzheimer's Disease Neuroimaging Initiative (ADNI) 2004 800 Y Y Y Y Y Y N Y Alzheimer's http://adni.loni.usc.edu/
AddNeuroMed 700 Y Y Y Y N Y N N Alzheimer's https://consortiapedia.fastercures.org/consortia/anm/
Emory Twin Study (ETS) 1946 614 Y Y Y N Y N Y N General https://link.springer.com/article/10.1186/s13148-016-0189-2
Cross-sectional analyses conducted in the Cohort on Diabetes and Atherosclerosis Maastricht (CODAM) 1999 574 N Y Y Y Y N N N Cardio https://www.sciencedirect.com/science/article/pii/S0009898103005308#aep-section-id12
Qatar Metabolomics Study on Diabetes (QMDiab) 2012 388 Y Y Y Y Y Y Y N Endocrine https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5886112/
Human Microbiome Project 2008 300 Y Y N Y Y Y N Y General https://hmpdacc.org/ihmp/overview/data-model.php
Human Adult Cerebellum Samples - 153 N Y Y Y N N N N Psychiatry https://www.sciencedirect.com/science/article/pii/S000292971000087X
Human Adult Brain Samples-Cerebellum, Frontal Cortex, Caudal Pons and Temporal Cortex - 150 N Y Y Y N N N N Neurology https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1000952
Whole blood from healthy individuals of Dutch origin - 148 N Y Y Y N N N N General https://bmcgenomics.biomedcentral.com/articles/10.1186/1471-2164-13-636#Sec13
Japanese Study on CSF Proteomic Profile 133 N Y N N N Y N N Neuro https://academic.oup.com/hmg/article/26/1/44/2595397

Y refers to the given data type being found, and N means it was not found.

Recruitment year: The year when participants were recruited, not the year of any retrospective historical event. Sample size: Total database sample size was chosen because sub-population omic data may desirably characterize the overall sample. Longitudinal: Longitudinal study design. Genome: Availability of whole-genome data. Methylome: Deoxyribonucleic acid (DNA) methylation data available as methylation arrays or deep sequencing. Transcriptome: Single-base ribonucleic acid (RNA) reads or mRNA expression data obtained via cRNA microarray chips. Metabolome and Proteome: Appropriate separation and detection methods, such as gas chromatography coupled with mass spectrometry or nuclear magnetic resonance. Broad coverage immuno-assays were also acceptable. Routine clinical blood results do not constitute metabolomics data. Phenome: Traits in individuals not recorded for clinical purposes or clinical techniques, for example, a heart rate monitor to characterize an individual's daily exercise rate. Microbiome: Characterization of participants' microbiomes either with genomic sequencing or growth characterization.