Skip to main content
PLOS One logoLink to PLOS One
. 2025 May 28;20(5):e0324189. doi: 10.1371/journal.pone.0324189

The UK Biobank mental health enhancement 2022: Methods and results

Katrina AS Davis 1,2,*, Jonathan RI Coleman 1,2, Mark Adams 3, Gerome Breen 1,2, Na Cai 4,5,6, Helena L Davies 1,7,8, Kelly Davies 9,10, Alexandru Dregan 1, Thalia C Eley 1,2, Elaine Fox 11, Jo Holliday 9,10, Christopher Hübel 1,12,13, Ann John 14,15, Aliyah S Kassam 1, Matthew J Kempton 1, William Lee 16, Danyang Li 1, Jared Maina 1, Rose McCabe 17, Andrew M McIntosh 3, Sian Oram 1, Marcus Richards 18, Megan Skelton 1, Fenella Starkey 9,10, Abigail R ter Kuile 1,18, Laura M Thornton 19, Rujia Wang 1, Zhaoying Yu 1, Johan Zvrskovec 1,2, Matthew Hotopf 1,2
Editor: Sandar Tin Tin20
PMCID: PMC12118920  PMID: 40435106

Abstract

Background

This paper introduces the UK Biobank (UKB) second mental health questionnaire (MHQ2), describes its design, the respondents and some notable findings. UKB is a large cohort study with over 500,000 volunteer participants aged 40–69 years when recruited in 2006–2010. It is an important resource of extensive health, genetic and biomarker data. Enhancements to UKB enrich the data available. MHQ2 is an enhancement designed to enable and facilitate research with psychosocial and mental health aspects.

Methods

UKB sent participants a link to MHQ2 by email in October-November 2022. The MHQ2 was designed by a multi-institutional consortium to build on MHQ1. It characterises lifetime depression further, adds data on panic disorder and eating disorders, repeats ‘current’ mental health measures and updates information about social circumstances. It includes established measures, such as the PHQ-9 for current depression and CIDI-SF for lifetime panic, as well as bespoke questions. Algorithms and R code were developed to facilitate analysis.

Results

At the time of analysis, MHQ2 results were available for 169,253 UKB participants, of whom 111,275 had also completed the earlier MHQ1. Characteristics of respondents and the whole UKB cohort are compared. The major phenotypes are lifetime: depression (18%); panic disorder (4.0%); a specific eating disorder (2.8%); and bipolar affective disorder I (0.4%). All mental disorders are found less with older age and also seem to be related to selected social factors. In those participants who answered both MHQ1 (2016) and MHQ2 (2022), current mental health measure showed that fewer respondents have harmful alcohol use than in 2016 (relative risk 0.84), but current depression (RR 1.07) and anxiety (RR 0.98) have not fallen, as might have been expected given the relationship with age. We also compare lifetime concepts for test-retest reliability.

Conclusions

There are some drawbacks to UKB due to its lack of population representativeness, but where the research question does not depend on this, it offers exceptional resources that any researcher can apply to access. This paper has just scratched the surface of the results from MHQ2 and how this can be combined with other tranches of UKB data, but we predict it will enable many future discoveries about mental health and health in general.

Introduction

Mental health disorders cause considerable suffering for a substantial portion of the population. There is still a lot about mental health and disorder that we do not understand, and there is an urgent need for research to improve our ability to promote good mental health or treat disordered mental health [1,2]. Early mortality of people with severe mental disorders from physical health conditions is likely part of a reciprocal relationship between poor physical and mental health, which reinforces the importance of considering mental health in wider medical research [35].

The UK Biobank (UKB) cohort study was established with the scale and depth to help researchers grapple with complex questions about conditions that are life-threatening or disabling in adults [6,7]. Such conditions seldom have a single cause; instead, their onsets and prognoses are associated with many risk factors, including genetic variants at multiple loci, exposures at any time from conception onwards, and interactions between them [8]. Many conditions also tend to cluster, causing further problems in investigating specific causal mechanisms [9]. These features of high complexity are true for most mental disorders [1012]. Traditional epidemiological designs of modest sample sizes and limited data capture are not well suited to these challenges [13]. The very large sample size and detail of data in UKB offers great potential for investigating many facets of mental disorders in adults.

UKB recruited over half a million individuals aged 40–69 between 2006 and 2010. The baseline assessment was extensive, with physical measures, biological sample collection (blood and urine in all, saliva in 100,000), a touchscreen questionnaire and an interview that gathered information on early life and current behaviour, as detailed on their website [14]. An impressive list of potential biomarkers has been established, and participants’ approximate addresses have been used to provide measures such as area deprivation, access to green spaces and air pollution [15]. A subset of participants has been characterised even further through ‘enhancements’, including via multimodal imaging (including neuroimaging), activity/sleep monitoring, and cognitive testing. Further data, including information on health outcomes, have been obtained through consented linkage to routinely collected information, including hospital in-patient statistics, cancer registry, COVID-19 vaccination data, death certification, and data from the primary care electronic health records for a subset of the cohort [16].

To realise the potential of UKB for mental health research, and mental aspects of physical health, a mental health outcomes (MHO) consortium was created, which made recommendations for an online questionnaire about current and past mental disorders and associated features. In 2016–17 participants were invited via email and postal newsletter to complete this first web-based questionnaire (MHQ1), leading to responses from over 157,000 individuals by July 2017 [17]. The core of the questionnaire was an assessment of lifetime depression and generalised anxiety status, which allowed genome-wide association studies (GWAS) for these disorders [18,19], but also facilitated non-genetic studies, touching on many other aspects of mental health research, as detailed in a recent review of the use of UKB for mental health research [20].

Recognising the changeability of mental health and the limitations of a single questionnaire to capture complex diagnoses, the MHO consortium proposed a second mental health questionnaire (MHQ2). Alterations were made to reflect the changing priorities of research following the COVID-19 pandemic. Like its predecessor, MHQ2 was not a distinct entity, but a collection of ‘modules’ that contained measures on a particular diagnostic or exposure topic. When choosing measures, consideration was given to comparability and complementarity with other cohort studies in the UK [21] and elsewhere [22], and experiences of using MHQ1 data. Scales on current depression, anxiety and alcohol use were repeated from MHQ1 to allow examination of possible fluctuation of these symptoms. MHQ2 also aimed to:

  • Enrich the lifetime depression phenotype including treatment response

  • Investigate new disorders: panic disorder and eating disorders

  • Update aspects of participants’ social circumstances.

Materials and methods

Aim

This paper outlines the content of the MHQ2 questionnaire, describes the cohort, and gives an overview of participant responses. We aim to report (A) who responded to the MHQ1 and MHQ2, (B) the mental health phenotypes and social factors ascertained in MHQ2, (C) changes in the current mental health of the cohort, and (D) the consistency of lifetime phenotypes.

UK Biobank

UKB invited people aged 40–69 years living close to one of 22 assessment centres in England, Scotland, and Wales between 2006–2010 to take part. 503,309 (5.5%) agreed, and 91% remain in the cohort. Three main sources of data in UKB are: baseline assessment and samples; linkages such as hospital inpatient records; and ‘enhancements’ such as imaging and the MHQs.

Linkage and enhancements have variable coverage and have been conducted at various timepoints, so caution is needed in the analysis and interpretation of such data, as shown in a recent timeline [20]. Participants in UKB differ from the wider UK population by more likely being female, having higher educational attainment, and living in areas with less deprivation, though more urban than average [23]. The ethnicity of participants is predominantly White British, with small numbers of participants from other ethnicities [24]. The health and health behaviours of the UKB participants are better than average, with less smoking, lower incidence of cancer and lower mortality [23,25].

UKB operates inside the ethical framework of the UK Health Research Authority. Participants gave consent for their data to be used and can withdraw at any time, with an option to withdraw their existing data from future analysis. Research Ethics Committee opinion has been sought for UKB and each enhancement from the North West - Haydock Research Ethics Committee, including November 2022 (21/NW/0157, amendment 07).

UKB data can be accessed by submitting a research protocol for approval [14], subject to UKB procedures including that researchers have no ability to identify participants. Over 4,000 research proposals had been approved by January 2024. Most of the data in this paper were accessed through UKB-approved research number 82087, downloaded 23/10/2023, with withdrawals downloaded 11/06/2024. The exception was for the participation numbers, which were provided by UKB’s scientific team with a censoring date of 11/06/2024.

Questionnaire administration

UKB participants who had a valid email address on record were sent an invitation email in October to November 2022, which included a hyperlink to a personalised questionnaire. If there was no response, a reminder email was sent two weeks after the initial invitation. Respondents who started, but did not complete, the questionnaire were sent a reminder two weeks after they last accessed it. Finally, a second invitation was sent to non-responders two months after their previous one (around January 2023). All participants were also able to access the online questionnaire via the UKB participant website, as explained in annual postal newsletters (sent to people with no email address). Respondents were asked to enter their date of birth as an identity check, and the data for any respondents whose entered date of birth did not match the information previously provided are not released. We define someone as having completed the MHQ2 if they completed the mental health modules shown in Table 1 (there was also a COVID-19 module) and their MHQ2 data was available, which excludes any participants who failed the identity check or withdrew their data between answering the questionnaire and our data access. This means that ‘non-completed’ includes a few participants who have answered some or all of the modules.

Table 1. Modules of the UK Biobank MHQ2.

Module Domain Source/tool Selected phenotypes defined
N = new, R = repeated, R’ = repeated with change
Mental Health Screening MHO consortium Any self-report (SR) diagnosis [R’]
Family history MHO consortium, based on other studies Known family history of mental health conditions [N]
Depression Current depression Patient Health Questionnaire 9-item version (PHQ-9) [34] PHQ-9 total score (symptoms) [R]
PHQ-9 derived depression [R]
• Algorithm-based
Current depression case [R]
• Requires case on both PHQ9 (current) and CIDI-SF (lifetime) depression algorithms
Lifetime depression Composite International Diagnostic Interview – Short Form (CIDI-SF) [35] depression module, lifetime version [36] PLUS subtype questions Depression ever [R]
Depression with melancholic features [N]
Depression with atypical features [N]
Antidepressant and therapy response MHO consortium, based on other studies Medication helped [N]
Non-medication therapy helped [N]
Mood change Lifetime manic symptoms UKB baseline questionnaire (based on CIDI questions mapping to DSM-IV) [37] Hypomania/mania ever (symptoms) [R]
Bipolar affective disorder type I [R]
Anxiety and panic Current anxiety disorder Generalised Anxiety Disorder Questionnaire - 7 item (GAD-7) [34] GAD-7 total score (symptoms) [R]
GAD-7 derived anxiety disorder [R’]
• As CIDI-SF for GAD not repeated, not able to require positive for lifetime GAD
Lifetime anxiety disorder (panic) CIDI-SF [35] panic disorder, lifetime version Panic attack ever (symptoms) [N]
Panic disorder ever [N]
Adverse life events Adverse events in childhood Childhood Trauma Screener – 5 item (CTS-5) [38] Childhood adverse events [R]
Adverse events in adulthood MHO consortium, based on APMS and National Crime Survey [39] Adult abuse events [N]
Adverse events 12 months [R’]
Alcohol use Alcohol use disorder Alcohol Use Disorders Identification Test (AUDIT) [40] AUDIT total score [R]
Harmful drinking (12 month) [R]
Cannabis use Cannabis use MHO consortium Cannabis use ever [R]
Daily cannabis use ever [R]
Harm behaviours Self-harm and suicidal thoughts MHO consortium and service-user group Harm to self ever [R]
Suicide attempt ever [R]
Eating patterns Eating disorders MHO consortium, based on other studies Anorexia nervosa [N]
Bulimia nervosa [N]
Binge-eating disorder [N]
Purging disorder [N]
General health General health and functioning EuroQoL 5 dimensions 5 levels (EQ-5D-5L) plus visual-analogue scale (VAS) [41] VAS score [R]
Social situation Social situation UKB baseline questionnaire. Includes cohabitation, social contact, employment status Social isolation [R]
Virtually connected [N]
Loneliness Abbreviated UCLA loneliness scale [42] Short scale UCLA Loneliness total score [N]
General wellbeing Resilience Brief Resilience Scale [43] Brief Resilience total score [N]
Subjective wellbeing MHO consortium QoL total score [R]

Algorithms to define ‘phenotypes’ are available on OSF [33].

Questionnaire development

The questionnaire domains were agreed upon by the consortium after a prioritisation survey and feedback from the Psychiatric Genomics Consortium (PGC) leads [26]. Measures were discussed internally and with investigators who had experience with several other studies, including Genetic Links to Anxiety and Depression (GLAD) [27], The Scottish Health Research Register & Biobank (SHARE) [28], English Longitudinal Study of Ageing (ELSA) [29], the Eating Disorders Genetics Initiative (EDGI) [30], and the Adult Psychiatric Morbidity Survey (APMS, or Survey of Mental Health and Wellbeing) [31].

The measures chosen are provided in Fig 1, which shows how some modules were repeated exactly from MHQ1, and some were changed. For example, MHQ1 asks participants if they had been diagnosed with “panic attacks or panic disorder”, but in MHQ2, people are separately asked whether they have been diagnosed with “panic attacks” and then “panic disorder” to discriminate between them. Such changes mean some sections were not exactly equivalent, so were labelled ‘repeated with changes’ in Fig 1. To maximise the acceptability of the questionnaire length, some of the measures from MHQ1 were not repeated. These choices were made according to the relative support in the consortium for inclusion. The modules are also shown in Table 1, which shows the measures used in each domain and the phenotypes that could be derived from these measures. A more detailed explanation is available on the UKB data showcase [32].

Fig 1. Summary of the topics included in the UK Biobank MHQ1 and MHQ2, illustrating the overlap.

Fig 1

Algorithms and code in R

Symptoms, questionnaire scores, experiences and (probable) diagnostic status that can be derived from the questionnaire are generically termed ‘phenotypes’ (with no assumption of genetic causation). For each module or group of modules, small groups of MHO members with expertise in those subjects produced algorithms to derive these phenotypes from questionnaire responses. These were then checked and edited to create a consistent style and circulated to the whole consortium for comments. The final document is available on our Open Science Framework (OSF) page for this project [33].

A team of coders, including members of the MHO and junior colleagues, worked in small groups to render the algorithms for each module in R code, with a check by another coder outside of that module subgroup. Lastly, an over-arching script was created that brought the code from all the modules’ subgroups together, including a final few definitions that required responses from multiple modules. The algorithms and code for processing MHQ1 and MHQ2 are available on our OSF and GitHub pages [33,44].

A clinical diagnosis is informed by diagnostic criteria, including the lack of alternative explanations and the presence of impact on the patient. However, when we use questionnaires, we are unable to take wider issues into account. The criteria we use for the “lifetime depression” phenotype, for example, are therefore referred to as quasi-diagnostic because they are informed by diagnostic criteria, but fail to fully examine a participant. When we refer to a group with “lifetime depression”, we mean only that they met the criteria for lifetime depression on our measure, not that they were fully assessed.

Analysis

We planned a descriptive summary of the questionnaire respondents and responses. We refer to the proportion of cases in the respondents. Due to the issues with representativeness, we do not attempt to make inferences about population prevalence, so hence do not include confidence intervals. Significance testing on large cohorts may identify many small differences between comparators that have very little effect [45], therefore we have emphasised the magnitude of difference rather than calculating p-values. Percentages are given in whole percentage points unless under 10% when one decimal place is used, as recommended for clarity [46]. Where rare disorders and detailed breakdown of age or ethnicity intersected to make a cell size of less than 10 participants we have merged fields out of caution for confidentiality [47].

Our analysis according to our aims were:

  • A: Describing Respondents

  • We describe participants in the 2022 MHQ2 wave, including respondents of MHQ2 regardless of whether they participated in MHQ1. The 2016 MHQ1 wave and the full original UKB cohort are presented for comparison. The age variable was handled differently for the three groups. Age for the MHQ2 wave was their age on completion of the MHQ2, and for the MHQ1 wave age at completion of MHQ1. For the UKB cohort as a whole, age represents the age they would have been at the median date of MHQ2 completion, including those who died before this date, excluding those with missing baseline data (n = 2).

  • B: Mental Health Phenotypes and Social Factors

  • We present numbers with lifetime mental health disorders in MHQ2 and describe the personal characteristics in the groups responding to different mental health disorders. Selected social data - social isolation, loneliness, resilience and health-related quality of life – is reported, stratified by lifetime mental disorder phenotypes.

  • C: Changes in Current Mental Health

  • We describe and visualise the proportion of respondents who were positive for specified current mental disorders in 2022 and in 2016 in those participants who answered in both waves. Participants are defined as having current mental disorder if they meet case criteria according to PHQ-9 (depression), GAD-7 (anxiety) and AUDIT (harmful alcohol use). These are analysed in age-group and sex-stratified blacks, and plotted as line graphs for each disorder, showing the proportion positive by age-group for each sex and each wave. Age groups in seven-year blocks, which reflects the amount of time between the two waves, such that most participants move one age block between MHQ1 and MHQ2, although not exact. Relative risk for 2022 compared to 2016 was calculated by using ‘proportion positive’ as the risk of disorder in each cohort and then risk in 2022 by that in 2016. This is done in age-and-sex stratified groups such that those aged, for example, 73–79 years in 2016 are compared with those aged 73–79 in 2022 for the four age groups where possible and also for the cohort overall.

  • D: Consistency of lifetime phenotypes

  • We compare ‘lifetime status’ in 2022 compared to 2016 in those who completed both waves, to give test-retest stability. Status for each participant was meeting case criteria for depression ever (CIDI-SF lifetime), bipolar ever (adapted CIDI), self-harm ever (self-report), cannabis ever (self-report) and any clinician diagnosis (self-report). Percentage agreement for each status was calculated as status agreeing in 2016 and 2022 (both positive or both negative) as a proportion of the cohort, and Cohen’s kappa for each status, with kappa being agreement corrected for agreement by chance [48]. Where a respondent was negative in 2016 but positive in 2022, we also took into account the date that the respondent gave for onset in MHQ2 (except for clinician diagnosis, where this was not asked). This helps separate these respondents into those whose status could have been picked up in the MHQ1 wave (date of onset from MHQ2 was before 2016), from those who might not have done (either date of onset after start of 2016 or answered ‘don’t know’ to date of onset). We decided to include unknown with the later onsets, despite no information to suggest these are more likely to be late onset, to explain the greatest amount of test-retest variability.

Results

Sample

Data provided by UKB’s scientific team showed that, after accounting for deaths and withdrawals, 457,653 participants were eligible to complete the MHQ2 (Fig 2). Of these, UKB was able to invite 329,902 (72%) participants by email, and 53% of those contacted answered at least one module of the questionnaire, alongside a small number (n = 530) who accessed the questionnaire through their participant website. 175,266 participants completed at least one module with 169,252 completing all the mental health modules in the questionnaire (97% of those who started). The response dates cluster around early November 2022 (approx. 80%), when most emails went out, with a smaller peak in approximately January 2023 when reminder emails were sent.

Fig 2. Number of UKB participants invited, starting and completing the MHQ2 questionnaire, as a proportion of those eligible for the questionnaire. UKB scientific team numbers (n = 165,262). Participants could participate in questionnaire without being invited. Started qnn = all respondents who completed at least one module on MHQ2; Completed qnn = respondents who completed all mental health modules in MHQ2.

Fig 2

Fig 3 shows that the 169,252 UKB participants who completed MHQ2 consisted of 111,272 who had also completed MHQ1 plus 57,980 who had not. This gives a sample of 111,272 who have longitudinal data (both waves), and a sample of 215,250 who have data for either MHQ.

Fig 3. The flow of participants illustrating those completing the first and second mental health questionnaires, the cross-over and participants who have died or withdrawn.

Fig 3

UKB scientific team numbers. Completed MHQ2 = completed all 12 mental health modules; Not completed MHQ2 = available but didn’t start MHQ2, started but didn’t finish, or failed their identity check; Not available = died or withdrew from the cohort prior to MHQ1/ MHQ2, or withdrew with no further access between MHQ2 and data access.

A: Describing respondents

Data available to researchers was of 169,253 respondents. These numbers vary slightly to the data from the UKB scientific team (169,252), and may vary due to late completion, quarantine of questionnaires when the identity check was failed (and resolution thereof), and the small delay in communicating participants withdrawing from UKB. Future researchers may find different numbers, also due to these factors.

The characteristics of the respondents in the MHQ2 wave are shown in Table 2. Compared to those for whom we have data from MHQ1, they are an average of five years older (after an interval between MHQ1 and 2 of approximately six years), and very similar in sex, ethnicity, and the distribution of selected social factors. The comparison of the MHQ2 wave with the UKB cohort suggested possible differences from the base cohort. These include respondents being 4.6 percentage points less likely to live in a deprived area, 5.6 percentage points less likely to have a longstanding illness and 2.8 percentage points more likely to be White, rather than a minority ethnic group, and shows that MHQ1/2 may under-represent people from Black and Asian ethnicities.

Table 2. Characteristics of the respondents to each of the mental health questionnaire waves with comparison to the UK Biobank cohort. Research team numbers n = 165,253.

Absolute Difference
Characteristic MHQ2 wave (2022) MHQ1 wave (2016) UKB full cohort MHQ2 vs
MHQ1
MHQ2 vs
UKB
169,253 % 157,274 % 502,177 %
Demographics from baseline
Age 1
 45–54 1,577 1% 23,449 15% 5,254 1% −14% −0.1%
 55–64 44,319 26% 51,830 33% 122,298 24% −6.8% +1.8%
 65–74 70,462 42% 70,122 45% 174,121 35% −3.0% +7.0%
 75–84 52,572 31% 11,873 8% 198,269 39% +24% −8.4%
 >84 323 0% 0 0% 2,235 0% +0.2% −0.3%
Median 70 65 72 +5y −2y
Sex
 Female 97,646 58% 89,048 57% 273,184 54% +1.1% +3.3%
 Male 71,607 42% 68,226 43% 228,993 46% −1.1% −3.3%
Ethnicity
 White 163,795 96.8% 152,063 96.7% 471,837 94.0% +0.1% +2.8%
 Black 1,123 0.7% 1,146 0.7% 8,022 1.6% −0.1% −0.9%
 Asian 1,448 0.9% 1,336 0.8% 9,829 2.0% 0.0% −1.1%
 Chinese 375 0.2% 364 0.2% 1,573 0.3% 0.0% −0.1%
 Mixed 863 0.5% 819 0.5% 2,902 0.6% 0.0% −0.1%
 Other 902 0.5% 876 0.6% 4,553 0.9% 0.0% −0.4%
 Missing 747 0.4% 670 0.4% 3,461 0.7% 0.0% −0.2%
Social factors from baseline
Relative deprivation2 20,015 12% 19,281 12% 82,263 16% −0.4% −4.6%
Degree educated 75,311 44% 70,940 45% 161,019 32% −0.6% +12%
Rents home3 7,909 5% 7,939 5% 46,403 9% −0.4% -4.6%
Self-reports illness > 1y 44,296 26% 43,420 28% 159,800 32% −1.4% −5.6%
Smoker 11,861 7% 11,330 7% 52,940 11% −0.2% −3.5%
Physically active4 62,371 37% 57,788 37% 169,109 34% +0.1% +3.2%
Neuroticism score, mean (sd)5 3.87 (3.16) 3.87 (3.17) 4.12 (3.27) 0.0pt −0.25pt

(1) Age in the MHQ2 wave is at date of completion of MHQ2. Age in the MHQ1 wave is at approximate date of completion of MHQ1. Age for full cohort is the age a participant would have been on the median date of MHQ2 completion.

(2) Townsend Deprivation Score of residence ≥ +2 (where 0 is average, and higher scores are more deprived, [23]).

(3) Private or social rented accommodation at baseline.

(4) Reported at least moderate activity for 20 minutes three times per week.

(5) Eysenck neuroticism scale.

B(1): Mental health phenotypes

In Table 3 and the UPSET plot S1 Fig in S1 File, participants are categorised by lifetime disorder status based on symptom-based definitions for depression, panic disorder, eating disorders and bipolar type I. Participants were most likely to be positive for lifetime depression (18%), followed by panic disorder (4.0%), any specific eating disorder (2.8%), and bipolar affective disorder type I (0.4%). Seventy-one percent of the MHQ2 respondents did not meet the criteria for any of the studied lifetime disorders.

Table 3. Respondents categorised by symptom-based criteria, which are not exclusive, with characteristics based on baseline questionnaire (BL) and second mental health questionnaire (MHQ2/Q2). Research team numbers n = 165,253.

Characteristic Overall No lifetime criteria Depression Panic Disorder Any eating disorder Bipolar type I
N % item N. % item % no dx N. % item % dep’n N. % item % panic dx N. % item % eating dx N. % item % bp I
N 169,253 100% 132,975 NA 79% 31,243 NA 18% 6,703 NA 4.0% 4,762 NA 2.8% 721 NA 0.4%
Age 1
45-54 1,577 0.9% 1,041 0.8% 66% 444 1.4% 28% 133 2.0% 8.4% 99 2.1% 6.3% 15 2.1% 1.0%
55-64 44,319 26% 30,781 23% 69% 11,488 37% 26% 2,880 43% 6.5% 2,238 47% 5.0% 337 47% 0.8%
65-74 70,462 42% 55,443 42% 79% 12,962 41% 18% 2,642 39% 3.7% 1,899 40% 2.7% 264 37% 0.4%
75+ 52,895 31% 45,710 34% 86% 6,349 20% 12% 1,048 16% 2.0% 526 11% 1.0% 105 0 0.2%
Median 70 71 67 66 65 65
Sex
Female 97,646 58% 71,372 54% 73% 22,169 71% 23% 4,913 73% 5.0% 4,447 93% 4.6% 473 66% 0.5%
Male 71,607 42% 61,603 46% 86% 9,074 29% 13% 1,790 27% 2.5% 315 7% 0.4% 248 34% 0.3%
Ethnicity
White 163,795 97% 128,605 97% 79% 30,306 97% 19% 6,493 97% 4.0% 4,598 97% 2.8% 682 95% 0.4%
Black 1,123 0.7% 921 0.7% 82% 176 0.6% 16% 43 0.6% 3.8% NA NA NA NA NA NA
Asian 1,448 0.9% 1,232 0.9% 85% 184 0.6% 13% 38 0.6% 2.6% NA NA NA NA NA NA
Chinese 375 0.2% 319 0.2% 85% 49 0.2% 13% 11 0.2% 2.9% NA NA NA NA NA NA
Mixed 863 0.5% 603 0.5% 70% 223 0.7% 26% 56 0.8% 6.5% NA NA NA NA NA NA
Other 902 0.5% 713 0.5% 79% 164 0.5% 18% 36 0.5% 4.0% NA NA NA NA NA NA
Missing 747 0.4% 582 0.4% 78% 141 0.5% 19% 26 0.4% 3.5% NA NA NA NA NA NA
[Combined]2 NA NA NA NA NA NA NA NA NA NA NA 164 3.4% 3.0% 39 5.4% 0.7%
Degree educated (BL) 75,311 44% 58,565 44% 78% 14,280 46% 19% 2,818 42% 3.7% 2,522 53% 3.3% 340 47% 0.5%
Resides in deprived area (BL)3 20,015 12% 14,659 11% 73% 4,629 15% 23% 1,156 17% 5.8% 848 18% 4.2% 165 23% 0.8%
Rents home (BL) 7,909 4.7% 5,249 3.9% 66% 2,365 7.6% 30% 641 10% 8.1% 394 8.30% 5.0% 110 15% 1.4%
Self-report illness for>1y (BL) 44,296 26% 32,101 24% 72% 10,847 35% 24% 2,513 37% 5.7% 1,392 29% 3.1% 394 55% 0.9%
Smoker (BL) 11,861 7.0% 8,396 6.3% 71% 3,024 10% 25% 751 11% 6.3% 464 10% 3.9% 116 16% 1.0%
Physically active (BL)4 62,371 37% 48,899 37% 78% 11,741 38% 19% 2,513 37% 4.0% 1,538 32% 2.5% 263 36% 0.4%
Childhood adverse experience (Q2)5 71,049 42% 50,170 38% 71% 18,067 58% 25% 4,280 64% 6.0% 3,019 63% 4.2% 536 74% 0.8%
Adult adverse experience (ever) (Q2) 38,027 22% 23,584 18% 62% 12,465 40% 33% 3,124 47% 8.2% 2,425 51% 6.4% 399 55% 1.0%
Low resilience (Q2)6 26,367 16% 13,103 10% 50% 11,944 38% 45% 3,127 47% 12% 1,640 34% 6.2% 406 56% 1.5%
Social isolation (Q2)7 17,365 10% 12,099 9.1% 70% 4,675 15% 27% 1,077 16% 6.2% 706 15% 4.1% 119 17% 0.7%
Loneliness score, median (IQR) (Q2)8 3 3–5 3 3–4 4 3–6 5 3–6 5 3–6 5 3–7
Self-rated health median (IQR) (Q2)9 83 72–90 85 75–91 79 65–89 75 60–85 80 65–90 70 49–83
Neuroticism score, mean (sd) (BL)10 4 -3.16 3 -2.93 6 -3.34 7 -3.36 6 -3.35 7.03 -3.42

BL = based on answers to baseline questions; Q2 = based on answers to MHQ2 questions; dx = disorder.

(1) Age at MHQ2 is at date of completion of MHQ2.

(2) To preserve participant privacy, non-White and missing ethnicities are collapsed to a single value (“Combined”) if one cell size is smaller than 10.

(3) Townsend Deprivation Score of residence ≥ +2 (where 0 is average, and higher scores are more deprived, [23]).

(4) Reported at least moderate activity for 20 minutes three times per week.

(5) Any domain positive on the Childhood Trauma Screener.

(6) Brief Resilience Scale, low = 1 to 2.99, where scale is 1–5.

(7) Scores on at least two of three isolation questions.

(8) Loneliness score from 3 to 9 points, with 3 being least lonely.

(9) EQ-5D visual analogue scale, from 0 to 100, higher indicating better health.

(10) Eysenck neuroticism scale.

The symptom-based definition of panic disorder was included for the first time in the MHQ2 wave. Self-reported clinician diagnosis of panic disorder appears to be much lower (0.2%) than the symptom-based definition (4.0%), although 4.5% reported a clinician diagnosis of a panic attack. For context, in the MHQ1, the symptom-based definition of lifetime generalised anxiety disorder (GAD) was met by 7%.

Within the eating disorders category were four specific disorders, and respondents could fall into more than one in their lifetime. Anorexia nervosa criteria were met by 1.7% of respondents, purging disorder by 0.9%, bulimia nervosa by 0.4%, and binge-eating disorder by 0.2%. This can be compared to self-report of an eating disorder diagnosis, for example, self-report of anorexia nervosa was 0.5%.

The proportion with any lifetime disorder (symptom-based definition) may be patterned by age, ethnicity and sex. Lifetime disorder appears to be higher in younger age (45–64: 34%), higher in those of Mixed ethnicity at 30%, than White (21%) or Asian (15%); higher in female (27%) than male (14%). We particularly saw apparent higher rates in females was particularly marked for eating disorders (F: 4.6%, M: 0.4%).

S1 Fig in S1 File shows the degree of comorbidity between the phenotypes in Table 3 with the addition of self-harm ever. The lifetime self-harm phenotype includes 7765 (4.6%) participants, most (57%) also meeting the criteria for another phenotype, most commonly depression. Of those with lifetime depression, 29% also have one or more other phenotype, most commonly self-harm.

B(2): Social factors

The distributions of social factors are shown in Table 3, with the results in female participants in S1 Table in S1 File and male participants in S2 Table in S1 File. The variables assessed at baseline were area-level deprivation, education (‘degree educated’), housing tenure (‘rents home’), longstanding illness, smoking, physical activity, and neuroticism – of which area-level deprivation, renting a home and smoking seem likely to be related to lifetime disorder. The group of respondents that met the criteria for eating disorders and panic disorder appear to have similar characteristics to those respondents with depression and bipolar, although it was remarkable that over half of those with lifetime eating disorders had a degree qualification (53%).

Adverse events in childhood were reported by 42% and adverse events in their adult life by 22%. In the group that met no symptom-based definition in the MHQ2, 38% reported a childhood event, increasing to 58% in those with lifetime depression, and rising further for rarer disorders – with the same pattern for events in adult life. The pattern of adverse childhood event report stratified by major phenotypes is the same in both men and women, S1 and S2 Table in S1 File.

Resilience was ‘low’ on the brief resilience scale in 10% of the group that met no symptom-based definition in the MHQ2 but 34% to 56% in those who met at least one symptom-based definition for a lifetime disorder. There also appeared to be a pattern for higher social isolation and loneliness in those who met at least one symptom-based definitions, but this was less marked than the pattern for low resilience.

Self-rated health was taken from the EQ-5D-5L visual analogue scale, where respondents rate their health ‘today’ using a slider from 0 to 100 [41]. This rating was an average of 88 for people with no lifetime criteria, 80 for those with depression and eating disorder history, 75 for those with panic disorder and 70 for those with bipolar disorder. More than a quarter of those with lifetime bipolar disorder had marked their health as below 50 on the scale.

C: Changes in current mental health

Three measures of current mental health were included in both questionnaires: AUDIT for harmful alcohol use “in the last year”, PHQ-9 for depression “in the last two weeks”, GAD-7 for generalised anxiety “in the last two weeks”. These were identical in MHQ1 and MHQ2. Restricting to people with data from both the MHQ1 and MHQ2 waves (n = 111,275), S3 Table in S1 File shows the proportions meeting the criteria, stratified by age and sex at time of answering. This is illustrated in Fig 46 where the lines show the proportion of the cohort positive for each disorder across the age range. The figures show that, within each wave, the slope of proportion with depression, generalised anxiety and harmful alcohol use by age is downwards, suggesting current mental disorder is less common with increasing age. Between questionnaires, harmful alcohol use decreased along lines predicted for the age at the later wave (Fig 4). For depression (Fig 5) and generalised anxiety (Fig 6), the curves for the older MHQ2 wave do not follow the age curve to the same extent as for harmful alcohol use, but shift towards greater depression and anxiety in 2022 than the same age respondents in 2016.

Fig 4. Proportion in each age group (at completion of relevant questionnaire) with harmful alcohol use according to AUDIT by sex and wave, restricted to those who completed both MHQ1 ≈ 2016 and MHQ2 ≈ 2022 (n = 111,275).

Fig 4

Fig 6. Proportion in each age group (at completion of relevant questionnaire) with generalised anxiety disorder according to ‘GAD-7-derived current anxiety’ by sex and wave, restricted to those who completed both MHQ1 ≈ 2016 and MHQ2 ≈ 2022 (n = 111,275).

Fig 6

Fig 5. Proportion in each age group (at completion of relevant questionnaire) with ‘PHQ-9-derived current depression’ outcome by sex and wave, restricted to those who completed both MHQ1 ≈ 2016 and MHQ2 ≈ 2022 (n = 111,275).

Fig 5

S4 Table in S1 File quantifies this a little more by reporting the relative risk of each current mental disorder in 2022 versus 2016 in each sex and age-group stratification and overall. A relative risk of 1 would suggest no change of risk in the group for the current disorder in 2022 compared to 2016 wave, with values above 1 suggesting a higher risk in 2022. The relative risk for every sex and age-group stratified group was above 1, between 1.07 and 2.00, with the latter representing a doubling of the presence of current depression in women aged 73–79 in 2022 compared to 2016. Despite age-group stratified relative risks being above 1, the relative risk for the whole cohort are close to 1 for depression (1.07) and anxiety (0.98), suggesting little change between 2016 and 2022, or below 1 for alcohol use disorder (0.84), due to the gradient of fewer cases in older age groups.

D: Consistency of lifetime phenotypes

Table 4 shows the difference in results from measures that report lifetime phenotypes in those who completed both MHQ1 and MHQ2: two symptom-based definitions of lifetime disorder (depression and bipolar), two lifetime behaviours (self-harm and cannabis use), and self-report of any clinician diagnosis. The criteria for positive status were the same in both questionnaires. Across these phenotypes, respondents seem more likely to meet the criteria in MHQ1 than in MHQ2, except for self-harm. Kappa values for the two symptom-based definitions are depression 0.53 and bipolar 0.30, with values being higher in the other categories (any clinician diagnosis 0.66, self-harm 0.67, cannabis 0.81). A potential cause of lack of agreement between the questionnaire waves are cases that started after completing MHQ1. Looking at depression, of those 5,947 who were positive in MHQ2 but not MHQ1, at least 4,445 (75%) had an onset before the MHQ1 wave, so could have been detected then. For self-harm 77%, bipolar 90% and cannabis 93% of those who were negative in MHQ1 could have been detected according to self-reported onset, therefore new-onset can explain only a small amount of discordant cases. S5 and S6 Tables in S1 File show the sex-stratified results, and they are similar; for example, self-harm reporting in women had a kappa of 0.68, and in men 0.65.

Table 4. Comparison of lifetime phenotype status in MHQ1 and MHQ2 in those participants who completed both questionnaires (n = 111,275). Percentage is proportion of those participants in each agreement category, except last column. Last column takes into account the date of onset in lack of agreement, percentage is proportion with onset prior to 2016 (approximate date of MHQ1).

Status MHQ1 Positive Positive Negative Negative Agree (%) Kappa Negative
Status MHQ2 Positive Negative Positive Negative Positive AND onset <2016*
Case criteria met Both 1 not 2 2 not 1 Neither % of 2 not 1
Depression 14,612/111,275 (13.1%) 11,670/111,275 (10.5%) 5,947/111,275 (5.3%) 79,046/111,275 (71.0%) 84% 0.53 4,445/5,947 (74.7%)
Bipolar 157/111,275 (0.1%) 448/111,275 (0.4%) 279/111,275 (0.3%) 110,391/111,275 (99.2%) 99% 0.30 250/279 (89.6%)
Self-harm 3,498/111,275 (3.1%) 1,411/111,275 (1.3%) 1,788/111,275 (1.6%) 104,578/111,275 (94.0%) 97% 0.67 1,378/1,788 (77.1%)
Cannabis use 21,630/111,275 (19.4%) 4,015/111,275 (3.6%) 3,393/111,275 (3.0%) 82,237/111,275 (73.9%) 93% 0.81 3,139/3,393 (92.5%)
Any SR clinician diagnosis 25,701/111,275 (23.1%) 11,678/111,275 (10.5%) 4,286/111,275 (3.9%) 69,610/111,275 (62.6%) 86% 0.66 NA

* date of onset (of depression, bipolar, self-harm or cannabis use) was reported in MHQ2 to be before MHQ1 wave, rather than onset that was definitely after MHQ1 wave or was unknown.

Discussion

This paper introduced UK Biobank’s second mental health questionnaire. The original MHQ was one of the largest mental health surveys ever reported. This second wave gives an even larger sample size, more detail, and coverage of broader aspects of mental health. Responses from 169,253 UK Biobank participants are available for the MHQ2, which brings to 215,252 the number who have at least one wave of the mental health questionnaire that can be analysed. We have selected a few notable features to describe, which we hope will stimulate interest in this resource. These were: (A) who responded to the MHQ2; (B) the mental health phenotypes and social factors; (C) changes in the mental health of the cohort; and (D) the consistency of lifetime phenotypes.

Respondents: There was a substantial overlap of participants who completed the previous wave, and participant characteristics of the two waves were similar. As shown in the previous paper [17] the biases in completion of the mental health questionnaires resembled and amplified the biases in recruitment to UKB. The respondents to MHQ2 appeared less likely to be deprived or have a chronic illness than the overall UKB cohort, and 97% were from a White ethnic group.

Mental health and social factors: Depression was the most common lifetime disorder at 18% in the MHQ2 wave, although it was 24% in the MHQ1 wave. Next in this questionnaire was panic disorder (4.0%), eating disorder (2.8%) and bipolar type I (0.4%). The pattern of social and health characteristics appear to be different for people who met criteria for any of those four disorders, including suggestions of increased adverse child events, more difficult and deprived personal circumstances, and poorer health. This pattern seems more pronounced with rarer disorders. If those people in UK Biobank with a history of mental health disorder or disorders had worse self-rated health, this can be due to health being poorer both because of the mental disorders themselves or because they are at greater risk of developing physical illness or, conversely, due to them having a physical disorder that places them at higher risk of developing a mental disorder [9]. UK Biobank is a valuable data source for studying these issues, and repeated measures to give longitudinal data will help.

Repeated measures

Current mental health: In the MHQ1 wave, both lifetime and current mental health disorders appeared to be rarer in old age. This could be an age effect (mental health gets better as people age) or a cohort effect (people born earlier have better mental health). Using repeated current mental disorder scales, the frequency of alcohol use disorder appeared to decrease between MHQ1 and MHQ2 (six years), suggesting an age effect. For depression and anxiety, the frequency appeared to be nearly unchanged as the cohort aged, suggesting a cohort effect. Another possibility is that 2022 was an era of higher anxiety and depression due to the COVID pandemic and knock-on effects, for example, prolonged health service disruption, which may make it difficult for any age-related improvement in mental health to be demonstrated at this time. The short-term deviation of mental distress from the COVID pandemic has been well-documented (for example,[49]), but the long-term mental health outcomes of the period are still unknown [50], and may merit monitoring. There are already some measures that can be examined longitudinally, for instance, selected questions from the PHQ-9 were asked at baseline and in several enhancements (sleep, cognitive testing, imaging visits, MHQ1 and MHQ2) and the EQ-5D-5L (health-related quality of life) reported here from MHQ2 was also asked in the UKB pain questionnaire in 2019, and in the pain 2 questionnaire in 2024.

Lifetime phenotypes: We investigated the test-retest consistency for phenotypes that we would expect to be similar between waves: lifetime depression, bipolar affective disorder type I, self-harm, cannabis use and self-report of any clinician diagnosis. In all cases, there was high agreement on phenotype status (from 84% for depression to 99% for bipolar), although kappa (which corrects for agreement by chance) revealed that lifetime depression and lifetime bipolar statuses were somewhat inconsistent (0.53 for depression, 0.30 for bipolar), and this was not accounted for by new-onset. To meet criteria for depression and bipolar, several criteria needed to be met, frequently hinging on a dichotomous scoring of a Likert-like scale. For instance, for the question “In your worst ever episode… How much did these problems interfere with your life or activities? – A lot/ Somewhat/ A little/ Not at all”, an answer of “A lot” or “Somewhat” would support a depression status, but “A little” would rule it out. We can speculate that a person’s appraisal could change from “Somewhat” to “A little” over time, perhaps to a more positive interpretation as they age, or that the worst episode they were thinking about in 2016 was different in this respect from the one they were thinking about in 2022. Therefore, a change in status according from criteria may occur with only modest changes in reporting. To meet the criteria for bipolar, participants had to meet criteria for both depression and mania. The definition for ever self-harm and cannabis consisted of a few yes/no questions. For yes/no questions, and when talking about behaviours rather than feelings, there would be less reappraisal. It is not clear why the recall of self-report clinician diagnosis might be inconsistent, perhaps it hinges on recall of whether a disorder was diagnosed, rather than merely discussed or mentioned. These inconsistencies may complicate the interpretation of some of these outcomes, but it is possible they reflect the underlying uncertain boundaries between health and disorder [51,52].

Strengths and limitations

UK Biobank is a valuable resource for investigating mental health, but it also presents challenges. The cohort consists of a convenience sample, and the non-representativeness observed at baseline assessments is exacerbated in the voluntary enhancements (such as the MHQs). The multiple enhancements with different samples and timings can complicate planning and interpreting analyses. We can speculate that people with poorer health, including mental health, would be less likely to start a UK Biobank questionnaire, or may have not been able to complete (6,000 people or 3% of those that started did not complete). Because of non-representativeness, researchers should not make population inferences based solely on UK Biobank data. We have not tested what effect the non-representativeness of the sample has had on our ascertainment of mental disorders and the pattern with social factors.

We have designed the UKB MHQ1 and MHQ2 with researchers familiar with large cohorts and clinical academics – leading to the use of widely accepted measures and clinically relevant outcomes. The completion of the questionnaire was also high once started, assisted by reminder emails. However, the MHQs are reliant on self-report, frequently of an episode of psychopathology that may have been many years ago. This may result in some inconsistencies, such as seen in the test-retest statistics – which we suggest is due to inconsistent recall and indistinct boundaries in mental disorders, which have no diagnostic tests.

Our approach to developing this paper and facilitating further research using MHQ2 has incorporated quality assurance processes when developing algorithms and code, which we are sharing with the wider research community. We enhance transparency by sharing resources, however, errors are still inevitable in such projects [53] so we encourage caution when using these resources, and feedback from others if they suspect any problems. This paper is just the beginning, which we hope will encourage other researchers to investigate more deeply. Further advice for people considering using UKB for mental health research is available in a recent review paper [20].

Conclusion

The mental health questionnaires enhance UKB with large amounts of data on common mental disorders (particularly depression), and also on disorders that have not received much research in an older population, such as panic disorder and eating disorders, and update social factors a decade after baseline. Importantly, the UKB now has included transdiagnostic symptoms (self-harm, body weight/shape preoccupation in MHQ2, psychotic experiences in MHQ1) and psychological constructs (resilience, loneliness and neuroticism) that are infrequently or never available from clinical data. Repeated measures for items such as depressive symptoms and health-related quality of life, along with linkage to routine health data for follow-up, allow longitudinal and causal analyses. UKB offers a large sample size, extensive genetic information, biomarkers and imaging data, alongside detailed mental health data, which we hope will be used to help us understand more about mental health, and how we might prevent and treat poor mental health in the future.

Supporting information

S1 File. S1 Fig, S1-6 Tables.

(PDF)

pone.0324189.s001.pdf (449KB, pdf)

Abbreviations

AUDIT

Alcohol Use Disorder Identification Tool

CIDI-SF

Composite International Diagnostic Interview Short Form

DSM-IV

Diagnostic and Statistical Manual of Mental Disorders 4th ed.

EQ-5D-5L

EuroQol questionnaire of 5 dimensions and 5 levels, plus VAS

GAD-7

Generalised Anxiety Disorder scale (for current GAD)

MHO

Mental Health Outcomes [consortium]

MHQ

Mental health questionnaire

OSF

Open Science Framework

PHQ-9

Patient Health Questionnaire (for current depression)

R

Statistical software

UKB

UK Biobank

VAS

Visual Analogue Scale (used in EQ-5D-5L)

Data Availability

Data cannot be shared publicly for participant confidentiality, but can be accessed subject to conditions by application to UK Biobank https://www.ukbiobank.ac.uk/enable-your-research. Supporting information for reproducing variables is available https://osf.io/5a8yu/

Funding Statement

No specific funding was received for this work. KASD was supported by the National Institute of Health and Care Research (NIHR) Maudsley Biomedical Research Centre at South London and Maudsley National Health Service (NHS) Foundation Trust and King’s College London (NIHR203318, https://www.maudsleybrc.nihr.ac.uk/about-us.html). The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care. No funder had any role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.GBD 2019 Mental Disorders Collaborators. Global, regional, and national burden of 12 mental disorders in 204 countries and territories, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Psychiatry. 2022;9(2):137–50. doi: 10.1016/S2215-0366(21)00395-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Arango C, Díaz-Caneja CM, McGorry PD, Rapoport J, Sommer IE, Vorstman JA, et al. Preventive strategies for mental health. Lancet Psychiatry. 2018;5(7):591–604. doi: 10.1016/S2215-0366(18)30057-9 [DOI] [PubMed] [Google Scholar]
  • 3.Prince M, Patel V, Saxena S, Maj M, Maselko J, Phillips MR, et al. No health without mental health. Lancet. 2007;370(9590):859–77. doi: 10.1016/S0140-6736(07)61238-0 [DOI] [PubMed] [Google Scholar]
  • 4.Firth J, Siddiqi N, Koyanagi A, Siskind D, Rosenbaum S, Galletly C, et al. The Lancet Psychiatry Commission: a blueprint for protecting physical health in people with mental illness. Lancet Psychiatry. 2019;6(8):675–712. doi: 10.1016/S2215-0366(19)30132-4 [DOI] [PubMed] [Google Scholar]
  • 5.Ohrnberger J, Fichera E, Sutton M. The dynamics of physical and mental health in the older population. J Econ Ageing. 2017;9:52–62. doi: 10.1016/j.jeoa.2016.07.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779. doi: 10.1371/journal.pmed.1001779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Allen NE, Lacey B, Lawlor DA, Pell JP, Gallacher J, Smeeth L, et al. Prospective study design and data analysis in UK Biobank. Sci Transl Med. 2024;16(729):eadf4428. doi: 10.1126/scitranslmed.adf4428 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Wu H, Eckhardt CM, Baccarelli AA. Molecular mechanisms of environmental exposures and human disease. Nat Rev Genet. 2023;24(5):332–44. doi: 10.1038/s41576-022-00569-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ronaldson A, Arias de la Torre J, Prina M, Armstrong D, Das-Munshi J, Hatch S, et al. Associations between physical multimorbidity patterns and common mental health disorders in middle-aged adults: A prospective analysis using data from the UK Biobank. Lancet Reg Health Eur. 2021;8:100149. doi: 10.1016/j.lanepe.2021.100149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Martins J, Yusupov N, Binder EB, Brückl TM, Czamara D. Early adversity as the prototype gene × environment interaction in mental disorders? Pharmacology Biochemistry and Behavior. 2022;215:173371. doi: 10.1016/j.pbb.2022.173371 [DOI] [PubMed] [Google Scholar]
  • 11.Brainstorm Consortium, Anttila V, Bulik-Sullivan B, Finucane HK, Walters RK, Bras J, et al. Analysis of shared heritability in common disorders of the brain. Science. 2018;360(6395):eaap8757. doi: 10.1126/science.aap8757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Uher R, Zwicker A. Etiology in psychiatry: embracing the reality of poly-gene-environmental causation of mental illness. World Psychiatry. 2017;16(2):121–9. doi: 10.1002/wps.20436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Burton PR, Hansell AL, Fortier I, Manolio TA, Khoury MJ, Little J, et al. Size matters: just how big is BIG?: Quantifying realistic sample size requirements for human genome epidemiology. Int J Epidemiol. 2008;38(1):263–73. doi: 10.1093/ije/dyn147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.UK Biobank. UK Biobank Website. 2024. [cited Aug 2024]. https://www.ukbiobank.ac.uk [Google Scholar]
  • 15.Mamouei M, Zhu Y, Nazarzadeh M, Hassaine A, Salimi-Khorshidi G, Cai Y, et al. Investigating the association of environmental exposures and all-cause mortality in the UK Biobank using sparse principal component analysis. Sci Rep. 2022;12(1):9239. doi: 10.1038/s41598-022-13362-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.UK Biobank. Health Outcomes Overview v.2. Contract No.: Resource 596. https://biobank.ndph.ox.ac.uk/showcase/refer.cgi?id=5962024 [Google Scholar]
  • 17.Davis KAS, Coleman JRI, Adams M, Allen N, Breen G, Cullen B, et al. Mental health in UK Biobank - development, implementation and results from an online questionnaire completed by 157 366 participants: a reanalysis. BJPsych Open. 2020;6(2):e18. doi: 10.1192/bjo.2019.100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Coleman JRI, Peyrot WJ, Purves KL, Davis KAS, Rayner C, Choi SW, et al. Genome-wide gene-environment analyses of major depressive disorder and reported lifetime traumatic experiences in UK Biobank. Mol Psychiatry. 2020;25(7):1430–46. doi: 10.1038/s41380-019-0546-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Purves KL, Coleman JRI, Meier SM, Rayner C, Davis KAS, Cheesman R, et al. A major role for common genetic variation in anxiety disorders. Mol Psychiatry. 2020;25(12):3292–303. doi: 10.1038/s41380-019-0559-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Davis KAS, Mirza L, Clark SR, Coleman JRI, Kassam AS, Mills N, et al. The role of UK Biobank in mental health research: a review and commentary. Center for Open Science; 2024. doi: 10.31219/osf.io/bsywk [DOI] [Google Scholar]
  • 21.Catalogue of Mental Health Measures. 2024. www.cataloguementalhealth.ac.uk [Google Scholar]
  • 22.Psychiatric Genomics Consortium. About Us. In: PGC; 2024. https://pgc.unc.edu/about-us/ [Google Scholar]
  • 23.Fry A, Littlejohns TJ, Sudlow C, Doherty N, Adamska L, Sprosen T, et al. Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population. Am J Epidemiol. 2017;186(9):1026–34. doi: 10.1093/aje/kwx246 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Carress H, Lawson DJ, Elhaik E. Population genetic considerations for using biobanks as international resources in the pandemic era and beyond. BMC Genomics. 2021;22(1):351. doi: 10.1186/s12864-021-07618-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Schoeler T, Speed D, Porcu E, Pirastu N, Pingault J-B, Kutalik Z. Participation bias in the UK Biobank distorts genetic associations and downstream analyses. Nat Hum Behav. 2023;7(7):1216–27. doi: 10.1038/s41562-023-01579-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Sullivan PF, Agrawal A, Bulik CM, Andreassen OA, Børglum AD, Breen G, et al. Psychiatric Genomics: An Update and an Agenda. Am J Psychiatry. 2018;175(1):15–27. doi: 10.1176/appi.ajp.2017.17030283 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Davies MR, Kalsi G, Armour C, Jones IR, McIntosh AM, Smith DJ, et al. The Genetic Links to Anxiety and Depression (GLAD) Study: Online recruitment into the largest recontactable study of depression and anxiety. Behav Res Ther. 2019;123:103503. doi: 10.1016/j.brat.2019.103503 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Iveson MH, Ball EL, Doherty J, Pugh C, Vashishta S, Palmer CNA, et al. Cohort profile: The Scottish SHARE Mental Health (SHARE-MH) cohort - linkable survey, genetic and routinely collected data for mental health research. BMJ Open. 2024;14(1):e078246. doi: 10.1136/bmjopen-2023-078246 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.CLOSER. English Longitudinal Study of Ageing. 2024. [cited 29 Oct 2024]. https://closer.ac.uk/study/english-longitudinal-study-of-ageing/ [Google Scholar]
  • 30.Bulik CM, Thornton LM, Parker R, Kennedy H, Baker JH, MacDermod C, et al. The Eating Disorders Genetics Initiative (EDGI): study protocol. BMC Psychiatry. 2021;21(1):234. doi: 10.1186/s12888-021-03212-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Catalogue of Mental Health Measures. Adult Psychiatric Morbidity Survey (APMS). In: Studies. 2019. https://www.cataloguementalhealth.ac.uk/?content=study&studyid=APMS [Google Scholar]
  • 32.UK Biobank. Mental well-being web questionnaire. 2023. [cited Aug 2024]. https://biobank.ndph.ox.ac.uk/showcase/refer.cgi?id=2800 [Google Scholar]
  • 33.Davis K, Coleman J, UKB Mental Health Outcomes Consortium. UKB 2nd mental health questionnaire open science framework home. 2024. [cited 2024 August]. https://osf.io/c65t7/ [Google Scholar]
  • 34.Kroenke K, Spitzer RL, Williams JBW, Löwe B. The Patient Health Questionnaire Somatic, Anxiety, and Depressive Symptom Scales: a systematic review. Gen Hosp Psychiatry. 2010;32(4):345–59. doi: 10.1016/j.genhosppsych.2010.03.006 [DOI] [PubMed] [Google Scholar]
  • 35.Kessler RC, Andrews G, Mroczek D, Ustun B, Wittchen H. The World Health Organization Composite International Diagnostic Interview short‐form (CIDI‐SF). Int J Methods Psych Res. 1998;7(4):171–85. doi: 10.1002/mpr.47 [DOI] [Google Scholar]
  • 36.Levinson D, Potash J, Mostafavi S, Battle A, Zhu X, Weissman M. Brief Assessment Of Major Depression For Genetic Studies: Validation Of Cidi-Sf Screening With Scid Interviews. European Neuropsychopharmacology. 2017;27:S448. doi: 10.1016/j.euroneuro.2016.09.514 [DOI] [Google Scholar]
  • 37.Smith DJ, Nicholl BI, Cullen B, Martin D, Ul-Haq Z, Evans J, et al. Prevalence and characteristics of probable major depression and bipolar disorder within UK biobank: cross-sectional study of 172,751 participants. PLoS One. 2013;8(11):e75362. doi: 10.1371/journal.pone.0075362 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Glaesmer H, Schulz A, Häuser W, Freyberger HJ, Brähler E, Grabe H-J. The childhood trauma screener (CTS) - development and validation of cut-off-scores for classificatory diagnostics. Psychiatr Prax. 2013;40(4):220–6. doi: 10.1055/s-0033-1343116 [DOI] [PubMed] [Google Scholar]
  • 39.Office for National Statistics. Crime in England and Wales: year ending. 2019. https://www.ons.gov.uk/peoplepopulationandcommunity/crimeandjustice/bulletins/crimeinenglandandwales/yearendingseptember20192019 [Google Scholar]
  • 40.Babor T, Higgins-Biddle J, Saunders J, Monteiro M, World Health Organization. Audit: The alcohol use disorders identification test: Guidelines for use in primary health care. 2001. [Google Scholar]
  • 41.Herdman M, Gudex C, Lloyd A, Janssen M, Kind P, Parkin D, et al. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res. 2011;20(10):1727–36. doi: 10.1007/s11136-011-9903-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Demakakos P, Nunn S, Nazroo J. 10. Loneliness, relative deprivation and life satisfaction. In: English Longitudinal Study of Ageing, editor. Retirement, health and relationships of the older population in England (ELSA Wave 2). The Institute for Fiscal Studies; 2006. [Google Scholar]
  • 43.Smith BW, Dalen J, Wiggins K, Tooley E, Christopher P, Bernard J. The brief resilience scale: assessing the ability to bounce back. Int J Behav Med. 2008;15(3):194–200. doi: 10.1080/10705500802222972 [DOI] [PubMed] [Google Scholar]
  • 44.Coleman J. R code for UKB MHQ2 algorithms. github. 2024. https://github.com/ColemanResearchGroup/MHQ2 [Google Scholar]
  • 45.Aguinis H, Vassar M, Wayant C. On reporting and interpreting statistical significance and p values in medical research. BMJ Evid Based Med. 2021;26(2):39–42. doi: 10.1136/bmjebm-2019-111264 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Cole TJ. Too many digits: the presentation of numerical data. Arch Dis Child. 2015;100(7):608–9. doi: 10.1136/archdischild-2014-307149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.UK Biobank. Reporting small numbers in results in research outputs using UK Biobank data. https://www.ukbiobank.ac.uk/media/ydilu0ug/numbers-for-publications-uncommon-and-geographical-variables.pdf2023 [Google Scholar]
  • 48.Byrt T, Bishop J, Carlin JB. Bias, prevalence and kappa. J Clin Epidemiol. 1993;46(5):423–9. doi: 10.1016/0895-4356(93)90018-v [DOI] [PubMed] [Google Scholar]
  • 49.Ahmed N, Barnett P, Greenburgh A, Pemovska T, Stefanidou T, Lyons N, et al. Mental health in Europe during the COVID-19 pandemic: a systematic review. Lancet Psychiatry. 2023;10(7):537–56. doi: 10.1016/S2215-0366(23)00113-X [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Patel V, Fancourt D, Furukawa TA, Kola L. Reimagining the journey to recovery: The COVID-19 pandemic and global mental health. PLoS Med. 2023;20(4):e1004224. doi: 10.1371/journal.pmed.1004224 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Boorse C. The Meaning of Clinical Normality. MRAJ. 2022;10(7). doi: 10.18103/mra.v10i7.2866 [DOI] [Google Scholar]
  • 52.Stein DJ, Palk AC, Kendler KS. What is a mental disorder? An exemplar-focused approach. Psychol Med. 2021;51(6):894–901. doi: 10.1017/S0033291721001185 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Bishop DVM. Fallibility in Science: Responding to Errors in the Work of Oneself and Others. Advances in Methods and Practices in Psychological Science. 2018;1(3):432–8. doi: 10.1177/2515245918776632 [DOI] [Google Scholar]

Decision Letter 0

Sandar Tin Tin

19 Feb 2025

PONE-D-24-54764The UK Biobank Mental Health Enhancement 2022: Methods and ResultsPLOS ONE

Dear Dr. Davis,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Apr 05 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Sandar Tin Tin

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1.Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. Please note that funding information should not appear in any section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript.

4. Thank you for stating the following in the Competing Interests section:

“I have read the journal's policy and the authors of this manuscript have the following competing interests: AJ received a fee to talk at the centenary of the Scottish Action for Mental Health (SAMH). The other authors declare no conflict of interest.”

Please confirm that this does not alter your adherence to all PLOS ONE policies on sharing data and materials, by including the following statement: "This does not alter our adherence to  PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests).  If there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include your updated Competing Interests statement in your cover letter; we will change the online submission form on your behalf.

5. We note that you have indicated that there are restrictions to data sharing for this study. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

Before we proceed with your manuscript, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., a Research Ethics Committee or Institutional Review Board, etc.). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of recommended repositories, please see

https://journals.plos.org/plosone/s/recommended-repositories. You also have the option of uploading the data as Supporting Information files, but we would recommend depositing data directly to a data repository if possible.

We will update your Data Availability statement on your behalf to reflect the information you provide.

6. Please remove your figures from within your manuscript file, leaving only the individual TIFF/EPS image files, uploaded separately. These will be automatically included in the reviewers’ PDF.

7. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Summary of the work: This study reports findings from the UK Biobank’s second mental health questionnaire wave. First, they explain the question changes from MHQ1 to MHQ2. Then they evaluate many characteristics of MHQ2, including comparisons between MHQ1 and MHQ2. This work not only provides a useful understanding of the mental health progression of the UKB participants, but it also provides a useful test-retest evaluation of the lifetime prevalence measures of the UKB.

Overall thoughts: This study is an exciting contribution to the field that will allow many researchers to expand their investigation of mental health in the UK Biobank. Their explanation of the differences in questions, participant overlap, and demographics between MHQ1 and MHQ2 are clear and valuable to the field. The figures are nice and effectively convey the information they intend to. Their analytical evaluation of MHQ2 however could be improved by further methodological explanation and potentially increased statistical rigor. Some analyses are performed with minimal explanation and some comparison results have no stated statistical analyses where the authors may wish to include them. With that said, the comparisons they provided are useful and important for researchers of the field to see. Therefore, I excitedly endorse its acceptance at PLOS ONE after minor revisions.

Minor comments:

1. Methodological clarity would be improved if the sections in the Methods that explain the analyses/comparisons of results were more clearly linked. For example using 1, 2, 3/A, B, C/or identical headers to link the method and its associated result.

2. Many methods, particularly analyses, do not seem to be seem to be explained in detail anywhere in the paper or supplement. This not only often leaves the reader with confusion about what analysis was done but also how to evaluate the results. Relatedly, many tables seem to include statistical quantifications that are not explained in the paper or table legend.

a. For example, please elaborate on how relative risk for depression and anxiety, stated in line 376, were

calculated. The authors state that the relative risk for these disorders are approximately the same, was this

determined using statistical analysis?

b. For example, details of the evaluation of lifetime diagnosis between MHQ1 and MHQ2, and its associated Table

4, are unclear. Particularly, the line “after taking into account onsets after MHQ1 (including unknown onsets),

between 75% to 93% of the test-retest variation in direction ‘Met 2 not 1’ and all in direction ‘Met 1 not 2’

remains unexplained” should be improved for clarity. Please explain how unknown onsets are incorporated (I assume the authors included any unknown onset into ‘onsets after MHQ1’ which seems an odd choice because it is not clear why these unknown onsets would be more likely to occur after MHQ1 as opposed to before). Additionally, how is test-retest variation determined and what does 75%-93% variation mean for this finding? What indicates that this variation is unexplained? Table 4, which relates to this result, could also be improved by further elaboration. What does the “Agree (%)” column refer to? The column “New onset (% Met 2 not 1)” should also explicitly state in the legend that this is the percent of individuals in the Met 2 not 1 group that reported new onset since MHQ1. Please also explain how new onset was determined.

3. The results include many comparisons between quantitative measures, but do not state a statistical difference. For example, starting on line 343, “those with lifetime eating disorders are more likely to have a degree qualification (eating disorder 53%; depression 46%; no mental disorder 44%)” compares the degree qualification of these 3 groups but not does indicate if these groups are statistically different. I recognize that this work is intended more as a descriptor for the new MHQ2, but I would refrain from drawing conclusions about these comparisons without statistical analysis. I recommend either including this statistical analysis or dampening claims about such comparisons throughout the paper.

4. For the methodological explanation starting at line 256, “Age at MHQ2…”, why was age of participants who did not complete MHQ2 included in the average age calculated for MHQ2? I assume that this age calculation is used in the demographics comparison table, which seems problematic as this does not accurately reflect the age of the MHQ2 participants to my mind. Though perhaps I misunderstand the explanation of this methodology, in which case I recommend improving the clarity of writing for this section.

5. I recognized that criteria for lifetime status is unchanged from MHQ1, but it may be of value to restate these criteria in this work.

6. I assume “dx” in Table 3 is short for diagnosis. Please include this abbreviation in the table legend.

Reviewer #2: This manuscript describes the procedure and content of the UK Biobank (UKB) MHQ2 questionnaire, provides a general cohort description, and presents overall comparisons between the two mental health data collections in UKB. The authors report a significant enhancement to the UKB resource by conducting a second mental health data collection in 169,253 individuals, bringing the total number of UKB participants with mental health data from at least one time point to 215,252, and those with data from two time points to 111,275. This represents a major advancement for the UKB mental health phenotypes and is an important contribution to the research field.

The manuscript clearly outlines the rationale and procedural decisions made during the design phase. The authors describe the characteristics of participants across the two waves and the broader UKB cohort, as well as across symptom-based criteria for lifetime disorders. They also examine changes in mental health using ‘current’ assessments and assess the consistency of measures using ‘lifetime’ assessments.

Comments and suggestions

1. It appears that the authors defined participants based on completion of all questionnaire modules. Given the difference between the number of individuals who started (175,266) and those who completed (169,252), 6,014 individuals did not finish the questionnaire. Inability to complete the full questionnaire may be linked to more severe mental health conditions, such as difficulties with concentration, fatigue, or other relevant health issues. The responses provided by these individuals before discontinuation could still be valuable for analyzing MHQ2 data. It would be helpful if the authors could clarify their decision to exclude these individuals in this context.

2. If data on completion times are available, it would be informative to include basic descriptive statistics on the time required to complete the full questionnaire. This would help estimate the time investment expected from participants.

3. Is it possible to determine whether the worst episodes described in detail in MHQ1 and MHQ2 refers to the same episode or two different episodes? This distinction would be useful both for assessing changes in symptoms across different episodes and for evaluating recall inconsistency for symptoms of the same episode. As the questionnaire does not seem to ask about the age at which the worst episode occurred, making this distinction may be challenging. Clarification on this point would be valuable.

4. It is unclear why psychotic experiences were omitted from MHQ2. While balancing questionnaire length and topic coverage is important, multiple assessments of psychotic symptoms could be highly relevant for investigating their links to, for example, cannabis use, medication use, or changes over the second half of the lifespan. Related to this, the wording in lines 502–504 of the Conclusions section may unintentionally suggest that psychotic experiences were included in this data collection. Rewording this sentence for clarity is recommended.

5. There appears to be a minor inconsistency in the reported sample size. The abstract and Tables 2 and 3 state that the number of respondents was 169,253, while the Results section (line 276) and Figure 2 state 169,252. It would be helpful to double-check and ensure consistency across the manuscript.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step.

PLoS One. 2025 May 28;20(5):e0324189. doi: 10.1371/journal.pone.0324189.r003

Author response to Decision Letter 1


1 Apr 2025

Reviewer #1: Summary of the work: This study reports findings from the UK Biobank’s second mental health questionnaire wave. First, they explain the question changes from MHQ1 to MHQ2. Then they evaluate many characteristics of MHQ2, including comparisons between MHQ1 and MHQ2. This work not only provides a useful understanding of the mental health progression of the UKB participants, but it also provides a useful test-retest evaluation of the lifetime prevalence measures of the UKB.

Overall thoughts: This study is an exciting contribution to the field that will allow many researchers to expand their investigation of mental health in the UK Biobank. Their explanation of the differences in questions, participant overlap, and demographics between MHQ1 and MHQ2 are clear and valuable to the field. The figures are nice and effectively convey the information they intend to. Their analytical evaluation of MHQ2 however could be improved by further methodological explanation and potentially increased statistical rigor. Some analyses are performed with minimal explanation and some comparison results have no stated statistical analyses where the authors may wish to include them. With that said, the comparisons they provided are useful and important for researchers of the field to see. Therefore, I excitedly endorse its acceptance at PLOS ONE after minor revisions.

Minor comments:

1. Methodological clarity would be improved if the sections in the Methods that explain the analyses/comparisons of results were more clearly linked. For example using 1, 2, 3/A, B, C/or identical headers to link the method and its associated result.

Response: Thank you for this suggestion. We have now lettered our aims, and used these to structure the analysis section and results. We aim to report (A) who responded to the MHQ1 and MHQ2, (B) the mental health phenotypes and social factors ascertained in MHQ2, (C) changes in the current mental health of the cohort, and (D) the consistency of lifetime phenotypes.

2. Many methods, particularly analyses, do not seem to be seem to be explained in detail anywhere in the paper or supplement. This not only often leaves the reader with confusion about what analysis was done but also how to evaluate the results. Relatedly, many tables seem to include statistical quantifications that are not explained in the paper or table legend.

Response: We have rewritten the analysis section to be sure that it describes those analyses that follow.

a. For example, please elaborate on how relative risk for depression and anxiety, stated in line 376, were calculated.

Response: Thank you for highlighting this. We now state in the analysis section that:

“Changes in Current Mental Health: We describe and visualise the proportion of respondents who were positive for specified current mental disorders in 2022 and in 2016 in those participants who answered in both waves… Relative risk for 2022 compared to 2016 was calculated by using ‘proportion positive’ as the risk of disorder in each cohort and then risk in 2022 by that in 2016. This is done in age-and-sex stratified groups such that those aged, for example, 73-79 years in 2016 are compared with those aged 73-79 in 2022 for the four age groups where possible and also for the cohort overall.”

The authors state that the relative risk for these disorders are approximately the same, was this determined using statistical analysis?

Thank you, the answer is no. We agree that this may have sounded as though we were making a formal comparison where we had not intended to do so. We have clarified in the methods: “Significance testing on large cohorts may identify many small differences between comparators that have very little effect (35), therefore we have emphasised the magnitude of difference rather than calculating p-values.”

We have also rephrased this section for greater clarity:

“Supplementary Table 4 quantifies this a little more by reporting the relative risk of each current mental disorder in 2022 versus 2016 in each sex and age-group stratification and overall. A relative risk of 1 would suggest no change of risk in the group for the current disorder in 2022 compared to 2016 wave, with values above 1 suggesting a higher risk in 2022. The relative risk for every sex and age-group stratified group was above 1, between 1.07 and 2.00, with the latter representing a doubling of the presence of current depression in women aged 73 to 79 in 2022 compared to 2016. Despite age-group stratified relative risks being above 1, the relative risk for the whole cohort are close to 1 for depression (1.07) and anxiety (0.98), suggesting little change between 2016 and 2022, or below 1 for alcohol use disorder (0.84), due to the gradient of fewer cases in older age groups.”

b. For example, details of the evaluation of lifetime diagnosis between MHQ1 and MHQ2, and its associated Table 4, are unclear. Particularly, the line “after taking into account onsets after MHQ1 (including unknown onsets), between 75% to 93% of the test-retest variation in direction ‘Met 2 not 1’ and all in direction ‘Met 1 not 2’ remains unexplained” should be improved for clarity.

Please explain how unknown onsets are incorporated (I assume the authors included any unknown onset into ‘onsets after MHQ1’ which seems an odd choice because it is not clear why these unknown onsets would be more likely to occur after MHQ1 as opposed to before). Additionally, how is test-retest variation determined and what does 75%-93% variation mean for this finding? What indicates that this variation is unexplained? Table 4, which relates to this result, could also be improved by further elaboration. What does the “Agree (%)” column refer to? The column “New onset (% Met 2 not 1)” should also explicitly state in the legend that this is the percent of individuals in the Met 2 not 1 group that reported new onset since MHQ1. Please also explain how new onset was determined.

Response: We agree that our description of onset timing and discrepancy of findings was not clear. Regarding new-onset, we knew that readers would want to know how much of the discrepancy between MHQ1 and MHQ2 were due to this, so we extracted data to distinguish between those that should have been picked up by MHQ1 from those we were less sure or knew could not have been picked up. Although we cannot know the onset when respondents marked ’don’t know’, we included them along with documented recent onset in order to give the proportion that definitely could have been detected at MHQ1 if a true disorder. For clarity, we now only report those figures: of those that were negative in MHQ1 and positive in MHQ2, the proportion that, from their date of onset, also had the disorder at the time of the MHQ1. This meant losing a small section in the discussion that was about the magnitude of new-onset cases that, on reflection, was speculative and not part of our aims.

We have made changes to the analysis description in the methods, results section and table, which we hope explain more of our intentions and findings.

In methods we have written:

“D: Consistency of lifetime phenotypes…. Percentage agreement for each status was calculated as status agreeing in 2016 and 2022 (both positive or both negative) as a proportion of the cohort, and Cohen’s kappa for each status, with kappa being agreement corrected for agreement by chance(38). Where a respondent was negative in 2016 but positive in 2022, we also took into account the date that the respondent gave for onset in MHQ2 (except for clinician diagnosis, where this was not asked). This helps separate these respondents into those whose status could have been picked up in the MHQ1 wave (date of onset from MHQ2 was before 2016), from those who might not have done (either date of onset after start of 2016 or answered ‘don’t know’ to date of onset). We decided to include unknown with the later onsets, despite no information to suggest these are more likely to be late onset, to explain the greatest amount of test-retest variability.”

In results we have written:

“A potential cause of lack of agreement between the questionnaire waves are new-onset cases that started after completing MHQ1. Of those 5,947 who were positive in MHQ2 but not MHQ1 for depression, 4,445 (75%) reported an onset before the MHQ1 wave, so could have been detected then. For self-harm 77%, bipolar 90% and cannabis 93% of those who were negative in MHQ1 could have been detected according to self-reported onset, therefore new-onset explains only a small amount of discordant cases.”

For the table, we have added some additional label rows, so that rather than a column being labelled “Met 1 not 2” it is labelled as ‘Positive’ in the MHQ1 status row, ‘Negative’ in the MHQ2 row, and so on, as we think this will be more intuitive. We have also put the denominator into the cells for clarity. The legend has been changed to: “Comparison of lifetime phenotype status in MHQ1 and MHQ2 in those participants who completed both questionnaires (n=111,275). Percentage is proportion of those participants in each agreement category, except last column. Last column takes into account the date of onset in lack of agreement, percentage is proportion with onset prior to 2016 (approximate date of MHQ1).” And with a footnote “*date of onset (of depression, bipolar, self-harm or cannabis use) was reported in MHQ2 to be before MHQ1 wave, rather than onset that was definitely after MHQ1 wave or was unknown.”

3. The results include many comparisons between quantitative measures, but do not state a statistical difference. For example, starting on line 343, “those with lifetime eating disorders are more likely to have a degree qualification (eating disorder 53%; depression 46%; no mental disorder 44%)” compares the degree qualification of these 3 groups but not does indicate if these groups are statistically different. I recognize that this work is intended more as a descriptor for the new MHQ2, but I would refrain from drawing conclusions about these comparisons without statistical analysis. I recommend either including this statistical analysis or dampening claims about such comparisons throughout the paper.

Response: Thank you. It is true that we wish to be mainly descriptive. Another reason for being reluctant to include significance testing is that the large sample size means that many comparisons will be statistically significant. Therefore, we have stated in the analysis plan: “Significance testing on large cohorts may identify many small differences between comparators that have very little effect (35), therefore we have emphasised the magnitude of difference rather than calculating p-values.” We have made minor changes to the results sections to avoid implying we have made formal comparisons. For the result mentioned above, we have changed this to read:

“The group of respondents that met the criteria for eating disorders and panic disorder appear to have similar characteristics to those respondents with depression and bipolar, although it was remarkable that over half of those with lifetime eating disorders had a degree qualification (53%).”

4. For the methodological explanation starting at line 256, “Age at MHQ2…”, why was age of participants who did not complete MHQ2 included in the average age calculated for MHQ2? I assume that this age calculation is used in the demographics comparison table, which seems problematic as this does not accurately reflect the age of the MHQ2 participants to my mind. Though perhaps I misunderstand the explanation of this methodology, in which case I recommend improving the clarity of writing for this section.

Response: Thank you, we hope we have now provided more clarity on this. For the group that answered the MHQ2, their age is taken as when they completed the questionnaire, and this is what is presented in table 2 in the MHQ2 column. Our dilemma was that most in the full cohort did not complete the questionnaire, so we had to find an equivalent point in order to compare the MHQ2 respondents with the base cohort, which we decided was the median data of completion of the MHQ2. We then applied this to everyone (regardless of whether had completed MHQ2 or not), and that is what is presented in table 2 in the UK Biobank cohort column.

After revisions, in the methods section we say:

“The age variable was handled differently for the three groups. Age for the MHQ2 wave was their age on completion of the MHQ2, and for the MHQ1 wave age at completion of MHQ1. For the UKB cohort as a whole, age represents the age they would have been at the median date of MHQ2 completion, including those who died before this date”

In the footnote to table 2 we say:

“Age in the MHQ2 wave is at date of completion of MHQ2. Age in the MHQ1 wave is at approximate date of completion of MHQ1. Age for full cohort is the age a participant would have been on the median date of MHQ2 completion.”

5. I recognized that criteria for lifetime status is unchanged from MHQ1, but it may be of value to restate these criteria in this work.

Response: Thank you for the suggestion. We have added :

“The criteria for positive status were the same in both questionnaires.”

6. I assume “dx” in Table 3 is short for diagnosis. Please include this abbreviation in the table legend.

Response: Thanks, added.

Reviewer #2: This manuscript describes the procedure and content of the UK Biobank (UKB) MHQ2 questionnaire, provides a general cohort description, and presents overall comparisons between the two mental health data collections in UKB. The authors report a significant enhancement to the UKB resource by conducting a second mental health data collection in 169,253 individuals, bringing the total number of UKB participants with mental health data from at least one time point to 215,252, and those with data from two time points to 111,275. This represents a major advancement for the UKB mental health phenotypes and is an important contribution to the research field.

The manuscript clearly outlines the rationale and procedural decisions made during the design phase. The authors describe the characteristics of participants across the two waves and the broader UKB cohort, as well as across symptom-based criteria for lifetime disorders. They also examine changes in mental health using ‘current’ assessments and assess the consistency of measures using ‘lifetime’ assessments.

Comments and suggestions

1. It appears that the authors defined participants based on completion of all questionnaire modules. Given the difference between the number of individuals who started (175,266) and those who completed (169,252), 6,014 individuals did not finish the questionnaire. Inability to complete the full questionnaire may be linked to more severe mental health conditions, such as difficulties with concentration, fatigue, or other relevant health issues. The responses provided by these individuals before discontinuation could still be valuable for analyzing MHQ2 data. It would be helpful if the authors could clarify their decision to exclude these individuals in this context.

Response: Thank you for this suggestion. We agree there is good reason to think that there may be a systematic bias against the very people who would be cases had they completed the questionnaire. However, in the absence of being able to test this, we are not able to comment on this. It may be helpful to know that the data is available on these people for all of the sections that they managed to fill out, but for the sake of simplicity in this report (different ns for different disorders etc.), we have omitted them.

In the strengths and limitations section we have written: “We can speculate that people with poorer health, including mental health, would be less likely to start a UK Biobank questionnaire, or may have not been able to complete (as around 6,000 people or 3% of those that started).”

2. If data on completion times are available, it would be informative to include basic descriptive statistics on the time required to complete the full questionnaire. This would help estimate the time investment expected from participants.

Response: We see that this would be useful. Unfortunately, we do not have this in any form that would be robust enough to quote. It is po

Attachment

Submitted filename: mhq2_response to reviewers.docx

pone.0324189.s003.docx (32.9KB, docx)

Decision Letter 1

Sandar Tin Tin

22 Apr 2025

The UK Biobank mental health enhancement 2022: methods and results

PONE-D-24-54764R1

Dear Dr. Davis,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager®  and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Sandar Tin Tin

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: No

**********

Acceptance letter

Sandar Tin Tin

PONE-D-24-54764R1

PLOS ONE

Dear Dr. Davis,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Sandar Tin Tin

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. S1 Fig, S1-6 Tables.

    (PDF)

    pone.0324189.s001.pdf (449KB, pdf)
    Attachment

    Submitted filename: mhq2_response to reviewers.docx

    pone.0324189.s003.docx (32.9KB, docx)

    Data Availability Statement

    Data cannot be shared publicly for participant confidentiality, but can be accessed subject to conditions by application to UK Biobank https://www.ukbiobank.ac.uk/enable-your-research. Supporting information for reproducing variables is available https://osf.io/5a8yu/


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES