Skip to main content
BMJ Open logoLink to BMJ Open
. 2026 Feb 17;16(2):e100092. doi: 10.1136/bmjopen-2025-100092

Genetic, lifestyle and environmental influences on health: a Finnish biobank recall study protocol (BioRecall)

Elina Sillanpää 1,2,, Tiina Föhr 1, Elisa Kurtti 3,4, Kirsikka Aittola 5,6, Johanna Mäkelä 7, Tom Southerington 7,8, Timo A Lakka 9,10,11, Tiina Jokela 12,13, Maarit Ahtiainen 13, Eija Laakkonen 1,14, Merja Rantakokko 1,2,14, Suvi Ravi 1,14
PMCID: PMC12918690  PMID: 41702659

Abstract

Abstract

Introduction

Non-communicable diseases are the leading causes of premature mortality worldwide. Both genetic predispositions and environmental exposures affect disease risk. While biobanks have increased understanding of genetic predictors of these diseases, environmental influences are expected to have a greater impact on disease development. Individuals also create their own environments and lifestyles based on genetically regulated preferences, leading to gene–environment interactions that require large datasets to study. Finnish biobanks typically lack sufficient lifestyle and environmental data, which limits their use. We present a protocol for a biobank-recall study (BioRecall) to collect data on lifestyle and environmental exposures and combine these findings with genotypes, biological samples and clinical outcomes.

Methods and analysis

All previously genotyped donors from the Central Finland Biobank who have been diagnosed with type 2 diabetes and have consented to recall will be invited to participate in the pilot study. The preliminary feasibility assessment reveals that there are 1580 suitable candidates. Participants will complete an electronic questionnaire on a secure online platform. The questionnaire includes validated questions on lifestyles, anthropometrics, weight loss history, health, symptoms, work characteristics, emotional states and residential environments. Postcode information will facilitate the addition of spatial environmental data. Genotype and related clinical data will be provided in the study in accordance with the Finnish Biobank Act and combined with questionnaire data.

Ethics and dissemination

The Human Sciences Ethics Committee of the University of Jyväskylä delivered a favourable statement regarding the study protocol (1671/13.00.04.00/2023). Central Finland Biobank approved the research plan (no: BB24-0333-A01). The data collected will be returned to the Central Finland Biobank for research purposes with the participants’ consent. Permission for data usage can then be applied through standard protocols of the Fingenious service (https://site.fingenious.fi/en/). If successful, the study will be expanded to other donors and Finnish biobanks.

Keywords: GENETICS; Diabetes Mellitus, Type 2; EPIDEMIOLOGY; PUBLIC HEALTH; Electronic Health Records


STRENGTHS AND LIMITATIONS OF THIS STUDY.

  • Low-cost, scalable approach: a fully electronic consent and data collection concept with the potential to expand data collection to all genotyped Finnish individuals with biobank consent will be employed.

  • Extensive dataset: the study will form an extensive dataset covering lifestyle and environmental exposures to investigate the prevention of non-communicable diseases. In the pilot phase, the sample size is limited, and only one biobank is included.

  • Automated feedback: as the survey progresses, respondents receive feedback on their own answers, primarily in relation to national lifestyle recommendations. Automated instant feedback may improve response rates. This feature could facilitate low-cost health promotion interventions.

  • Innovative physical activity assessment: novel questions on strength training and balance that complement the WHO’s Global Physical Activity Questionnaire (GPAQ) will enable the assessment of compliance with current physical activity recommendations.

  • Lack of wearable data: The concept does not include data collection using wearables.

Introduction

Non-communicable diseases (NCDs) account for over 70% of deaths worldwide.1 While the risk of these diseases increases with age at the population level, individual risk and onset times vary significantly. Many common NCDs tend to cluster within families,2 3 and recent research has made significant advances in understanding their heritability and individual risk measurement.4,6 This progress is largely due to the development of large datasets from biobanks and advancements in genome-wide genetic analysis and polygenic scoring. Genome-wide risk scores can estimate heritable disease risk, aiding in the development of personalised prevention and treatment strategies.7 8 However, for many diseases, the predictive capability of genetic scores at the individual level is low.9

Genetic predispositions do not explain population-level changes in NCD prevalence, such as the rapid increase of type 2 diabetes (T2D). In Western countries, which are characterised by long life expectancies and high sociodemographic indices, leading health risks include unhealthy lifestyle habits (such as smoking, alcohol consumption, sedentary behaviour and suboptimal dietary patterns), markers of poor cardiometabolic health (such as high body mass index, fasting glucose, cholesterol and blood pressure) and ambient particulate matter pollution.10 These findings underscore the importance of lifestyle and environmental factors alongside biological risk factors in predicting individual risk for NCDs.

However, the current literature likely oversimplifies or underestimates the influence of various important factors due to challenges in measuring exposures and their cumulative impact on NCDs. The reductionist approach (ie, examining exposures mainly as isolated factors or small groups of exposures) may not be optimal for studying population health. In reality, individuals are simultaneously exposed to numerous behavioural and environmental factors that can either increase risk or provide resistance to common NCDs. The concept of the exposome, which encompasses the totality of environmental exposures from conception onwards, has gained considerable interest among researchers.11 The exposome is classified into three categories: (1) the general external environment (climate, urbanisation and socioeconomic factors), (2) the individual environment (lifestyle, environmental pollutants, family relations and work) and (3) the internal environment (metabolic factors, inflammation and ageing).12 Unlike genetic risk scores, environmental risk scores vary with age and over time, depending on exposure accumulation.

Gene–environment interactions constitute another challenge in predicting individual disease risks. Genetic inheritance, which influences health behaviours such as physical activity, can directly affect disease risk regardless of the adopted behaviour,13 suggesting the presence of genetic pleiotropy, in which the same genetic variation affects both health-related behaviour and disease incidence.13 14 Inherited disease risk may also be nonlinearly related to environmental factors. For example, individuals tend to select their environments based on their genetic preferences. The role of heredity in obesity development is greater in obesogenic environments.15 A better identification of lifestyle and environmental risks and a deeper understanding of gene–environment interactions could help identify risk factors at individual and population levels and assist in the development of predictive strategies for public health promotion. However, large datasets that include a broad spectrum of different levels of data are rare. UK Biobank includes broad phenotype data and is widely used in epidemiological research. Finnish biobank datasets, however, lack lifestyle and environmental information.

Here, we present a protocol for the biobank recall study concept (BioRecall), which allows for the collection of comprehensive data on traditional and emerging lifestyle and environmental risk factors through electronic surveys and open-source data. These data will be combined with genome data, biological samples and register-based disease outcomes from the biobank registry. The BioRecall concept, developed in collaboration with the Finnish Biobank Cooperative (FINBB), aims to navigate complex administrative processes not yet customised for clinical or public health recall activities in Finland. The concept also presents practical solutions for crossing interfaces in recontacting, consenting and contracting activities between multiple parties.

Methods and analysis

Study design and setting

BioRecall is a cross-sectional pilot study that will be conducted in Central Finland Biobank, Finland, and at the University of Jyväskylä, Finland, in collaboration with the FINBB and the University of Eastern Finland. This study pilots a recall study procedure (figure 1) in which lifestyle and environmental data are collected and combined with biological and clinical data stored at the biobank registry. Lifestyle data will be collected by an electronic survey. The survey also includes a few questions on the perceived environment. Most of the environmental data will be collected from national open-source registers through a postal code linkage.

Figure 1. Key steps in the process of the biobank recall study. The solid line represents the sequential progression of the process from one stage to the next. The dashed line represents the availability of biobank registry data for researchers and the subsequent incorporation of collected data into the biobank registry. FINBB, Finnish Biobank Cooperative; ICD-10, International Classification of Diseases, 10th Revision.

Figure 1

Eligibility criteria

The target group of this pilot study will consist of individuals diagnosed with T2D, which is one of the most common NCDs worldwide and strongly linked to lifestyle factors. The inclusion criteria for the invited participants are as follows: have a diagnosis of T2D (International Classification of Diseases, 10th Revision codes E11, E11.0, E11.1, E11.2, E11.3, E11.4, E11.5, E11.6, E11.7, E11.8 or E11.9), included in the Central Finland Biobank registry, have available genome-wide DNA polymorphic data and provided consent to be recontacted for research purposes. All individuals in the Central Finland Biobank are adults (≥18 years old); hence, we do not apply any age restrictions. According to a feasibility assessment conducted by the Central Finland Biobank, 1580 participants are eligible for participation in this study.

Recruitment

Researchers submit a data request through the Fingenious service for genotyped participants with T2D included in the Central Finland Biobank registry. The Fingenious service is a digital platform that provides researchers with access to Finnish biobank samples, data and participant recruitment service. The Fingenious service is provided by the FINBB.

Based on this request, the Central Finland Biobank will establish a research cohort for this recall study. The biobank will create pseudonymised IDs for all invited participants prior to the initiation of the study. These pseudonymised IDs will allow the Central Finland Biobank to link the collected survey data to the biobank’s registry data, ensuring that directly identifiable information, such as personal identification numbers or contact information, need not be collected via the survey form. Hence, the data are pseudonymised from the beginning, ensuring that participants’ identities are not directly accessible to the researchers.

The Central Finland Biobank registry contains the names and postal addresses of individuals. Before sending the invitations, the biobank will verify that the invitees are alive and have consented to be recalled. The invitations will primarily be sent electronically to those who have consented to electronic communication. This will be done via the online healthcare portal OmaHyvis, which is owned by the Wellbeing Services County of Central Finland, which is also a co-owner of the biobank. In cases involving missing consent for electronic contact in OmaHyvis, invitations will be sent by mail. At this stage of the work, the researchers will not be privy to the identities of the individuals to whom the information is sent. The invitation will include the study information sheet and the biobank opt-out form. After the invitations are sent, the responsibility for the study lies with the researchers; for example, all participant inquiries related to the study will be directed to the researchers. One month after the research invitation, a reminder will be sent to those who have not responded at all or who have responded only partially to the electronic survey.

Survey implementation

Participants are instructed to log in to MyBiobank (https://omabiopankki.fingenious.fi) on the information sheet. MyBiobank is a portal for Finnish biobanks operated by FINBB. The portal allows study participants to securely log in, identify themselves, read the privacy notice, provide consent and access the study survey tools.16 Authentication is guaranteed through Suomi.fi, a secure identification system available to Finnish residents. MyBiobank links a participant to the electronic survey study, which is implemented using the REDCap platform. REDCap is a secure web-based tool for collecting sensitive data, as the collected data is stored on the University of Jyväskylä’s own secure server. The questionnaire takes approximately 30–40 min to complete. As the respondent progresses through the survey, he/she will receive immediate feedback on his/her answers, primarily in relation to Finnish national lifestyle recommendations.

Informed consent

Electronic consent for participation in the study will be collected using the REDCap software before the survey can be accessed by the participants. Participants will proceed with the survey only after accepting their consent via the form. Additionally, participants must consent to the combination of survey data with biobank registry data, as well as to the archiving of the collected survey data in the Central Finland Biobank, before they can participate. This allows the data to be made available for future research approved by the biobank.

Participant coverage analysis

Central Finland Biobank will obtain information regarding individuals who are invited but choose not to participate in the study. The biobank will then form a pseudonymised patient cohort from these individuals. This will allow for an examination of how the groups of participants and non-participants differ in terms of, for example, age, gender and place of residence.

Measures

Physical activity

The Global Physical Activity Questionnaire (GPAQ)17 will be used to assess physical activity participation at work, travelling and recreational activities, and sedentary behaviour. The GPAQ was developed by the WHO, and it is a valid measure for assessing moderate-to-vigorous physical activity.18

In addition to the GPAQ, we supplement the physical activity questions with additional questions to assess the respondents’ participation in muscle-strengthening and balance exercises. This allows assessment of the implementation of the current physical activity recommendations.19 Complementary questions were developed by an expert group that included sports science academics. Engagement in muscle-strengthening activities will be assessed using three items: ‘Do you engage in muscle-strengthening activities during your leisure time (for example, do you go to the gym or outdoor fitness parks, or do kettlebell training, fitness classes, Bodypump, CrossFit or Pilates)?’, ‘How many days a week do you typically engage in muscle-strengthening activity?’ and ‘What type of muscle-strengthening exercises do you typically perform? Select all the options you engage in regularly (alternatives: gym/outdoor fitness park workouts, kettlebell training, circuit training/fitness class, Pilates, BodyPump, CrossFit, other, what?)’. Furthermore, adherence to the strength training recommendations set by the American College of Sports Medicine20 will be assessed by asking, ‘Do you engage in strength training exercises targeting the lower, middle, and upper body at least twice a week, using a resistance that allows you to perform each movement a maximum of 12 times per set?’. Additionally, engagement in balance training will be assessed using two items: ‘Do you engage in balance training during your leisure time (such as dancing, yoga, tai chi, skating, ball games, cross-country skiing, or stand-up paddleboarding)?’ and ‘How many days a week do you typically engage in balance training?’.

Finally, life-long physical activity history will be assessed by asking the participants to describe their physical activity level at all applicable ages from the following options: 7–12, 13–16, 17–19, 20–29, 30–39, 40–49, 50–59, 60–69, 70–79 and 80 or older. The response options are as follows: (1) no exercise, (2) regular independent leisure-time physical activity, (3) regular other supervised physical activity in a sport club, etc and (4) regular competitive sport and training related to said sport. Briefly, regular independent leisure-time physical activity was defined as regularly walking or cycling to school/work, exercising or doing daily activities that cause sweating and are not organised by a school, sports club, fitness centre, etc. Regular competitive sport and related training were defined as goal-oriented sports within a club, including competing and training, while supervised physical activity was defined as regular non-competitive activities organised by a club, fitness centre or similar.21

Diet and alcohol consumption

Diet quality will be measured using the Healthy Diet Index (HDI),22 a dietary screener based on the D2D Food Intake Questionnaire23 with slight updates. This questionnaire evaluates key food groups on both a daily and weekly basis: meals and snacks, fast food, dishes made with vegetables and legumes, fish, poultry, meat and sausages, fat and cream quality used in cooking, spreads and salad dressings, fruits, vegetables and berries, nuts and seeds, dairy products, grain products, processed meat products, sweets, pastries, desserts, sugary drinks and alcoholic beverages.

The HDI assesses overall diet quality on a scale from 0 to 100 points, with higher scores indicating better adherence to Finnish nutrition recommendations.22 24 It also evaluates seven specific components of the diet weighted by their relative importance to an overall healthy diet: meal pattern: 10 points; grains: 20 points; fruits and vegetables: 20 points; fats: 15 points; dairy: 10 points; fish and meat: 10 points; and snacks and treats including beverages: 15 points.

Additionally, special diets, food allergies and the use of vitamin D and calcium supplements that may influence diet quality and overall nutrient adequacy are assessed. Finally, to complement the weekly alcohol consumption data collected as part of the HDI, respondents are asked to report on the frequency of their alcohol consumption using the following response options: daily, several times a week, once a week, several times a month, approximately once a month or more rarely, or not at all.

Anthropometrics and weight loss history

Participants are to report their current weight (or their pre-pregnancy weight if currently pregnant or recently given birth) in kilogrammes and height in centimetres. They will also measure their waist circumference according to the provided instructions. In addition, participants will report on the number of times they have lost at least 5 kg of weight and whether a healthcare professional has recommended that they lose weight. In the case of a positive response to the latter question, follow-up questions will be used to examine the settings as well as perceived quality and quantity of the support provided by healthcare professionals.

Smoking and substance use

Participants will report their smoking habits using questions adapted from the Finnish Twin Cohort study.25 Additionally, information will be collected on the use of snus, nicotine pouches, cigars, cigarillos, pipes, cannabis and other intoxicating substances.

Medical history and current health status

Participants will be presented with a list of medical diagnoses and asked to indicate any that have been confirmed by a doctor. Additionally, they will report their current medications. Finally, participants will evaluate their subjective experiences of their health, physical fitness and functional performance compared with people of similar ages.

Questions for women

Questions regarding menstrual history, parity and the use of hormonal contraceptives or hormone replacement therapy will be directed to those who report being women. The age at menarche and whether secondary amenorrhea, which is defined as the absence of menses for at least 3 months for reasons other than pregnancy or hormonal contraceptive use,26 has ever occurred will be self-reported by the participants. They will also provide information on their menopausal age (if applicable), specifying whether menopause occurred spontaneously or as a result of medical intervention (eg, through oophorectomy, radiation therapy or chemotherapy). Additionally, participants will report on how many pregnancies they had and when they had them. Finally, details about their current use of hormonal contraceptives or hormone replacement therapy will be obtained.

Work characteristics

Participants who have been employed or are currently employed will answer work-related questions. These questions will be answered based on participants’ most recent employment. Participants will be asked several questions related to their work characteristics, including their job titles. The number of hours per week engaged in paid employment will be investigated. Information on the work schedule will be obtained by assessing the participant’s work type: fixed days, fixed nights, rotating shift with a two-shift pattern without night shift, rotating shift with a two-shift pattern with night shift, and rotating shift with a three-shift pattern.25

Occupational physical loading will be assessed using a single item: ‘What kind of work do you do?’. The response alternatives will be: (1) mainly sedentary work, which requires very little physical activity; (2) work that involves standing and walking, but no other physical activity; (3) work that, in addition to standing and walking, requires lifting and carrying and (4) heavy physical work. Participants will select only one category to describe their work.25

Self-rated work ability will be measured using a single item: ‘Let’s assume that your work ability at its all-time best would be given 10 points, and 0 points would indicate that you are completely unable to work. How would you score your current work ability?’. This single-question measure is based on the first item of the Work Ability Index’s (WAI) 7-item questionnaire,27 which is also referred to as the Work Ability Score (WAS). WAS has been shown to have a similar validity to WAI while being a simpler alternative.28

Work stress will be assessed by measuring effort–reward imbalance (ERI). ERI will be determined by four questions from the 10-item scale by Siegrist.29,31 Participants will respond to a 5-point Likert scale (1=very little, 5=very much) to one question concerning effort and three questions concerning reward. ERI will be calculated by dividing the effort score by the mean score of reward variables.

Work engagement will be assessed using three items from the Utrecht Work Engagement Scale (UWES)32: ‘At my work, I feel full of energy’ (vigour), ‘I am enthusiastic about my work’ (dedication) and ‘Time flies when I am working’ (absorption). Participants will respond on a 5-point Likert scale (1=always, 5=never).

Emotional states

Positive and negative affectivity will be assessed using the Internationally Reliable Short form of the Positive and Negative Affect Schedule, which contains 10 adjectives (five for positive affect and five for negative affect): ‘Think about yourself and how you normally feel. How do you usually feel? Choose the most appropriate option’. Participants will respond to each item on a 5-point Likert scale (1=does not describe me at all, 5=describes me very well).33

Sleep characteristics

The Jenkins Sleep Scale34 will be used to assess sleep quality. This scale consists of four items that evaluate sleep problems during the last month: trouble falling asleep, waking up several times during the night, trouble staying asleep and waking up feeling tired. Response options for each item are on a 6-point scale from ‘not at all’ (0 points) to ‘22–31 days’ (5 points).34 Information on the duration of nightly sleep and potential naps will also be gathered.

Residential environment

Factors related to the local environment will be assessed using questions modified from the Health 2011 Survey carried out in Finland.35 36 These questions examine both the availability of walking and cycling paths near the home and access to certain daily services, such as grocery stores, supermarkets, libraries, post offices, sports facilities and green spaces. In addition, the residential area will be defined based on the postal code reported in the questionnaire. Environmental exposures at the postal code area level will be curated from different sources using both open-source statistical and geospatial data and data received through collaboration in exposome networks. The annual average air quality and pollution exposures of the closest observation station will be retrieved from the Finnish Meteorological Institute. The National Land Survey of Finland offers information on natural features, such as forest and agricultural areas, rivers, lakes and built environments (eg, roads and buildings). Access to green and blue spaces and their respective quality can be estimated by, for example, tree coverage, vegetation and biodiversity indices. The Finnish Environment Institute offers information on land cover, land use and urban zones. These data are openly available for Central Finland from the European Environmental Agency (Europe-wide Urban Atlas) and USGS Landsat or Copernicus Open Access Hub (Sentinel-2 Data). Socioeconomic characteristics (eg, demographics, education, income, employment, housing, crime and social inequality, political orientation and voting behaviour in the postal code area) will be obtained from Statistics Finland.

Background characteristics

The questionnaire will cover questions about age, sex, the economic well-being of the household, education, occupational group and employment status.

Participant incentivisation and automatic feedback

We have developed an automatic feedback function for the REDCap software that provides participants with feedback on their personal lifestyle habits in relation to national lifestyle recommendations for health promotion and work stress. This was done to make the questionnaire more engaging for participants and to motivate invited individuals to complete the questionnaire, thus potentially increasing the response rate. Feedback will be provided from the following parts of the questionnaire: physical activity, dietary habits, waist circumference, work stress and sleep.

The feedback on physical activity summarises the participant’s weekly amount of exercise, categorised into moderate and vigorous activity as well as strength and balance training. Additionally, the feedback provides information on recommended levels of physical activity based on current guidelines.19

The feedback on dietary habits uses HDI scoring and provides automated feedback based on nutrition recommendations to support a health-promoting diet.24 37 This feedback system has been refined through prior research projects.38 39

Participants are informed about healthy waist circumference and its risk thresholds in relation to their reported waist circumference. To supplement data on their reported sleep duration, the participants are informed about the recommended sleep duration and the potential risks associated with sleep deprivation.

The feedback on work stress is based on ERI.29,31 The participants receive an individual-level ERI ratio score (the ratio of the self-assessed effort score and the mean of the self-assessed reward scores) and an interpretation of their score. The ratio scores of ERI range from 0.2 to 5.0. An imbalance between work efforts and rewards is present when the ratio score of ERI≠1. The ratio score of ERI<1 indicates an imbalance in favour of rewards, while a ratio score >1 indicates an imbalance in favour of effort.40

Schedule

Data collection will be completed between March and May in 2025. Database curation will be done during the fall of 2025 and data will be ready for analysis in December 2025. If the protocol is successful, data collection will be extended to all donors of the Central Finland Biobank in the autumn of 2026.

Sample size calculation and statistical analysis plan

Since the primary focus of the project is on piloting the operational model, no formal sample size calculation has been performed, and no detailed statistical analysis plan has been developed. Based on our previous experience in recruiting participants within the same study area and in other population-based studies, we expected to have a response rate of 30%, which would yield 474 participants (out of 1580 participants eligible for the study). This population would provide an adequate sample to conduct statistical analysis in terms of several health-related outcomes.

Ethics and dissemination

The Human Sciences Ethics Committee of the University of Jyväskylä provided its favourable consent to the study protocol (1671/13.00.04.00/2023). Central Finland Biobank approved the study (BB24-0333-A01). The project metadata will be published on the Jyväskylä University Digital Repository ‘JYX’. Data will be archived in the Central Finland Biobank, where it will be made available for researchers through the Fingenious service (https://site.fingenious.fi/en/). Data will be used in scientific articles and several Master’s theses. We will follow Plan S and publish in open-access journals and/or parallel-publish in the university repository ‘JYX’, where theses will also be published.

Discussion

One of the greatest challenges in public health promotion is recognising individuals at risk for developing diseases. This would help direct personalised preventative actions before disease manifestation. This ambitious and important goal cannot be achieved without proper datasets. Large-scale biobank datasets include data from hundreds of thousands of genotyped individuals along with register-based disease endpoints and biological samples. Complementing biobank data with lifestyle and environmental risk factors using the biobank recalls would offer possibilities for studying the complex interactions between genetic predispositions, lifestyle choices and environmental exposures. Customised clinical recall studies for Finnish biobank donors have recently been piloted as part of a large public partner project, the FinnGen.16 41 42 These pilot studies, which focused on neurocognitive diseases,41 42 revealed that the recall concept is feasible but also highlighted the need for developing a more centralised recall procedure for Finnish biobanks,16 testing recall concepts with different subgroups and considering the provision of concrete rewards or personalised health insights to enhance participation rates.16 The BioRecall study concept is an attempt to address this need.

The BioRecall concept presented in this study offers a cost-effective and fully electronic approach to data collection, which is crucial for large-scale epidemiological research. This pilot is conducted in the Central Finland Biobank with genotyped individuals having T2D. The concept has the potential to expand data collection to all genotyped Finnish individuals with biobank consent (10% of the population). Another primary strength of the BioRecall concept is its versatility in collecting comprehensive data on genetic information, lifestyles, environmental exposures and clinical outcomes. This holistic exposome approach addresses the current challenges in understanding the multifactorial nature of NCDs. It is expected that larger datasets with novel machine-learning tools will help identify key risk factors and protective factors at both the individual and population levels.

The biobank recall concept presented here may allow longitudinal monitoring of participants in a cost-effective manner. This capability is essential for tracking changes in health behaviours and outcomes over time and providing valuable data for the prevention and management of NCDs. Collaboration with the well-being services counties can facilitate the further development of this concept, ensuring that the data collected is used to improve public health strategies and outcomes. This cooperation can lead to the development of more targeted and effective health policies and interventions. The potential for health promotion interventions through feedback activities further enhances the utility of this approach, offering opportunities for personalised health recommendations and interventions. The feedback tool is particularly advantageous, as it can address the common issue of low response rates in lifestyle surveys.16 Automated personalised feedback mechanisms are expected to improve participant engagement and response rates, representing a significant advancement over traditional survey methods. The return of personal health risk information may also be used to promote lifestyle changes in participants.

Several challenges must be considered. Based on prior evidence among Finnish participants, postal contact may yield better response rates than contacting via a healthcare portal and reminders may be needed.16 However, a similar concept with automatic feedback among the current patient group has not been tested before. Comprehensive data can be obtained using a broad questionnaire. On the other hand, the broad scope of the questionnaire may result in low adherence due to the time required to complete it. The response rates in the piloting phase will indicate how to successfully balance these challenges. An important limitation of this study is that in this pilot phase, the sample size is limited and only one regional biobank is included. However, all other software, except OmaHyvis, which is used for the first contact, is expected to be available at the national level, and all well-being services counties have a system allowing electronic contact for their residents. In addition, self-reported lifestyle data, particularly physical activity, is susceptible to multiple sources of bias. We attempted to minimise these biases by using validated questionnaires whenever possible; however, the potential for error cannot be entirely eliminated.

In conclusion, the biobank recall concept represents a promising and innovative approach to epidemiological research. Its ability to integrate versatile data types, respond to current survey challenges and provide cost-effective longitudinal monitoring positions it as a valuable tool for advancing our understanding of NCDs and improving public health.

Acknowledgements

We thank all biobank donors for their valuable contributions to science.

Footnotes

Funding: This project is supported by the JYU.Well strategic funding was provided by the University of Jyväskylä (to ES), and research infrastructure funding was obtained from the Research Council of Finland (grant number: 358894 to ES) as part of the BBMRI.fi consortium. ES is supported by the Research Council of Finland (grant numbers: 341750, 346509 and 361981), the Juho Vainio Foundation and the Päivikki and Sakari Sohlberg Foundation. SR and TF are supported by the Juho Vainio Foundation. MR is supported by the Research Council of Finland (grant number: 330185). KA is partially supported by the European Regional Development Fund through the Regional Council of Northern Savo (No. A80407) for the development of FOODNUTRI infrastructure and by the Joint Action on Cardiovascular Diseases and Diabetes (JACARDI), which is funded by the EU4Health Programme (Grant Agreement No. 101126953). The funder did not influence the results/outcomes of the study despite the authors affiliations with the funder.

Prepublication history for this paper is available online. To view these files, please visit the journal online (https://doi.org/10.1136/bmjopen-2025-100092).

Patient consent for publication: Not applicable.

Provenance and peer review: Not commissioned; externally peer reviewed.

Patient and public involvement: Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

References

  • 1.World Health Organization Noncommunicable diseases. 2024. https://www.who.int/health-topics/noncommunicable-diseases Available.
  • 2.Zdravkovic S, Wienke A, Pedersen NL, et al. Heritability of death from coronary heart disease: a 36-year follow-up of 20 966 Swedish twins. J Intern Med. 2002;252:247–54. doi: 10.1046/j.1365-2796.2002.01029.x. [DOI] [PubMed] [Google Scholar]
  • 3.Marenberg ME, Risch N, Berkman LF, et al. Genetic Susceptibility to Death from Coronary Heart Disease in a Study of Twins. N Engl J Med . 1994;330:1041–6. doi: 10.1056/NEJM199404143301503. [DOI] [PubMed] [Google Scholar]
  • 4.Khera AV, Chaffin M, Aragam KG, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet. 2018;50:1219–24. doi: 10.1038/s41588-018-0183-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.King A, Wu L, Deng H-W, et al. Polygenic risk score improves the accuracy of a clinical risk score for coronary artery disease. BMC Med. 2022;20:385. doi: 10.1186/s12916-022-02583-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Mars N, Koskela JT, Ripatti P, et al. Polygenic and clinical risk scores and their impact on age at onset and prediction of cardiometabolic diseases and common cancers. Nat Med. 2020;26:549–57. doi: 10.1038/s41591-020-0800-0. [DOI] [PubMed] [Google Scholar]
  • 7.Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nat Rev Genet. 2018;19:581–90. doi: 10.1038/s41576-018-0018-x. [DOI] [PubMed] [Google Scholar]
  • 8.Lankinen MA, Nuotio P, Kauppinen S, et al. Effects of Genetic Risk on Incident Type 2 Diabetes and Glycemia: The T2D-GENE Lifestyle Intervention Trial. The Journal of Clinical Endocrinology & Metabolism. 2024;110:130–8. doi: 10.1210/clinem/dgae422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lewis CM, Vassos E. Polygenic risk scores: from research tools to clinical instruments. Genome Med. 2020;12:44. doi: 10.1186/s13073-020-00742-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Risk Factors Collaborators Global, regional, and national comparative risk assessment of 84 behavioural, environmental and occupational, and metabolic risks or clusters of risks, 1990-2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet. 2016;390:1345–422. doi: 10.1016/S0140-6736(17)32366-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Vermeulen R, Schymanski EL, Barabási A-L, et al. The exposome and health: Where chemistry meets biology. Science. 2020;367:392–6. doi: 10.1126/science.aay3164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Siroux V, Agier L, Slama R. The exposome concept: a challenge and a potential driver for environmental health research. Eur Respir Rev. 2016;25:124–9. doi: 10.1183/16000617.0034-2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Tynkkynen NP, Törmäkangas T, Palviainen T, et al. Associations of polygenic inheritance of physical activity with aerobic fitness, cardiometabolic risk factors and diseases: the HUNT study. Eur J Epidemiol. 2023;38:995–1008. doi: 10.1007/s10654-023-01029-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Sillanpää E, Palviainen T, Ripatti S, et al. Polygenic Score for Physical Activity Is Associated with Multiple Common Diseases. Med Sci Sports Exerc. 2022;54:280–7. doi: 10.1249/MSS.0000000000002788. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Jackson SE, Llewellyn CH, Smith L. The obesity epidemic - Nature via nurture: A narrative review of high-income countries. SAGE Open Med. 2020;8 doi: 10.1177/2050312120918265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rodosthenous RS, Niemi MEK, Kallio L, et al. Recontacting biobank participants to collect lifestyle, behavioural and cognitive information via online questionnaires: lessons from a pilot study within FinnGen. BMJ Open. 2022;12:e064695. doi: 10.1136/bmjopen-2022-064695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.World Health Organization Global physical activity questionnaire (GPQA) 2021. https://www.who.int/publications/m/item/global-physical-activity-questionnaire Available.
  • 18.Cleland CL, Hunter RF, Kee F, et al. Validity of the global physical activity questionnaire (GPAQ) in assessing levels and change in moderate-vigorous physical activity and sedentary behaviour. BMC Public Health. 2014;14:1255. doi: 10.1186/1471-2458-14-1255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bull FC, Al-Ansari SS, Biddle S, et al. World Health Organization 2020 guidelines on physical activity and sedentary behaviour. Br J Sports Med. 2020;54:1451–62. doi: 10.1136/bjsports-2020-102955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Haskell WL, Lee I-M, Pate RR, et al. Physical activity and public health: updated recommendation for adults from the American College of Sports Medicine and the American Heart Association. Med Sci Sports Exerc. 2007;39:1423–34. doi: 10.1249/mss.0b013e3180616b27. [DOI] [PubMed] [Google Scholar]
  • 21.Kovanen V, Aukee P, Kokko K, et al. Design and protocol of Estrogenic Regulation of Muscle Apoptosis (ERMA) study with 47 to 55-year-old women’s cohort: novel results show menopause-related differences in blood count. Menopause. 2018;25:1020–32. doi: 10.1097/GME.0000000000001117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lindström J, Aittola K, Pölönen A, et al. Formation and Validation of the Healthy Diet Index (HDI) for Evaluation of Diet Quality in Healthcare. Int J Environ Res Public Health. 2021;18:2362. doi: 10.3390/ijerph18052362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hemiö K, Pölönen A, Ahonen K, et al. A simple tool for diet evaluation in primary health care: validation of a 16-item food intake questionnaire. Int J Environ Res Public Health. 2014;11:2683–97. doi: 10.3390/ijerph110302683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Ruokavirasto Ravitsemussuositukset_2014. [17-Dec-2024]. https://www.ruokavirasto.fi/globalassets/teemat/terveytta-edistava-ruokavalio/kuluttaja-ja-ammattilaismateriaali/julkaisut/ravitsemussuositukset_2014_fi_web_versio_5.pdf Available. Accessed.
  • 25.Kaprio J. The Finnish Twin Cohort Study: an update. Twin Res Hum Genet. 2013;16:157–62. doi: 10.1017/thg.2012.142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Melin A, Tornberg AB, Skouby S, et al. The LEAF questionnaire: a screening tool for the identification of female athletes at risk for the female athlete triad. Br J Sports Med. 2014;48:540–5. doi: 10.1136/bjsports-2013-093240. [DOI] [PubMed] [Google Scholar]
  • 27.Ilmarinen J, Tuomi K, Klockars M. Changes in the work ability of active employees over an 11-year period. Scand J Work Environ Health. 1997;23 Suppl 1:49–57. [PubMed] [Google Scholar]
  • 28.El Fassi M, Bocquet V, Majery N, et al. Work ability assessment in a worker population: comparison and determinants of Work Ability Index and Work Ability score. BMC Public Health. 2013;13:305. doi: 10.1186/1471-2458-13-305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Siegrist J. Adverse health effects of high-effort/low-reward conditions. J Occup Health Psychol. 1996;1:27–41. doi: 10.1037//1076-8998.1.1.27. [DOI] [PubMed] [Google Scholar]
  • 30.Siegrist J, Wege N, Pühlhofer F, et al. A short generic measure of work stress in the era of globalization: effort-reward imbalance. Int Arch Occup Environ Health. 2009;82:1005–13. doi: 10.1007/s00420-008-0384-3. [DOI] [PubMed] [Google Scholar]
  • 31.Juvani A, Oksanen T, Salo P, et al. Effort-reward imbalance as a risk factor for disability pension: the Finnish Public Sector Study. Scand J Work Environ Health. 2014;40:266–77.:3402. doi: 10.5271/sjweh.3402. [DOI] [PubMed] [Google Scholar]
  • 32.Schaufeli WB, Bakker AB, Salanova M. The measurement of work engagement with a short questionnaire: a cross-national study. 2006. https://journals.sagepub.com/doi/abs/10.1177/0013164405282471 Available.
  • 33.Thompson ER. Development and Validation of an Internationally Reliable Short-Form of the Positive and Negative Affect Schedule (PANAS) J Cross Cult Psychol. 2007;38:227–42. doi: 10.1177/0022022106297301. [DOI] [Google Scholar]
  • 34.Jenkins CD, Stanton BA, Niemcryk SJ, et al. A scale for the estimation of sleep problems in clinical research. J Clin Epidemiol. 1988;41:313–21. doi: 10.1016/0895-4356(88)90138-2. [DOI] [PubMed] [Google Scholar]
  • 35.Finnish institute for health and welfare (THL), finland. health 2011 survey - methods. 2016. [18-Dec-2024]. https://www.julkari.fi/handle/10024/130780 Available. Accessed.
  • 36.Finnish institute for health and welfare (thl), finland. health 2011 -forms. 2011. [18-Dec-2024]. https://thl.fi/en/research-and-development/research-and-projects/health-2000-2011/forms/health-2011-forms Available. Accessed.
  • 37.Rune B, Rikke A, Kristoffer AE, et al. Nordic nutrition recommendations. integrating environmental aspects. Nordic Council of Ministers. 2023:388. [Google Scholar]
  • 38.Itä-Suomen Biopankki; 2024. [18-Dec-2024]. Nutrition, lifestyle factors, health and genes study.https://ita-suomenbiopankki.fi/tutkimus/nutrition-lifestyle-factors-health-and-genes-study/ Available. Accessed. [Google Scholar]
  • 39.Kuopiossa kehitetään ravitsemustieteen tutkimusinfrastuktuuria | itä-suomen yliopisto. [18-Dec-2024]. https://www.uef.fi/fi/artikkeli/kuopiossa-kehitetaan-ravitsemustieteen-tutkimusinfrastuktuuria Available. Accessed.
  • 40.Stanhope J. Effort–Reward Imbalance Questionnaire. Occup Med. 2017;67:314–5. doi: 10.1093/occmed/kqx023. [DOI] [Google Scholar]
  • 41.Julkunen V, Schwarz C, Kalapudas J, et al. A FinnGen pilot clinical recall study for Alzheimer’s disease. Sci Rep. 2023;13:12641. doi: 10.1038/s41598-023-39835-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Vuoksimaa E, Saari TT, Aaltonen A, et al. TWINGEN: protocol for an observational clinical biobank recall and biomarker cohort study to identify Finnish individuals with high risk of Alzheimer’s disease. BMJ Open. 2024;14:e081947. doi: 10.1136/bmjopen-2023-081947. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from BMJ Open are provided here courtesy of BMJ Publishing Group

RESOURCES