Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2026 Mar 7.
Published before final editing as: Biopreserv Biobank. 2026 Feb 28:19475535261422295. doi: 10.1177/19475535261422295

Current assessment of existing biospecimen use in population-based cancer research

Rachel Hanisch 1,2, Lisa Gallicchio 1, Danielle Daee 1, Somdat Mahabir 1, Stefanie Nelson 1, Danielle M Carrick 1
PMCID: PMC12965398  NIHMSID: NIHMS2151168  PMID: 41761706

Abstract

INTRODUCTION:

The use of biospecimens is an important part of conducting population-based research as they allow the linkage of biological information to other important clinical, social, and environmental factors, providing a more robust understanding of cancer prevention, treatment, and care options. It can be costly and labor-intensive to collect and process these biospecimens, making the use of pre-existing, banked biospecimens an appealing option for researchers.

OBJECTIVE:

This study examines the use of existing biospecimens in National Cancer Institute (NCI)-funded population-based cancer control research grants managed by the Division of Cancer Control and Population Sciences (DCCPS) as of January 2024.

METHODS:

A total of 104 grants managed by NCI’s DCCPS that involved the use of existing biospecimens were included in this analysis. Information that was abstracted from the grants included cancer type, biospecimen type, study design (intervention or observational), cancer continuum category (risk/etiology or survivorship), and named biospecimen resource.

RESULTS:

The most commonly used biospecimens were blood products (64.4%), DNA (59.6%), tissue (53.8%), and RNA (26.9%). Risk-focused studies mainly used blood products, while survivorship studies favored tissue and RNA. There was also a notable difference in biospecimen use between studies of common vs. rare cancers, with rare cancer studies using tissue and RNA samples less frequently than studies of common cancers.

CONCLUSION:

The variety of biospecimen types being used to examine a breadth of hypotheses related to cancer risk and survivorship emphasize the value of biospecimen resources across the cancer continuum.

Introduction

The use of biospecimens is an important part of conducting population-based research as they allow the linkage of biological information to other important clinical, lifestyle, social, environmental, and molecular factors, providing a more robust understanding of cancer prevention, treatment, and care options. Biospecimens can also enable the discovery and validation of key biomarkers, increase our understanding of disease mechanisms, and contribute to improving treatment outcomes in personalized medicine. Biospecimen analysis techniques are rapidly advancing (1,2), further opening the possibilities of their use to elucidate knowledge gaps in cancer research. In light of these potential applications, biospecimens are often collected as part of population-based studies with two functions in mind: to address current investigator-initiated (prospective) research specific aims and to bank additional supplies for future, to-be-determined use. It can be costly and labor-intensive to collect and process the large number of biospecimens that are often needed in population-based research, making the use of pre-existing, banked biospecimens an appealing option for researchers.

In 2014, Carrick et al. (3) published an analysis of National Cancer Institute (NCI)-funded grants active as of July 2012 and managed by the Division of Cancer Control and Population Sciences (DCCPS) to investigate the use of biological specimens in population-based cancer control research. In that article, the authors reported that 19.4% of the grants funded by NCI and managed by DCCPS utilized pre-existing biospecimens (and did not collect new biospecimens) with DNA, tissue, and serum/plasma being the most commonly utilized biospecimen types among these grants. Through that analysis, several areas of improvement were identified to facilitate population-based cancer research with cost- and time-effective approaches to biospecimen use in mind. This included expanding the use of biospecimens in behavioral research studies, increasing collecting of rarer specimen and cancer types, conducting validation studies of biospecimen collections to ensure stored samples are suitable, and continuing to foster sharing of biospecimens.

Since then, bioinformatic and technological advances have empowered researchers to ask novel questions and permitted the use of a wider variety of biospecimen types, enabling a more comprehensive understanding of how molecular, clinical and environmental factors interact in cancer risk and outcomes. For example, spatial-omics technologies have been developed enabling analysis of tumor heterogeneity, profiles, evolution, and microenvironment (1). Additionally, integration of tumor immunology with germline genetics and exposures has facilitated integrated analyses of peripheral blood circulating cells, cell-free plasma, tissue, feces, and urine in research aimed at better understanding strategies for cancer prevention and management (2).

The goals of this analysis were to 1) describe the current use of existing biospecimens in NCI-funded grants active as of January 2024 and managed by DCCPS, NCI, and 2) to determine whether the areas identified 10 years ago to improve use of pre-existing biospecimens have been addressed.

Methods

In January 2024, the Information for Management, Planning, Analysis, and Coordination (IMPACII) records (National Institutes of Health’s proprietary system containing information about all NIH extramural research projects) were queried using NCI’s Portfolio Management Application (PMA) to identify active grants (excluding supplements) managed by NCI’s DCCPS. Data obtained from IMPAC II for this analysis included grant type, grant mechanism, and fiscal year that the competing grant application was awarded. Additionally, information available on biospecimen use were obtained for the identified grants. The biospecimen related data are available for all active DCCPS grants starting in 2014, and are based on annual review of grant abstract, aims, research design, and methods. Biospecimen-related variables available include whether the grant application describes the collection of new biospecimens and/or the utilization of existing biospecimens (i.e., specimens that have been collected previously for other purposes) as well as the biospecimen type. Biospecimen type categories are: DNA, cell lines, strains (includes Epstein-Barr Virus (EBV) transformed lymphocytes, but not purchased cell lines), red blood cells (RBC), feces, leukocytes/buffy coat, urine, peripheral blood lymphocytes (PBL; includes peripheral blood mononuclear cells (PBMC), isolated T cells, etc.), buccal/saliva/oral swab for DNA, Guthrie cards/blood spot, cervical swab, blood (whole blood unprocessed blood components), serum or plasma, tissue culture, tissue (fresh, frozen, paraffin embedded tissue blocks, slides, bone marrow, tissue microarray (TMA)), RNA, saliva for biomarkers. For this analysis, whole blood, serum/plasma, RBCs, PBLs, and PBMCs, were combined into a single “blood products” variable. The tissue category mostly includes formalin fixed paraffin embedded (FFPE) tissue from tumor resection or biopsy.

As of January 2024, DCCPS managed 1,039 active grants. The focus of this analysis was on population-based cancer research studies that used only existing biospecimens. We defined existing biospecimens as biological materials that were collected prior to the funding period of the active grant included in the present analysis. These existing biospecimens were collected from individuals for research or clinical use and were stored in manners that allow researchers to request those stored materials for new studies. After excluding grants that did not include biospecimens (i.e. no biospecimens at all were included in the grant), those that included new collection of biospecimens for the purposes of the grant, or those that did not use existing biospecimens to test a population-based cancer research hypothesis (i.e. the existing biospecimens were used for methods development, etc.), 104 grants remained (Fig. 1) and are listed in Supplemental Table 1. Note that the analysis included all grant types (including exploratory investigator-initiated awards (e.g. NIH R21 activity code), small research investigator-initiated awards (e.g. NIH R03 activity code), program project awards (e.g. NIH P01 activity code), independent investigator-initiated awards (e.g. NIH R01 activity code), etc.).

Figure 1. Biospecimen grant review process.

Figure 1.

1039 grants were included. Of these, 652 were initially excluded because they did not include biospecimens (i.e. no biospecimens at all were included in the grant) and 257 were excluded because they involved new collection of biospecimens. Upon further review, 26 additional grants were excluded because while they included existing biospecimens, the existing specimens were not being used to test a population-based cancer research hypothesis (i.e. they were used for methods development, etc.). 104 grants that only used existing biospecimens for population-based cancer research were the focus of this manuscript.

Each of the 104 grants were coded independently by two co-authors who abstracted the following information from the study aims: cancer type, study design (intervention or observational), cancer continuum category (risk/etiology [defined as pre-diagnosis and referred to in this manuscript as “risk”] or survivorship [defined as focused on the health and well-being of a person with cancer from the time of diagnosis until the end of life]), sex (male or female), and named biospecimen resource. If a consortium was named as the resource used, only the name of the consortium was recorded information was abstracted. Discrepancies in coding were resolved by consensus or a third reviewer. For analysis purposes, rare cancers were defined as those with an age-adjusted incidence rate of less than 15/100,000 in the population using the NCI-supported Surveillance, Epidemiology, and End Results Program (SEER) Incidence Data, November 2023 Submission (1975–2021) (4). This study did not require NIH institutional review board (IRB) review since it is not considered human subjects research, as the analyses were restricted to summaries of existing studies, did not include identifiable private information or biospecimens, and involved no interaction or intervention with human participants.

Results

Of the 1,039 active grants managed by DCCPS, 652 were excluded because they did not include any biospecimens (i.e. no biospecimens at all were included in the grant), 257 were excluded because they included new collection of biospecimens for the purposes of the grant, and another 26 were excluded because they did not use existing biospecimens to test a population-based cancer research hypothesis (i.e. the existing biospecimens were used for methods development, etc.). The remaining 104 grants (10.0%) used pre-existing biospecimens only (Fig. 1). The 104 grants spanned multiple funding mechanisms and DCCPS program areas (5), with the majority being R01s (63.5%; an R01 is the most common type of research grant awarded by the NIH for independent investigator-initiated research projects) and aligned with the “Epidemiology and Genomics” program area (94.2%; Table 1). Most of the grants utilizing existing biospecimens were risk-focused grants (53.8%), with 29.8% focused on survival or survivorship, and 16.3% focused on both risk and survivorship. The grants spanned many different cancer types, with breast cancer being the most common cancer of focus (24%), followed by colorectal cancer (11.5%) and blood cancers (11.5%). Of the 104 grants, 2.9% used biospecimens from childhood cancer survivors, 97.1% biospecimens from adults, 31.7% biospecimens from females only, and 57.7% from both males and females.

Table 1.

Existing biospecimens utilized in DCCPS grants active as of January 2024 (n=104), by study characteristic (row percents)

Characteristic Number of grants Blood products DNA Tissue specimens RNA

n % n % n % n %




Overall 104 67 64.4 62 59.6 56 53.8 28 26.9
Program areas
 Epidemiology & Genomics 98 62 63.3 60 61.2 55 56.1 26 26.5
 Behavioral Research 5 4 80.0 2 40.0 1 20.0 2 40.0
 Healthcare Delivery Research 1 1 100.0 0 0.0 0 0.0 0 0.0
Cancer continuum focus
 Risk/Etiology 56 42 75.0 32 57.1 23 41.1 14 25.0
 Survivorship 31 17 54.8 21 67.7 21 67.7 12 38.7
 Both 17 8 47.1 9 52.9 12 70.6 2 11.8
Childhood cancer population
 Yes 3 2 66.7 3 100.0 0 0.0 1 33.3
 No 101 65 64.4 59 58.4 56 55.4 27 26.7
Tumor type focus
 Solid 87 53 60.9 51 58.6 52 59.8 24 27.6
 Blood 12 10 83.3 9 75.0 2 16.7 3 25.0
 Cancer agnostic 5 4 80.0 2 40.0 2 40.0 1 20.0
Rare versus common cancer focus
 Common 61 38 62.3 38 62.3 42 68.9 20 32.8
 Rare 36 23 63.9 21 58.3 12 33.3 7 19.4
 Unknown 2 2 100.0 1 50.0 0 0.0 0 0.0
 Cancer agnostic 5 4 80.0 2 40.0 2 40.0 1 20.0
Cancer type
 Cancer agnostic 5 4 80.0 2 40.0 2 40.0 1 20.0
 Multiple cancer types 5 3 60.0 4 80.0 1 20.0 1 20.0
 Any blood cancer 12 10 83.3 9 75.0 2 16.7 3 25.0
 Biliary tract 2 2 100.0 2 100.0 0 0.0 0 0.0
 Bladder 1 1 100.0 0 0.0 1 100.0 0 0.0
 Brain 3 1 33.3 3 100.0 2 66.7 0 0.0
 Breast 25 16 64.0 12 48.0 17 68.0 8 32.0
 Cervical 1 0 0.0 1 100.0 1 100.0 0 0.0
 Colon 1 1 100.0 1 100.0 0 0.0 0 0.0
 Colorectal 12 5 41.7 9 75.0 9 75.0 5 41.7
 Endometrial 1 1 100.0 1 100.0 1 100.0 0 0.0
 Esophageal 1 0 0.0 1 100.0 1 100.0 1 100.0
 Liver 2 2 100.0 0 0.0 0 0.0 0 0.0
 Lung 7 5 71.4 5 71.4 4 57.1 1 14.3
 Melanoma 2 2 100.0 2 100.0 2 100.0 1 50.0
 Neuroendocrine 1 1 100.0 1 100.0 0 0.0 0 0.0
 Oropharyngeal 3 1 33.3 0 0.0 2 66.7 1 33.3
 Ovarian 3 0 0.0 1 33.3 3 100.0 0 0.0
 Pancreatic 3 3 100.0 1 33.3 0 0.0 0 0.0
 Prostate 6 4 66.7 4 66.7 5 83.3 3 50.0
 Renal cell 1 0 0.0 0 0.0 1 100.0 1 100.0
 Squamous cell carcinoma 1 0 0.0 1 100.0 1 100.0 1 100.0
 Stomach 2 2 100.0 0 0.0 0 0.0 0 0.0
 Testicular 4 3 75.0 2 50.0 1 25.0 1 25.0
Sex
 Female only 33 20 60.6 17 51.5 22 66.7 8 24.2
 Male only 11 7 63.6 6 54.5 6 54.5 4 36.4
 Both male and female 60 40 66.7 39 65.0 28 46.7 16 26.7
a

includes whole blood, serum, plasma, red blood cells, peripheral blood lymphocytes, peripheral blood mononuclear cells

The four most commonly used biospecimens were blood products (64.4%), DNA (59.6%), tissue (53.8%), and RNA (26.9%) (Table 1). Other less frequently utilized biospecimens included urine (5.7%), cell lines/strains (4.8%), buccal cells/saliva for DNA (4.8%), Guthrie Cards/blood spots (2.9%), cervical swabs (1.0%), tissue culture (1.9%), saliva for biomarkers (1.9%) (Supplementary Table 2). No grant utilized existing fecal samples.

Use of blood products was higher among risk focused-only grants (75.0%) compared to those focused on survivorship (54.8%) or both risk and survivorship (47.1%), while use of DNA or RNA was higher among survivorship grants compared to the other cancer continuum type grants. Further, use of tissue was higher among grants with a survivorship component (67.7%) versus risk-focused only grants (41.1%).

A similar percentage of the common versus rare cancer-focused grants utilized blood products and DNA; however, a higher percentage of the common cancer-focused versus the rare cancer-focused grants utilized tissue (68.0% versus 33.3%) and RNA (32.8% versus 19.4%). Blood product and RNA use was higher among grants focused on hematologic cancers versus those focused on solid tumors; conversely, the use of tissue was higher among grants focused on solid tumors.

Among grants that referenced their source of biospecimens, there were over 150 uniquely named resources with about half of the grants utilizing more than one resource (Supplemental Table 3). Biospecimen resources named in 3 or more grants (n=18) are shown on Table 2. The Southern Community Cohort Study was named in almost one fifth of the grants included in this portfolio analysis (17.3%). Other frequently named resources were the Multiethnic Cohort (n=12), the Women’s Health Initiative (n=11), the Nurses’ Health Study (n=9), and the Black Women’s Health Study (n=7).

Table 2.

Biospecimen resources named in three or more active DCCPS grants utilizing existing biospecimens, January 2024

Named Biospecimen Resource number of grants where resource was named
Southern Community Cohort Study (SCCS) 18
Multiethnic Cohort (MEC) 12
Women’s Health Initiative (WHI) cohort/WHI Life and Longevity after Cancer (LILAC) cohort 11
Nurses’ Health Study (NHS) 9
Black Women's Health Study (BWHS) 7
Health Professionals Follow-up Study (HPFS) 6
Prostate, Lung, Colorectal, Ovarian Cancer Screening Trial (PLCO) 6
Shanghai Mens Health Study (SMHS) 6
Shanghai Women's Health Study (SWHS) 6
Nurses’ Health Study-II (NHS-II) 4
Physicians Health Study (PHS) 4
European Prospective Investigation into Cancer (EPIC) 3
Kaiser Permanente Northern California (KPNC) 3
Komen Tissue Bank 3
Melbourne Collaborative Cohort Study (MCCS) 3
Singapore Chinese Health Study (SCHS) 3
The Pathways Study 3
Women's Health Study (WHS) 3

Discussion

Rather than collecting new biospecimens for a study, which can be costly and time-intensive, using existing biospecimens for cancer control and population science research can be an appealing and effective option for researchers. Although it is often challenging to identify pre-existing biospecimens that have the appropriate accompanying clinical, lifestyle, social, environmental, and/or molecular data, once biospecimens are identified, this information has the potential to address many key cancer-related hypotheses. This analysis of grants managed by NCI DCCPS was conducted to understand the use of existing biospecimens in population-based cancer research. Most of the grants using biospecimens supported epidemiology and genomics research (94.2%, Table 1) and represented aspects of research across the cancer continuum including risk and survivorship.

The 2014 analysis described in Carrick et al. (3) found that 19.4% of active NCI DCCPS grants leveraged pre-existing biospecimens only. Our new analysis found 10.0% of grants active in 2024 used pre-existing biospecimens only. The difference may be attributed to a slight increase in the overall percentage of active NCI DCCPS grants not using any biospecimens (Supplemental Fig. 1). Since 2017, the proportion of newly funded population-based cancer grants not utilizing any biospecimens has increased slightly from 53.8 and 53.0% in 2015 and 2016 to between 59.1 and 68.6% between 2017 and 2024. One possible explanation for this trend is the increasing emphasis on the secondary use of already existing data. NIH has encouraged the use of existing data and, starting in 2017, the NCI issued funding opportunities for secondary analysis and integration of existing data to elucidate cancer risk and related outcomes; NCI grant funding opportunities for investigator-initiated cancer-related research funding using already existing data including notices: PA-17-239, PA-17-243, PAR-25-095, PAR-25-096, PAR-20-276, PAR-20-277. What is clear is that biospecimens continue to be used in a substantial portion of grants across the entire analysis period, emphasizing the need to continue supporting initiatives for researchers to use both retrospectively and prospectively collected materials.

Recent advances in molecular technologies have allowed researchers to test a variety of previously unexplorable hypotheses in cancer control research. In the 2014 analysis (3), DNA was the most common biospecimen included in grants using existing biospecimens (72.0%). In the current analysis, blood is the most common (64.4%); in the 2014 analysis, approximately 52.0% of grants used existing blood (serum, plasma or blood). This increase might reflect development of novel technologies allowing measurement of analytes in preserved blood specimens. For instance, one recent population-based study measured cotinine in previously collected maternal serum as an indication of smoking during pregnancy and the potential impact on testicular cancer risk in offspring (6). Advances in technologies have also spurred a growing interest in integrating transcriptomics with genomics data to gain a better understanding of cancer risk, identify novel biomarkers, and to better characterize tumor subtypes. Our current analysis found existing RNA specimens were used in 26.9% of the grants, but the 2014 analysis (3) found that less than 5.0% of the grants used existing RNA specimens, which may support the idea of increasing interest in transcriptomics and genomics in population-based cancer research. Some examples of this use of existing RNA specimens were identified in manuscripts resulting from grants included in this current analysis; for example, RNA was used for long non-coding RNA profiling as a potential regulator of tumor aggressiveness and prognosis (7) and for gene expression profiling of inflammatory factors as potential mediators of accelerated aging in breast cancer patients (8).

While rare cancers account for 25% of all new cases of adult cancers each year in the U.S., it can be difficult to conduct epidemiologic studies of rare cancers due to challenges in achieving sufficient sample sizes (9). This analysis provided information about the types of biospecimens used by funded rare cancer grants. Studies of rare cancers tended to use mostly blood products (63.9%), DNA (58.3%), tissue (33.3%), or RNA (19.4%). While the percentage of rare cancer studies utilizing blood products and DNA was similar to the common cancer studies, there were notable differences between rare and common cancer grants utilizing existing tissues (33.3% of rare cancer studies versus 68.9% of common cancer studies) and existing RNA (19.4% vs. 32.8%). These differences are likely due to the availability (or lack thereof) of existing tissue and RNA resources for population-based cancer control rare cancer versus common cancer grants.

The current analysis showed tissue is one of the most utilized biospecimen types (53.8% of grants utilized tissue). A recent effort to help facilitate future activities involving tissues for population-based cancer research is the SEER Virtual Repository (10,11). In 2024, the NCI established the SEER Virtual Repository as an infrastructure for researchers to obtain de-identified, but linked, tissue and data on cancer cases ascertained by the SEER Program cancer registries (10,11). This program is anticipated to facilitate future studies of cancer types that have traditionally been difficult to study in population-based studies, such as early onset cancers, rare cancers, cancer cases with unusual cancer outcomes, and cases with multiple diagnoses of a primary cancer.

In this analysis, the most common sources of existing biospecimens mentioned were longstanding (at least 20 years) epidemiology cohort studies with at least 50,000 enrollees at baseline. Three of the most commonly referenced sources include epidemiology cohorts such as the Southern Community Cohort Study (SCCS), the Multiethnic Cohort Study (MEC), and the Women’s Health Initiative (WHI cohort and the WHI-Life and Longevity After Cancer (LILAC) cohort. These are large, longitudinal studies with well-annotated specimens that have existed for many years, making them rich biospecimen resources relevant to population-based cancer research. In addition to large epidemiologic cohort studies, other sources of existing biospecimens included case-control studies, randomized controlled trial studies, consortiums, networks, and hospital or other specialized biobanks (e.g. Komen Tissue Bank, Children’s Hospital of Philadelphia Biobank). The sources of existing biospecimens were utilized by a combination of investigators who originally developed the resource and by investigators unaffiliated with the original resource. Although these older collections of existing biospecimens have certain limitations, such as biospecimen quality considerations, limited accompanying data (such as epidemiologic, comorbidity, and treatment data), and data on potentially outdated clinical practices, treatments, and/or exposures that may have changed over time, they represent an important potential source of biospecimens. Researchers need to be cognizant of these potential issues when selecting biospecimens and designing their studies. Researchers also need to consider these limitations when selecting a study design, such as making appropriate case-control selections (such as matching cases with controls from the same time period), considering changes in exposure variables over time using longitudinal or multi-point analysis models, and utilizing biospecimens that are paired with robust data on specimen handling and storage and from biobanks that have quality control validation data on biospecimen suitability.

Our previous analysis in 2014 noted several areas of improvement to better facilitate population-based cancer research using pre-existing biospecimens (3). Noted areas included expanding the use of biospecimens in behavioral research studies, increasing collection of rarer specimens and specimens from rarer cancer types, conducting validation studies of biospecimen collections to ensure stored samples are suitable, and continuing to foster sharing of biospecimens. Progress has been made on some, but not all, gap areas that were identified in 2014. In 2014, we speculated that behavioral cancer research studies would benefit from greater use of biospecimens as analytical methods evolve. For example, behavioral studies have benefitted from the development of polygenic scores, which rely on DNA (12). In this current analysis, we observed a slight increase in biospecimen use among NCI population science behavioral grants (grants that were funded primarily by the Behavioral Research Program within NCI DCCPS) from 24.1% in 2014 to 30.9% in 2024. As another example of the breadth of research disciplines utilizing biospecimens, we also identified a healthcare delivery research project which involved analysis of existing serum biospecimens to measure prostate-specific antigen (PSA) levels, with the overall objective of optimizing prostate cancer screening by determining which men should be screened, when screening should occur, and how frequently screening should be performed (13).

Another area for improvement identified in 2014 was increasing collection of rarer specimen types (such as feces and cervical swabs). Given the small number of grants utilizing them, it is possible that prospectively collecting these specimens on an as-needed basis is more efficient or that the current science for utilizing these samples for research purposes is still being developed, such as biospecimen collection/preservation techniques for fecal analyses (1416) and the development of specific collection strategies for gynecological biospecimens (17). Another recommended improvement was to increase biospecimen collections for rare cancer types, which takes more concerted and coordinated efforts than collection of more common cancer types. Significant individual initiatives have advanced rare tumor research capacity in part via increased biospecimen collection, including the My Pediatric and Adult Rare Tumor (MyPART) network, which was designed to conduct a comprehensive longitudinal Natural History Study of Rare Solid Tumors (18), and the previously-mentioned SEER Virtual Tissue Repository which could be a rich resource for rare cancer tissue. The need for rare tumor biospecimen collections also seems to be increasingly recognized by the larger research community (19,20), aligning with recent noted trends in increasing US and global rare cancer research efforts as evidenced by publications (reviewed in (21)). Taken together, these recent efforts may indicate an increasing recognition of the importance of collecting and of pooling of rare cancer resources including biospecimens.

In 2014, we recommended that validation studies be conducted regularly to ensure stored samples are suitable for the desired analyses, thus increasing the potential utility of existing biospecimens. NCI and the International Society for Biological and Environmental Repositories (ISBER) have published best practices for biospecimens (22,23) which include establishing and implementing quality control protocols to ensure quality and reliability of the biospecimens for research. With the increasing number of molecular analysis platforms, a broader and more fitting recommendation may be to ensure that the samples are “fit for purpose” - stored in manners that are suitable and will yield reproducible data for their intended research question and with the selected technology or assay (24,25). Consulting the scientific literature or other resources such as the NCI Biospecimen Research Database (26,27) or individual biobanks for studies that have used the biospecimens from the specific biobanks for similar assays/technologies/study questions can give insight as to whether the biospecimens could be fit for purpose. Biobanks may also facilitate pilot studies using a small subset of samples at no or minimal cost to ensure the stored specimens are fit for the desired purpose (H. Eliassen and J. Neville-Golden, personal communication, September 4, 2025).

Over the past 10 years, progress has also been made in terms of fostering sharing of biospecimens. In an effort to facilitate the identification of potential sources of existing biospecimens and data, the NCI maintains a Specimen Resource Locator (SRL, (28)) and a database of cancer epidemiology cohorts (Cancer Epidemiology Descriptive Cohort Database, CEDCD (29)). The SRL is a database of biospecimen resources designed to help researchers locate possible samples for their investigational use, while the CEDCD has descriptive information about cohort studies, including types of biospecimens and enrollment counts. As of 2025, more than 55 cancer epidemiology cohorts in the CEDCD (29) have blood, buccal/saliva, tissue, urine, and/or feces in their repositories and have established processes in place for researchers to request access to these existing biospecimens and data. The CEDCD and SRL allow users to search by specimen type and other key variables to identify collections with their desired criteria. More recently, NCI DCCPS developed suggested biospecimen management and sharing plan elements for population science projects (30) to help foster sharing of existing biospecimens.

The observation that only 10.0% of NCI DCCPS funded grants are relying solely on existing biospecimens that were already collected and stored suggests a need to reconsider the biobanking model typically used in population-based studies. Biobanking efforts developed for population-based research studies typically focus on collecting specimens to meet planned goals but also place importance on storing unused specimens for unknown future use by investigators. These efforts tend to be associated with large epidemiology cohort studies which follow large groups of participants over time for the incidence of new cancers and cancer-related outcomes. Utilization of the stored biospecimens can facilitate new cancer research studies in a cost and time-efficient manner (3). However, too often, biobanked specimens are underutilized, leading to increased costs (31). Understanding the reasons for underutilization of pre-existing samples and the need for collecting new biospecimens could help inform future biobanking efforts. Reasons why investigators may opt to collect new specimens rather than using existing specimens could include administrative and regulatory requirements for accessing existing stored specimens and data that are perceived as excessive. New study designs are being explored to collect and store biospecimens from certain subsets of participants (32). Shifting to more of a prospective biobank model (where biospecimens are collected at the time they are requested by researchers) instead of a classic biobank model (where residual biospecimens are collected for potential future uses) could be considered in some instances (33). The SEER VTR was developed with this idea in mind; collecting and storing residual SEER FFPE tissues for potential future uses was not sustainable, and thus, the idea of a prospective SEER FFPE tissue request and distribution model was developed.

Conclusion

Through this analysis, we determined and described types of and typical uses for existing biospecimens in population-based cancer control studies. The variety of biospecimen types being used to examine a breadth of hypotheses related to cancer risk and survivorship demonstrate the value of biospecimen resources. Our analysis focused on a small set of grants - those using existing biospecimens in classic-model biobanks. Biospecimens continue to be important to the aims of a substantial portion of grants across the entire analysis period, emphasizing the need to continue supporting initiatives for researchers to use both retrospectively and prospectively collected materials. The grants included in this analysis utilized many different sources of existing biospecimens. Given the potentially overwhelming number of sources of appropriate specimens and data and the need to facilitate sharing of existing biospecimens, we encourage investigators to include their resource descriptions in centralized databases like the SRL or CEDCD, conduct validation studies of stored specimens and make the results easily accessible, lower the administrative burdens (perceived or real) for accessing stored specimens, and consider implementing different models for biobanking.

Supplementary Material

Supplemental Fig 1
Supplemental Tables

Acknowledgments:

We thank Scott Rogers for help with the initial identification of grants for this portfolio analysis.

Funding Statement:

The authors’ salaries were provided by the National Institutes of Health. No other funding was provided for this research.

Footnotes

Author’s disclosures: There are no disclosures.

Disclaimer: The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

References

  • 1.Bressan D, Battistoni G, Hannon GJ. The dawn of spatial omics. Science 2023;381(6657):eabq4964, doi: 10.1126/science.abq4964 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ogino S, Nowak JA, Hamada T, et al. Insights into Pathogenic Interactions Among Environment, Host, and Tumor at the Crossroads of Molecular Pathology and Epidemiology. Annu Rev Pathol 2019;14(83–103, doi: 10.1146/annurev-pathmechdis-012418-012818 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Carrick DM, Mette E, Hoyle B, et al. The use of biospecimens in population-based research: a review of the National Cancer Institute’s Division of Cancer Control and Population Sciences grant portfolio. Biopreserv Biobank 2014;12(4):240–5, doi: 10.1089/bio.2014.0009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Surveillance Research Program N. SEER*Explorer: An interactive website for SEER cancer statistics https://seer.cancer.gov/statistics-network/explorer/; 2025.
  • 5.NCI. National Cancer Institute, Division of Cancer Control & Population Sciences: Program Areas. Available from: https://cancercontrol.cancer.gov/program-areas.
  • 6.Beck AL, Uldbjerg CS, Lim YH, et al. Cotinine concentrations in maternal serum and amniotic fluid during pregnancy and risk of testicular germ cell cancer in the offspring: A prospective nested case-control study. Int J Cancer 2024;154(1):71–80, doi: 10.1002/ijc.34688 [DOI] [PubMed] [Google Scholar]
  • 7.Chen J, Higgins MJ, Hu Q, et al. DNA methylation differences in noncoding regions in ER negative breast tumors between Black and White women. Front Oncol 2023;13(1167815, doi: 10.3389/fonc.2023.1167815 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Carroll JE, Crespi CM, Cole S, et al. Transcriptomic markers of biological aging in breast cancer survivors: a longitudinal study. J Natl Cancer Inst 2025;117(2):312–321, doi: 10.1093/jnci/djae201 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Gallicchio L, Daee DL, Rotunno M, et al. Epidemiologic Research of Rare Cancers: Trends, Resources, and Challenges. Cancer Epidemiol Biomarkers Prev 2021;30(7):1305–1311, doi: 10.1158/1055-9965.EPI-20-1796 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.NCI. SEER-Linked Virtual Tissue Repository (VTR) Program. Available from: https://seer.cancer.gov/vtr/. [DOI] [PMC free article] [PubMed]
  • 11.Sanchez P, Van Dyke AL, Petkov VI, et al. NCI SEER-Linked Virtual Tissue Repository Pilot. J Natl Cancer Inst Monogr 2024;2024(65):180–190, doi: 10.1093/jncimonographs/lgae034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Plomin R Celebrating a Century of Research in Behavioral Genetics. Behav Genet 2023;53(2):75–84, doi: 10.1007/s10519-023-10132-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.RePORTER N Tailored prostate cancer screening: addressing USPSTF priority research gaps in a racially-diverse study population. Available from: https://reporter.nih.gov/search/kC7iIm-I6ECyjhr8xo4oFw/project-details/10735739.
  • 14.Jorgensen AB, Almer L, Brandstrup B, et al. Fecal microbiomes from screening sampling tubes are stable despite varying sampling and storage conditions. Sci Rep 2025;15(1):26951, doi: 10.1038/s41598-025-12506-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Shao L, Liao J, Qian J, et al. MetaGeneBank: a standardized database to study deep sequenced metagenomic data from human fecal specimen. BMC Microbiol 2021;21(1):263, doi: 10.1186/s12866-021-02321-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Zhang X, Walker K, Mayne J, et al. Evaluating live microbiota biobanking using an ex vivo microbiome assay and metaproteomics. Gut Microbes 2022;14(1):2035658, doi: 10.1080/19490976.2022.2035658 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Grech L, Grech CA, Calleja-Agius J, et al. Biobanking and gynecologic oncology - Special considerations, challenges and opportunities. Eur J Surg Oncol 2025;51(4):109713, doi: 10.1016/j.ejso.2025.109713 [DOI] [PubMed] [Google Scholar]
  • 18.Ahmed S, Wedekind MF, Del Rivero J, et al. Longitudinal Natural History Study of Children and Adults with Rare Solid Tumors: Initial Results for First 200 Participants. Cancer Res Commun 2023;3(12):2468–2482, doi: 10.1158/2767-9764.CRC-23-0247 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Holowatyj AN, Overman MJ, Votanopoulos KI, et al. Defining a ‘cells to society’ research framework for appendiceal tumours. Nat Rev Cancer 2025;25(4):293–315, doi: 10.1038/s41568-024-00788-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Schultz KAP, Chintagumpala M, Piao J, et al. Rare Tumors: Opportunities and challenges from the Children’s Oncology Group perspective. EJC Paediatr Oncol 2023;2(doi: 10.1016/j.ejcped.2023.100024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Vivelo C, Reilly KM, Widemann BC, et al. The Landscape of US and Global Rare Tumor Research Programs: A Systematic Review. Oncologist 2024;29(2):106–116, doi: 10.1093/oncolo/oyad285 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Engel KB, Vaught J, Moore HM. National Cancer Institute Biospecimen Evidence-Based Practices: a novel approach to pre-analytical standardization. Biopreserv Biobank 2014;12(2):148–50, doi: 10.1089/bio.2013.0091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Snapes E, Astrin JJ, Bertheussen Kruger N, et al. Updating International Society for Biological and Environmental Repositories Best Practices, Fifth Edition: A New Process for Relevance in an Evolving Landscape. Biopreserv Biobank 2023;21(6):537–546, doi: 10.1089/bio.2023.0140 [DOI] [PubMed] [Google Scholar]
  • 24.Hartman V, Matzke L, Watson PH. Biospecimen Complexity and the Evolution of Biobanks. Biopreserv Biobank 2019;17(3):264–270, doi: 10.1089/bio.2018.0120 [DOI] [PubMed] [Google Scholar]
  • 25.Nanni U, Ferroni P, Riondino S, et al. Biospecimen Digital Twins: Moving from a “High Quality” to a “Fit-for-Purpose” Concept in the Era of Omics Sciences. Cancer Genomics Proteomics 2023;20(3):211–221, doi: 10.21873/cgp.20376 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Moore HM. The NCI Biospecimen Research Network. Biotech Histochem 2012;87(1):18–23, doi: 10.3109/10520295.2011.591833 [DOI] [PubMed] [Google Scholar]
  • 27.NCI. The Biospecimen Research Database. Available from: https://brd.nci.nih.gov/brd/.
  • 28.NCI. National Cancer Institute, Specimen Resource Locator (SRL). https://specimens.cancer.gov/;
  • 29.NCI. Cancer Epidemiology Descriptive Cohort Database (CEDCD). Available from: https://cedcd.nci.nih.gov/.
  • 30.NCI. Suggested Resource Sharing Plan Elements for Biospecimen Management and Sharing. 2025. Available from: https://epi.grants.cancer.gov/funding/biospecimen-sharing.html.
  • 31.Grizzle WE, Sexton KC. Commentary on Improving Biospecimen Utilization by Classic Biobanks: Identifying Past and Minimizing Future Mistakes. Biopreserv Biobank 2019;17(3):243–247, doi: 10.1089/bio.2018.0080 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Ramos MLF, Chaturvedi AK, Graubard BI, et al. Efficient risk-based collection of biospecimens in cohort studies: designing a prospective study of diagnostic performance for multicancer detection tests. Am J Epidemiol 2025;194(1):243–253, doi: 10.1093/aje/kwae139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Grizzle WE, Sexton KC, McGarvey D, et al. Lessons Learned During Three Decades of Operations of Two Prospective Bioresources. Biopreserv Biobank 2018;16(6):483–492, doi: 10.1089/bio.2018.0073 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Fig 1
Supplemental Tables

RESOURCES