Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2022 Apr 7;17(4):e0266111. doi: 10.1371/journal.pone.0266111

Comprehensive cancer-oriented biobanking resource of human samples for studies of post-zygotic genetic variation involved in cancer predisposition

Natalia Filipowicz 1,*, Kinga Drężek 1, Monika Horbacz 1, Agata Wojdak 1, Jakub Szymanowski 1,2, Edyta Rychlicka-Buniowska 1, Ulana Juhas 1, Katarzyna Duzowska 1, Tomasz Nowikiewicz 3,4, Wiktoria Stańkowska 1, Katarzyna Chojnowska 1, Maria Andreou 1, Urszula Ławrynowicz 1, Magdalena Wójcik 1, Hanna Davies 5, Ewa Śrutek 4,6, Michał Bieńkowski 7, Katarzyna Milian-Ciesielska 8, Marek Zdrenka 6, Aleksandra Ambicka 9, Marcin Przewoźnik 9, Agnieszka Harazin-Lechowska 9, Agnieszka Adamczyk 9, Jacek Kowalski 7, Dariusz Bała 4,10, Dorian Wiśniewski 10, Karol Tkaczyński 10, Krzysztof Kamecki 11, Marta Drzewiecka 3, Paweł Wroński 11, Jerzy Siekiera 11, Izabela Ratnicka 12, Jerzy Jankau 12, Karol Wierzba 13, Jarosław Skokowski 14,15, Karol Połom 14, Mikołaj Przydacz 16, Łukasz Bełch 16, Piotr Chłosta 16, Marcin Matuszewski 17, Krzysztof Okoń 8, Olga Rostkowska 18, Andrzej Hellmann 18, Karol Sasim 19, Piotr Remiszewski 18, Marek Sierżęga 20, Stanisław Hać 18, Jarosław Kobiela 18, Łukasz Kaska 18, Michał Jankowski 4,10, Diana Hodorowicz-Zaniewska 20, Janusz Jaszczyński 21, Wojciech Zegarski 4,10, Wojciech Makarewicz 14,22, Rafał Pęksa 7, Joanna Szpor 8, Janusz Ryś 9, Łukasz Szylberg 6,23, Arkadiusz Piotrowski 1,24, Jan P Dumanski 1,5,24,*
Editor: Isaac Yi Kim25
PMCID: PMC8989288  PMID: 35390022

Abstract

The progress in translational cancer research relies on access to well-characterized samples from a representative number of patients and controls. The rationale behind our biobanking are explorations of post-zygotic pathogenic gene variants, especially in non-tumoral tissue, which might predispose to cancers. The targeted diagnoses are carcinomas of the breast (via mastectomy or breast conserving surgery), colon and rectum, prostate, and urinary bladder (via cystectomy or transurethral resection), exocrine pancreatic carcinoma as well as metastases of colorectal cancer to the liver. The choice was based on the high incidence of these cancers and/or frequent fatal outcome. We also collect age-matched normal controls. Our still ongoing collection originates from five clinical centers and after nearly 2-year cooperation reached 1711 patients and controls, yielding a total of 23226 independent samples, with an average of 74 donors and 1010 samples collected per month. The predominant diagnosis is breast carcinoma, with 933 donors, followed by colorectal carcinoma (383 donors), prostate carcinoma (221 donors), bladder carcinoma (81 donors), exocrine pancreatic carcinoma (15 donors) and metachronous colorectal cancer metastases to liver (14 donors). Forty percent of the total sample count originates from macroscopically healthy cancer-neighboring tissue, while contribution from tumors is 12%, which adds to the uniqueness of our collection for cancer predisposition studies. Moreover, we developed two program packages, enabling registration of patients, clinical data and samples at the participating hospitals as well as the central system of sample/data management at coordinating center. The approach used by us may serve as a model for dispersed biobanking from multiple satellite hospitals. Our biobanking resource ought to stimulate research into genetic mechanisms underlying the development of common cancers. It will allow all available “-omics” approaches on DNA-, RNA-, protein- and tissue levels to be applied. The collected samples can be made available to other research groups.

Introduction

One of the prerequisites for translational research is availability of well-characterized samples of different types from patients suffering from various diseases. Another requirement is the access to comprehensive and long-term follow-up clinical records for patients, which is important for the correlations between molecular findings and medical parameters. The third condition is a broad participation of patients treated at hospitals as well as control subjects, via their donation of samples to research projects. When these conditions are met, progress can be made towards 3P Medicine, i.e. preventive, personalized and precision.

Cancer is generally defined as a genetic disease, but the frequency of germline cancer-predisposing mutations vary considerably between different tumors and these inherited mutations are responsible for less than 10% of all cancers [13]. The remaining >90% of cancers arise as a result of mutations acquired during lifetime in normal somatic cells and the bulk of all cancers occurs late in life. Studies of cancer genomes have contributed to numerous discoveries of mutations that drive cancer growth. However, a fully developed tumor is often clonally heterogeneous and represents a late stage of the disease. This might restrict description of mutations that are occurring very early and initiate tumor development.

Post-zygotic (or somatic) mutations (PZM) in histologically normal human cells from various organs that develop cancer have increasingly been suggested over the past decade as a major source of cancer driving mutations, but this field is still poorly explored [4]. Examples can be given for breast cancer [57], normal skin and other normal tissues [8,9], colon cancer [10], urinary bladder cancer [11], aberrant clonal expansions in peripheral blood of healthy subjects (also known as clonal hematopoiesis of indeterminate potential) [1214] and esophageal cancer [15]. In the latter study, the prevalence of cancer driving mutations was higher in normal epithelium than in esophageal cancers. Furthermore, our biobanking project has been influenced by our interest in analysis of mosaic Loss of chromosome Y (LOY) in blood. It has been noted for over 50 years that chromosome Y is frequently lost in the leukocytes of aging men [16,17]. Recent epidemiological investigations show that LOY in leukocytes, representing lack of nearly 2% of the haploid nuclear genome, is associated with earlier mortality and morbidity from many diseases in men, including multiple common forms of cancer [13,18,19]. Moreover, LOY is the most common post-zygotic (somatic) mutation from analyses of bulk DNA and single-cells from peripheral blood [20].

Thus, comprehensive cancer-oriented biobanking requires sampling of not only tumor tissues but also normal fragments from the affected organ and other control tissue(s) that is not directly involved in the disease process. We use the term “uninvolved margin (UM) or tissue” that refers to histologically controlled non-tumorous tissue, which is located at various distances from the site of primary tumor [5,6]. Collection of blood and plasma (liquid biopsy) is also crucial for future genetic and proteomic analyses. We report here the results of biobanking activities for five common cancer diagnoses that have been ongoing at five major clinical cancer centers in four cities of Poland for a period of more than 23 months.

Materials and methods

Diagnoses, logistics and collection protocols

We selected five diagnoses, i.e. breast-, colorectal-, prostate-, bladder- and exocrine pancreas carcinomas, as well as metachronic metastases of colorectal cancer to liver. The choice was based on the high incidence of these diagnoses and/or often fatal outcome of the disease. We established collaboration with five clinical centers in Poland: Oncology Center in Bydgoszcz; National Institute of Oncology in Cracow; University Clinical Centre in Gdansk, University Hospital in Cracow and Specialist Hospital in Koscierzyna. The Pathology Departments at each of these centers were equipped with small -80°C freezers (80-liter volume, model ULTF 80, Arctico), used for temporary storage of samples prior to shipment to 3P-Medicine Laboratory in Gdansk. The freezers were accompanied by a laptop computer and rack reader for 2D Data-Matrix coded tubes (2 ml, Micronic), used for registration of samples with help of newly developed “MABData1” biobanking software (see below). Dispatch of samples on dry-ice and the corresponding documents from each hospital to 3P-Medicine Laboratory in Gdansk takes place approximately quarterly according to a well-defined procedure.

Figs 1 and 2 show types and number of independent samples collected for two common diagnoses such as breast- and prostate cancer. Breast cancer samples were obtained during mastectomy or breast conserving therapy (BCT) surgeries. Fig 3 show similar outlines of sample collection from patients operated for colorectal cancer (resection of primary tumor and metachronic metastasis of colorectal cancer to liver), bladder cancer (treated by either cystectomy or Trans-Urethral Resection of Bladder Tumor [TURBT]) and exocrine pancreas cancer, respectively. For each diagnosis, the compulsory set of samples include: 1–2 primary tumor fragments (PT); 1–12 specimens of uninvolved margin (UM) composed of macroscopically normal tissue collected at various distances from PT; 1–4 samples of whole blood (WB) (1.5 ml each) as the normal reference tissue; and 1–2 samples blood plasma (BP) (1–1.5 ml each) for future proteomic studies. Whenever available, for breast and colorectal carcinoma, local metastases to lymph node(s) (LN) were also collected, but only when they were clearly identifiable and large enough on gross examination. The volumes of samples from solid tissues ranged between 0.005 cm3 to 1 cm3. The tissues were collected according to the well-defined protocols. After macro-sectioning of the resected organ, small tissue fragments were selected and excised for biobanking. Subsequently, each fragment was cut in half: one portion was placed into a cryovial and fresh-frozen at -80°C, while the other one was fixed in formalin, embedded in paraffin and underwent standard processing, sectioning and H&E staining (FFPE). The latter FFPE tissue sectioning was done along the cut surface closest to the fresh-frozen biobanked piece of tissue, so that the FFPE section is as much as possible representative for the tissue in the frozen specimen. Therefore, despite the degree of uncertainty/discrepancy in the macroscopic assessment in some situations (i.e. for prostatic adenocarcinoma, multifocal breast cancer, pancreatic adenocarcinoma with coexistent chronic pancreatitis, tumors after neoadjuvant therapy), every single biobanked tissue fragment (both tumor and normal) has its matching FFPE tissue that undergoes pathological verification of the actual tumor content.

Fig 1. A summary of samples collected for two common cancer diagnoses.

Fig 1

(A) Collection for breast carcinoma patients. (B) Collection for prostate carcinoma patients. FFPE, Formalin-Fixed Paraffin-Embedded blocks; OCT, Optimal Cutting Temperature compound for fresh-frozen tissue; PBMC, Peripheral Blood Mononuclear Cells; CPT, Cell Preparation Tube with Sodium Heparin (BD Bioscience) for separation of granulocyte- and PBMC-fraction of white blood cells; FACS, Fluorescent Activated Cell Sorting; lymph., lymphocytes; Treg, T-regulatory lymphocytes; NK, Natural Killer cells.

Fig 2. An illustration of sample collection protocols for breast- and prostate cancer.

Fig 2

(A) Procedure for breast carcinoma samples treated with mastectomy. (B) Procedure for breast carcinoma patients treated with Breast Conservative Therapy (BCT). The distances in centimeters between samples of primary tumor and normal tissue are illustrated in panel A with solid lines. (C) Protocol for prostatectomy with detailed scheme of sample collection in different cross-sections (a–g). Abbreviations: UM, uninvolved margin composed of macroscopically normal tissue; PT, primary tumor; S, skin; LUM, lower uninvolved margin; UUM, upper uninvolved margin; LN, regional lymph node. Detailed description of particular fragments for the protocols is given in the Materials and Method section.

Fig 3. An illustration of sample collection protocols for colorectal carcinoma and metastases of colorectal cancer to liver, urinary bladder- and exocrine pancreas carcinomas.

Fig 3

(A) Scheme of sample collection for colorectal carcinoma. (B) Protocol of samples collection for metastases of colorectal cancer to liver. (C) Protocol of sample collection for urinary bladder after cystectomy. (D) Collection of samples for transurethral resection of tumor (TURBT). (E) Scheme of sample collection for the surgical removal of pancreas head. (F) Scheme of sample collection for the total pancreatomy. Primary tumors in all panels are drawn in red and samples of normal tissues in green. Abbreviations: UM, uninvolved margin composed of macroscopically normal tissue; PT, primary tumor; LN, regional lymph node; the lines show distances in centimeters from primary tumor.

The dispersed nature and scale of our biobanking required development of unified sample collection protocols, using well-defined clinical criteria for patient inclusion (Table 1). These protocols were developed in close collaboration between the molecularly-oriented team, surgeons involved in patient recruitment and treatment and pathologists collecting samples. In planning for material collection, we relied on our previous experience from studies of breast cancer [5,6,21]. In order to assure good quality of RNA/DNA extracted from the collected material, the general condition for all collected samples was a standardized time of usually 60–75 minutes (and maximum 2 hours) between tumor/organ resection and the moment of -80-degrees freezing for specimens dissected by the pathologist. It did not apply for the material used for Fluorescent Activated Cells Sorting (FACS) of leukocytes in the context of Loss Of chromosome Y (LOY) in male colorectal- and prostate cancer study as well as for material used for establishment of primary cell cultures (see below).

Table 1. Inclusion and exclusion criteria for each diagnosis.

Diagnosis Inclusion criteria
Breast cancer BCT with/without neoadjuvant therapy
Unilateral or bilateral mastectomy with/without neoadjuvant therapy
Colorectal cancer Resection of uni- or multifocal primary tumor with/without neoadjuvant therapy
Liver metastasis Resection of uni- or multifocal metachronous tumor with/without perioperative therapy
Prostate cancer Prostatectomy with/without neoadjuvant therapy
Bladder cancer TURBT with/without neoadjuvant therapy
Radical cystectomy with/without neoadjuvant therapy
Pancreatic cancer Uni- or multifocal Adenocarcinoma (exocrine cancer) *
Pancreaticoduodenectomy ** without neoadjuvant therapy
Total pancreatectomy without neoadjuvant therapy
Control group Age ≥ 65 y.o. without oncological and Alzheimer Disease in clinical history

BCT–Breast Conservative Therapy; metachronic tumor stands for the secondary tumor that arose more than 6 months after diagnosis of first malignancy; TURBT–Transurethral Resection of Bladder Tumor, *—exclusion criteria: Preoperative neoadjuvant therapy, Endocrine cancer; **—Pancreaticoduodenectomy (Whipple procedure)–operation performed to remove the cancerous head of the pancreas.

Tissue fragments dissected from mastectomy (Fig 2A) specimen include 1–4 PTs (depending on the tumor size and multifocality) and 9 UMs. Samples UM1 and UM2 located ~1 cm from PT (and presumably in the same lobe of the breast) towards the nipple and outer part, respectively. UMs 11–13 located ~2 cm from PT in the same quadrant, UMs 97 and 98 in two adjacent quadrants, UM99 in opposite quadrant, and S (skin) sample close to resection margin. BCT protocol (Fig 2B) involves collection of 1–2 PTs (depending on the tumor size), 7 UMs and 1 S. UMs 11–14 are collected in the vicinity of both resection lines; two of them (UMs 11 and 12) are closer to the nipple, while the other two (UMs 13 and 14) are further away from the center of the breast. Sample UM1 is taken between the nipple and PT, UM2 between PT and outer part of the breast. All the latter UMs are presumed to be located within the same quadrant and possibly the same lobe, in which the tumor is localized. In both breast cancer procedures, local metastases (LN) are also collected, when possible. Moreover, for both breast cancer surgical procedures, S100 and UM100 are dissected by surgeons directly at the operation theatre to a tube with sterile medium with antibiotics and sent to Gdansk within 24 hours to establish organoids and primary cell cultures (see below).

Due to the usually multifocal growth of prostate adenocarcinoma (constituting up to 95% of all prostate cancers) within this organ and frequent problems to macroscopically assess its exact location, the whole gland is sliced from base to the apex (Fig 2C). Four samples are taken with the punch in the slice that likely include the tumor: PT1 peripheral, right lobe; PT2 peripheral, left lobe; PT3 periurethral, right lobe; PT4 periurethral, left lobe. Analogically, four fragments of potentially unaffected tissue are collected in the slice located towards the base (UUM1–4) and apex (LUM1–4). A total of twelve tissue fragments are biobanked, each followed by the pathological report on the actual tumor content in the matching FFPE sample.

For colorectal cancer, up to 4 PTs (labeled PT1 and PT2 in case of multifocality, or A and B when two fragments are resected) and 8 UMs of mucosa (without muscular and serosa layer) were preserved: UMs 1–3 collected 1 cm away from the margin of PT, UMs 11–13 with 2 cm distance from the tumor, UM98 and 99 at least 5 cm from PT, with UM99 as the most distant one, located as far as possible from PT (Fig 3A). For metachronic metastasis (MT) of colorectal cancer to liver two fragments MT1 and MT11 (in the center and margin of the metastatic tumor respectively) and maximum two UM (close—UM1 and distant—UM99) are collected (Fig 3B). A similar pattern of samples is applicable for radical cystectomy (Fig 3C), while a unique protocol for TURBT involves one PT and two UM samples (Fig 3D). Collection for exocrine pancreatic cancer involves one PT and 1–3 UMs (depending on the type of resection and size of the material that is available) located about 1 cm from the margin of PT, while the UM99 is being the most remote fragment from PT (Fig 3E and 3F).

Cell cultures for breast cancer- and sorted leukocytes for Loss Of chromosome Y (LOY) project

In addition to the standard deep-frozen samples for biobanking, we also gathered additional unique material demanding dedicated procedures. Primary cultures from skin and uninvolved non-tumorous glandular tissue fragment of breast cancer patients (Fig 2A and 2B, S100 and UM100) yielding skin and stromal fibroblasts as well as organoids were initiated for a subset of cases using in house designed primary cell-culture protocols. For the same subset of patients, OCT blocks from skin and uninvolved margin were also prepared as a possible material for future spatial transcriptomics analysis.

In order to further study the association between LOY in blood and prostate- as well as male colorectal cancer, a total volume of 36 ml of peripheral blood was used for leukocyte sorting using FACS, as described previously [20]. Blood processing involved viable freezing of 2–4 million of peripheral blood mononuclear cells (PBMCs) and 0.1–4 million of CD4+ cells for single cell analysis. The sorted fractions included: CD19+ B cells, CD8+ T cells, CD4+ T-regulatory (Treg) and CD4+ non-Treg cells, granulocytes, monocytes and natural killer (NK) cells for further DNA and RNA analysis. We also collected samples from whole blood and buccal swabs for males without cancer or Alzheimer disease diagnosis that were age-matched to the above-mentioned cohorts of prostate- and male colorectal cancer patients. These subjects will serve as controls and their blood leukocytes were sorted and preserved according to the above-mentioned scheme.

Development of dedicated IT solutions

The computerization and semi-automation of the sample collection was implemented from the very beginning at each collection site. This facilitated the process of preserving a very high number of donors, reaching 100 per month without the risk of mix-up of samples. Each Pathology Department at the partner hospital was provided with a PC and a dedicated MABData1 software designed for that purpose. This program package has a simple, user-friendly web browser-based interface enabling registration of patients/samples with a set of clinical data (excluding personal information for safety reasons) and automated registration of tubes containing unique 2D codes. It also allows introduction of medical follow-up information at a later stage. All data is being synchronized every 5 minutes with MABServer in Gdansk using safe Advanced Encryption Standard (AES). The main goal of developing dedicated software for donor and sample registration at satellite hospitals was to introduce a user-friendly system that allows fast data synchronization with the central server, independence from network access and proper function also in an unstable internet connection environment. MABServer software is a proxy system (located at the 3P-Medicine Laboratory at the Medical University of Gdansk) that holds data from all hospitals for further use. Moreover, this system was developed also for safety reasons; in case of any MABData1 computer failure all data is kept at MABServer, which is included in the central backup schedule at the 3P-Medicine Laboratory. The management of samples and data locally is being assisted with MABData2 system, allowing donor and sample pseudonymization, registration of original samples from hospitals and numerous types of derivative specimens, together with many additional parameters, adding extra attachments, simple and advanced searching and data exporting, as well as inventory of samples. MABData2 is a complete, stand-alone biobank management system that supports not only current biobanking project, but is also the main software solution, connecting data from other projects and collections for further cross-searches. The documentation received on paper is pseudonymized, scanned and uploaded to the MABData2 system, using implementation of Optical Character Recognition (OCR) algorithms inserted into the database.

Bioethical approval

All procedures for sample collection were approved by the Independent Bioethics Committee for Research at the Medical University of Gdansk (approval number NKBBN/564/2018 with multiple amendments). This approval is valid for collection of samples at multiple collaborating hospitals. Since our collection of samples is still ongoing, our initial ethical approval provides us with the possibility to extend the scope of biobanking for additional diagnoses, after an amendment of the application. Written informed consent was obtained from all the patients prior to surgery. All procedures were performed in accordance with the relevant national and international laws and guidelines as well as in compliance with European Union General Data Protection Regulation (EU GDPR).

Other laboratory procedures

DNA extraction from frozen solid tissues was performed using standard phenol/chloroform method with several in-house modifications (depending on the tissue type) and lysis buffer with SDS (0.5% SDS, 60mM Tris pH 7.9, 5 mM EDTA) or sarcosine and SDS (2% sarcosine, 0.5% SDS, 50mM Tris pH 7.9, 10 mM EDTA); DNA from whole blood was acquired using QIAamp DNA Blood Midi Kit (Qiagen) or QuickGene DNA whole blood kit S (Kurabo) with QuickGene-Mini 480 instrument (Kurabo), DNA from sorted cells was extracted either using sarcosine lysis buffer (1% sarcosine, 10 mM Tris-HCl, 50mM NaCl, 10 mM EDTA) followed by ethanol precipitation (<500 000 cells) or with QIAamp DNA Mini Kit (Qiagen) (>500 000 cells). All laboratory processes using our samples were carried out according to standard protocols including: sorting of leukocytes on FACS machines, establishing primary cultures, extraction of DNA and RNA for various downstream applications; e.g. droplet digital PCR (ddPCR), construction of NGS libraries for targeted DNA/RNA sequencing.

Results

Rationale

The main rationale and aims of our biobanking program are systematic explorations PZMs especially in normal tissues, which is an understudied field of cancer research. We also incorporate material for studies of mosaic LOY in males that might predispose to various diseases, with a particular focus on cancer. The selected diagnoses cover sporadic breast-, colorectal-, prostate-, bladder- and pancreatic carcinomas and represent consecutively collected patients, affected by common and often fatal diseases. The setup of our biobank should allow a wide range of “omics”- and other methods to be applied in studies of the collected clinical material.

General statistics

The first patient was registered in the database on June 5, 2019 and statistics described here are up to May 12, 2021. Our collection originating from five clinical cancer centers, after nearly 2-years of cooperation, reached in total 1963 donors. However, 1711 of these were collected effectively (i.e. with all tissue types from the collection protocols, described above in Materials and Methods as compulsory set) yielding 23226 independent samples (Fig 4A and 4B). This results with an average of 74 donors and 1010 samples collected per month. Incomplete sampling affected 13% of all donors and was caused by several factors, such as small size of tumor resulting in insufficient amount of tissue material, prolonged surgery time and inability to prepare material by pathologists the same day, as well as unresectability of the tumor. The predominant diagnosis was breast carcinoma (933 donors), followed by colorectal- (383 donors), prostate- (221 donors), urinary bladder- (81 donors), exocrine pancreas carcinomas (15 donors) and metachronous metastases of colorectal cancer to liver (14 donors). We also collected blood and buccal swabs from 64 healthy male control subjects that were age matched for the cohort of males with colorectal- and prostate carcinoma (Table 2). The average age of male oncological patients was 67 years (1 SD ±9, range 33–93 years), while for female oncological donors it was 62 years (1 SD ±13, 24–92 years) and for healthy control subjects 71 years (1 SD ±5, 61–91 years).

Fig 4. The statistics of donors and samples collected in five collaborating hospitals; status as of May 12, 2021.

Fig 4

(A) The total number of donors with compulsory set of samples (as described in Materials and Methods-section and shown in Figs 1 and 2). (B) The sum of all samples collected from recruited donors. The numbers for donors and samples are divided for six different cancer diagnoses and controls (*). The control (*) category represents a healthy male cohort ≥ 65-year-old recruited as controls for patients with prostate- and colorectal cancer, used in the Loss of Y Chromosome (LOY) project. (C–F) show distribution of diagnoses according to International Classification of Diseases (ICD-10, World Health Organization) for breast, colorectal, bladder and pancreatic cancer patients, respectively. Abbreviations: C50, Malignant neoplasm of breast; C50.0, Nipple and areola; C50.1, Central portion of breast; C50.2, Upper-inner quadrant of breast; C50.3, Lower-inner quadrant of breast; C50.4, Upper-outer quadrant of breast; C50.5, Lower-outer quadrant of breast; C50.6, Axillary tail of breast; C50.8, Overlapping lesion of breast; C50.9, Breast, unspecified; C18, Malignant neoplasm of colon; C18.0, Caecum, Ileocaecal valve; C18.1, Appendix; C18.2, Ascending colon; C18.3, Hepatic flexure; C18.4, Transverse colon; C18.5, Splenic flexure; C18.6, Descending colon; C18.7, Sigmoid colon, Sigmoid (flexure); C18.8, Overlapping lesion of colon;C18.9, Colon, unspecified, Large intestine, unspecified; C19, Malignant neoplasm of rectosigmoid junction, including colon with rectum, rectosigmoid colon; C20, Malignant neoplasm of rectum, Including rectal ampulla; C21, Malignant neoplasm of anus and anal canal; C21.0, Anus, unspecified, excluding anal margin and perianal skin; C21.1, Anal canal, Anal sphincter; C67, Malignant neoplasm of bladder; C67.0, Trigone of bladder; C67.2, Lateral wall of bladder; C67.3, Anterior wall of bladder; C67.4, Posterior wall of bladder; C67.5, Bladder neck, Internal urethral orifice; C67.7, Urachus; C67.8, Overlapping lesion of bladder; C67.9, Bladder, unspecified; C25, Malignant neoplasm of pancreas; C25.0, Head of pancreas; C25.1, Body of pancreas; C25.2, Tail of pancreas. ND–not yet defined due to temporary lack of medical documentation.

Table 2. A summary of donors and cancer diagnoses included in the collection (status as of May 12, 2021).

Diagnosis Sex distribution Average age Average samples (range) Clinical information
Breast cancer
(ICD-10 C50)
F– 99% (n = 921)
M– 1% (n = 12)
60 y ± 13
64 y ± 8
13 (7–32) • Mastectomy– 42% (n = 391)
• BCT– 53% (n = 494)
• Non-specified* – 5% (n = 48)
Colorectal cancer
(ICD-10 C18 –C21)
F– 45% (n = 174)
M –55% (n = 209)
66 y ± 12
67 y ± 10
16 (7–27)
Liver metastasis (colorectal cancer)
(ICD-10 C78.7)
M– 86% (n = 11)
F– 14% (n = 3)
65 y ± 11
66 ± 4
11 (7–15)
Prostate cancer
(ICD-10 C61)
M– 100% (n = 221) 65 y ± 7 15 (7–19) Gleason score:

• 3+2–0.5% (n = 1)
• 3+3–16% (n = 35)
• 3+4–28% (n = 61)
• 4+3–28% (n = 61)
• 4+4–2% (n = 5)
• 4+4+5–0.5% (n = 2)
• 4+5–1% (n = 3)
• Non-specified* – 24% (n = 53)
Bladder cancer
(ICD-10 C67)
F– 21% (n = 17)
M– 79% (n = 64)
68 y ± 8
69 y ± 9
12 (7–17) • TURBT– 47% (n = 38)
• Cystectomy– 53% (n = 43)
Pancreatic cancer
(exocrine, ICD-10 C25)
Female– 47% (n = 7)
Male– 53% (n = 8)
68 y ± 6
66 y ± 9
10 (8–11)
Control group** Male– 100% (n = 64) 71 ± 5 7 (6–12)

F–female; M–male; *—information not yet available and incorporated in our database; y–years; BCT–Breast Conservative Therapy. ICD codes: C50, Malignant neoplasm of breast; C18, Malignant neoplasm of colon; C19, Malignant neoplasm of rectosigmoid junction; C20, Malignant neoplasm of rectum; C21, Malignant neoplasm of anus and anal canal; C78.7, Secondary malignant neoplasm of liver and intrahepatic bile duct; C61, Malignant neoplasm of prostate; C67, Malignant neoplasm of bladder; C25, Malignant neoplasm of pancreas; TURBT–Transurethral Resection of Bladder Tumor.

**—Healthy male cohort ≥ 65 years old recruited as controls for the male patients with prostate and colorectal cancer for whom the white blood cell fractions were sorted by FACS to study loss of chromosome Y.

Fig 4C–4F shows the distribution of ICD-10 codes within particular diagnoses: C50 (malignant neoplasm of breast), C18—C21 (malignant neoplasm of colon, rectosigmoid junction, rectum, anus and anal canal), C67 (malignant neoplasm of bladder), and C25 (malignant neoplasm of pancreas). Within colorectal diagnoses clear predominance of rectal cancers (C20, 99 cases) is notable, followed by tumors of caecum (C18.0, 57 donors) and sigmoid colon (C18.7, 53 cases). In bladder cancer patients, the affected sites include trigone and lateral wall of the bladder (C67.0 and C67.2 and 34 and 25 donors, respectively). The vast majority of pancreatic tumors are located in the body of the organ (C25.0, 12 patients). In addition to ICD-10 diagnoses, we also gathered other clinical information regarding the donors recruited to the project, which is summarized in Table 3. It covers basic clinical data: imaging results for the patient, type of surgery, dates of initial diagnosis and treatment, blood count, as well as full histopathological report with data for microscopic examination of all resected tissue fragments. Moreover, we collected information from each donor via medical questionnaires covering oncological history in the family, chronic illnesses and smoking habits.

Table 3. Type of data collected from medical records of patients.

Document Information
Registration form in the hospital  • Clinical data available from the hospitals: CT, PET, MRI, RTG, USG, colonoscopy, mammography, urography, cystoscopy, scintigraphy;
 • Date of first diagnosis and treatment;
 • Type of surgery (radical cystectomy/TURBT, mastectomy/BCT, type of pancreatectomy).
Histopathological report  • Histopathological type of cancer;
 • Microscopic description of tumor/non-tumoral tissue;
 • Grade of cancer;
 • pTNM stage;
 • Gleason score (ICD-10 C61);
 • ER, PR, HER2 status (ICD-10 C50);
 • Ki-67 (ICD-10 C50);
 • Size of resected organ.
Patient questionnaire  • Smoking status;
 • Chronic illnesses;
 • Family history–oncological treatment, Alzheimer Disease.
Complete blood count  • Red Blood Count;
 • White Blood Count;
 • Platelets.

CT–Computer Tomography; PET–Positron Emission Tomography; MRI–Magnetic Resonance Imaging; RTG–radiography; USG–ultrasonography. ICD codes: C50, malignant neoplasm of breast; C61, malignant neoplasm of prostate; TURBT, Transurethral Resection of Bladder Tumor; BCT, Breast Conservative Therapy; ER, Estrogen Receptor status; PR, Progesterone Receptor status; HER2, status of the human epidermal growth factor receptor; Ki-67, marker of proliferation Ki-67.

The uniqueness of our collection is illustrated by the number of fragments dissected from macroscopically normal cancer-neighboring margin of tissue (named UM, LUM or UUM) (Figs 1 and 2), which accounts for nearly 40% of the total sample count, while the contribution from tumor samples is 12%. The remaining sample represent blood, plasma, skin and sorted leukocytes using FACS. Furthermore, whenever possible we also collected material from local metastases to lymph nodes, for a total of 117 and 80 donors with breast and colorectal cancer, respectively (1% of all samples). The implemented sampling procedures are demanding in terms of time needed for material preparation. The estimated average time devoted by the Pathology Departments is 98 minutes per patient to fulfill the requirements of our protocols (ranging from 60 minutes for liver metastasis resection to 155 minutes for prostatectomy), which include patient registration, blood processing for collection of tubes with peripheral blood and plasma, tissue dissection by pathologists, preparation of FFPE blocks/slides, and histopathological assessment. Thus, the collection from 1711 donors sums up to about 2800 working hours or about 70 working weeks with full time effort. This time represents only part of the entire project, as we do not include time spent on the following tasks: recruitment of patients by surgeons, collecting informed consents and filling of questionnaires, local transport within each hospital and from five partner hospitals to Gdansk, as well as acquisition of material by the 3P-Medicine Laboratory in Gdansk. Furthermore, in the time devoted to development and testing of software (MABData1 and MABData2) we do not count preparation of common protocols for sample collection as well as preparation of formal agreements with the participating hospitals.

Another important aspect of our biobanking approach is the histopathological verification of the tissue fragments that were macroscopically presumed to represent either tumor or non-tumor fragment and were later verified by analysis of FFPE sections. We performed such comparison for prostate cancer and colorectal cancer. As suspected, prostate cancer represents a challenge due to the lack of clear macroscopic demarcation of the tumor. The calculation on the representative cohort of 100 prostate cancer patients showed that out of 400 specimens collected presumably as tumorous tissue, 263 (65.7%) showed the presence of tumor cells. For the similar sample size of 100 colorectal cancer patients, 205 out of 205 (100%) presumed tumor samples were confirmed as containing tumor tissue. It is important to mention that each fragment selected for further molecular examination is verified with the histopathological report.

Analysis of smoking status, which is included in the medical questionnaire (Fig 5), revealed that nearly 50% of female cancer patients declared themselves as non-smokers and 31% as past-smokers. Corresponding numbers of non-smokers among males is clearly lower (28%) and past-smokers is much higher (54%). Similar numbers of females and males declared themselves as present smokers (16% and 14%, respectively), and this is also valid for control subjects (19%).

Fig 5. Smoking status declared by donors in the questionnaire.

Fig 5

(A) Female cohort. (B) Male cohort. (C) Control male group, as described in Materials and Methods.

Specific diagnoses

The summary of donors and samples for all diagnoses is shown in Table 2. Breast cancer is the most common type of cancer diagnosed world-wide in 2020 [22] (https://www.who.int/news-room/fact-sheets/detail/cancer) and it is also well represented in our collection, accounting for 55% of all donors, with obvious predominance of women and only 1% of males with breast cancer. The average age for females undergoing mastectomy or BCT is 60 years (1 SD ±13), and the corresponding number for males is 64 years (1 SD ±8). Counting all breast surgeries together, 53% are BCTs, and 42% are mastectomies, remaining 5% are not defined due to the temporary lack of medical documentation. Colorectal cancer is the second most common diagnosis in the collection (22% of donors), with the predominance of male patients (55% versus 45% of females), with the average age of initial surgical treatment similar for both sexes (66 versus 67 years). Due to the usually large extent of the resection of the colon, we reached the highest average sample count (n = 16) among all diagnoses. Prostate cancer in our assembly is diagnosed and treated on average at the age of 65 years, at a medium-grade stage: Gleason score 3+4 (27% of patients) and 4+3 (27%), low medium-grade: Gleason score 6 (15%) and rarely at a high-grade, Gleason score ≥8 (3.5%). The collection procedure for prostatectomy is the most complicated of all diagnoses and the average sample count for prostate cancer is the second highest (n = 15). Organ sparing treatment using TURBT, was applied for 47% of all gathered bladder cancer donors, which adds to the uniqueness of our collection. The frequency of bladder cancer among males is much higher than for females (79% versus 21%), with a similar average age of onset for both sexes (69 and 68 years, respectively). Exocrine pancreas cancer, which is frequently unresectable and the most fatal disease in our collection, stands for below 1% of all our donors and has a similar distribution of age and sex.

Well-defined recruitment criteria for volunteers as healthy controls for the LOY project (Table 1) resulted in a homogeneous group of male subjects, with an average age of 71 years and average sample count of seven. The starting material is peripheral blood for preparation of viable PBMCs and CD4+ T-cells and sorting of seven populations of leukocytes using FACS. Additionally, buccal swabs are collected as reference material for non-mesoderm-derived tissue. The same procedure for sorting of leukocytes, as above for controls, was applied for 20 prostate and 28 colorectal cancer patients.

Quality of derivative samples

The collected samples have already been used for subsequent research activities and a wide variety of downstream applications. Hence, we optimized and implemented several laboratory protocols that include DNA and RNA preparation and aliquoting. We present here the concentrations and quality of DNA extracted from solid tissue (PT and UMs), blood and the pellets of FACS sorted cells (Fig 6). The amount of tissue used for DNA preparation differs widely for various diagnoses and is lowest for bladder and prostate samples (starting from 2 mg of fresh frozen tissue), where the collected fragments usually do not exceed 8 mm3. However, the quality, integrity (DIN) and spectroscopic purity of DNAs were high for all samples (Fig 6B and 6C) with the exception of A260/230 ratio for DNAs from sorted cell pellets, especially from the subpopulations with low cell count after sorting (below 150 000 cells) (Fig 6C). This quality is, however, still sufficient to run ddPCR analysis.

Fig 6. DNA quality isolated from collected tissues.

Fig 6

(A) Concentration of DNA (ng/μl) obtained from blood and tissues from donors per diagnoses, measured with the fluorometric method (Qubit Fluorometric Quantification and/or Agilent TapeStation System). (B) DNA Integrity Number (DIN) for DNA obtained from blood and tissue in four diagnoses measured by Genomic DNA ScreenTape Analysis (Agilent). (C) DNA quality measured by UV-Vis spectroscopic method for DNA obtained from blood and tissues from donors with five diagnoses. The number of samples for each diagnosis that was used for calculations are: 88 for breast cancer; 202 for colorectal cancer; 3092 for prostate cancer; 189 for bladder cancer; 12 for pancreas cancer; and 62 for controls. The amount of frozen tissue used for DNA extractions range as follows: Breast cancer 15–60 mg; bladder cancer 2–19 mg; prostate cancer 6–43 mg; colorectal cancer 15–33 mg; pancreas cancer 12–21 mg; and controls (sorted leukocytes) 0.05x106 - 1x106 cells.

Dedicated software for dispersed biobanking

In a typical research project oriented on sample/donor collection, software solutions are built in central models as one system case with a single pipe for data input. In our project, we created a dedicated solution for dispersed biobanking at satellite hospitals. Each partner hospital is actually an independent biobank, equipped with both hardware and software solutions allowing sample/data registration as well as storage, including a possibility of a posteriori data verification and supplementation. This developed system and communication methods bring data safety standards that correspond to ISO-270001 requirements. The most important features are included in MABData2 software. This package is not only the tool dedicated for biobanking collection, administration and management, but also allows for customized data search, connecting different types of data and parameters describing samples and donors. Both MABData1 and MABData2 packages are owned by the Medical University of Gdansk, which has full license- and copyrights. Therefore, this in-house developed software allows independence for future development and for any number of users.

Discussion

One of the major aims of our biobanking effort is to gather a representative collection of samples from multiple common sporadic cancer diagnoses, which would primarily allow large-scale studies of early predisposing PZMs that initiate tumor development in normal cells from the affected organ. Consequently, the largest number of samples are derived from histologically verified non-tumorous tissue, which is located at various distances from the site of primary tumor. The design of our biobanking is based on our previous experience from studies of multifocal breast cancer [5,6,21]. Furthermore, availability of material from local metastases to lymph nodes for breast and colorectal cancer as well as distant metastases of colorectal cancer to the liver allows for assessment of tumorous tissue at different stages of disease. Moreover, we collect at least one additional reference tissue that is not related to tumor development; usually blood or skin (Figs 1 and 2). This reference material is important whenever a possibly pathogenic variant is found, in order to confidently exclude that it could represent a germline mutation. The information regarding oncological family history gathered from the patient questionnaire might also be helpful in such cases. Collection of multiple tubes of plasma (liquid biopsy) from donors suffering from cancer and from healthy controls is also crucial for future genetic and proteomic analyses. The highest number of independent samples per single donor is thirty-two (Table 2). Thus, according to the official BBMRI-ERIC directory (https://directory.bbmri-eric.eu/#/), our collection is comprehensive and represents the largest assembly of samples oriented towards studies of early events in cancer development. This collection will allow essentially all available “-omics” approaches on DNA-, RNA-, protein- and tissue levels as well as many other methodological approaches to be applied. Our biobanking project also assumes acquisition of long-term follow-up (3 to 10 years after treatment) for recruited patients, which will be an important added value. As mentioned above, our biobanking effort is still ongoing and we have an opportunity to modify collection protocols and include other cancer diagnoses. The collected biological material and clinical data can be made available for other investigators after a request to both corresponding authors. The letter should outline the aim, number/type of requested samples and methodology of the proposed collaborative project. Such scientific cooperation can be established based on bilateral scientific cooperation agreement.

The number of recruited donors already provides us a good perspective regarding statistics of the four most frequent diagnoses in our study. For the breast carcinoma, a clear predominance (32%) of C50.9 diagnosis (Breast, unspecified) is apparent together with the relative high incidence of ICD 50.8 (Malignant neoplasm of overlapping sites of breast), which suggest unclear location or lack of the record in the patient documentation. This is followed by the carcinomas located in upper outer (ICD 50.4–27%), lower outer (ICD 50.5–9%), and upper inner (ICD 50.2–8%) quadrants of the breast. This trend reflects the distribution observed in other large-scale study [23]. In the case of colorectal cancer, the location of tumor is a crucial factor determining molecular type, disease progression, prognosis, treatment and outcome [24]. In our collection 38% of cases are right-sided colon cancers (RSCC/proximal): ICD-10 from C18.0 to C18.5 and 61% represent left-sided colon cancers (LSCC/distal): C18.6 to C19, C20 (Fig 4D). The distribution is in accordance with a larger study [25]. RSCCs are generally higher in females and have worse overall prognosis—in our study it is comparable for both sexes (21% females vs 19% males), the opposite trend is observed in LSCC (25% for females and 35% in this study). Such a representation and distribution of donors and samples in the collection will enable the examination of molecular heterogeneity, etiology and progression of these cancers.

For breast- and urinary bladder cancer patients, we collected samples derived from two different surgical procedures. Breast cancer operations can be performed using either mastectomy or BCT and the frequency of these two surgeries in our material are 42% and 53%, respectively. The tumors removed via BCT are diagnosed at earlier stage of development, are typically smaller and these patients usually do not show suspicion of multifocal tumor development. The trend in our cohort reflects the worldwide shift in surgical approach from mastectomy to BCT [26]. Bladder cancer patients are qualified for either cystectomy or sparing therapy using (TURBT) and their respective frequencies of these treatments in our collection are 53% and 47%. Hence, we have an opportunity to molecularly study these two common cancers in patients with different grades of severity of the disease, which represents an added value.

It is well known that males have a higher incidence and mortality from most sex-unspecific cancers, which is largely unexplained by known risk factors [27,28]. This substantial cancer-related sex-disparity appears to be a neglected aspect of cancer research. Our collection of donors for bladder- and colorectal cancer confirms this. Bladder cancer is much more frequent among males than females (79 versus 21), with a similar average age of onset for both sexes (69 and 68 years, respectively) (Table 2). Colorectal cancer also shows male predominance (55% males vs. 45% females), although the difference is less pronounced. Smoking habits among males in Poland might be, at least partially, responsible for these differences. As shown in Fig 5, 47% and 68% oncological female and male patients, respectively, declared themselves as past- or present smokers, and there is a proven correlation between cancer incidence, smoking habits and LOY for men [18,29,30]. Our ongoing effort to obtain sorted subpopulations of leukocytes using FACS from males treated for prostate and colorectal cancer will help to study and possibly provide further support for this hypothesis.

Supporting information

S1 Table. Pseudonymized list of patients that were included in the statistics presented in the manuscript with the information on diagnosis, age, sex, ICD10, type of surgery where applicable, number of original samples and the status of smoking declared in the medical questionnaire.

(XLSX)

Acknowledgments

We thank all the patients and volunteer controls for acceptance to participate, sample contribution and information provided in the medical questionnaire. We thank Dr. Leszek Kalinowski for use of the temporary office, laboratory and freezer space as well as access to other laboratory facilities. We acknowledge Drs. Darek Kędra, Marco Günthel, and Paweł Olszewski for consultations regarding laboratory and bioinformatic procedures. We also thank physicians and nurses involved in the patient recruitment process, collaborating technicians, diagnosticians and pathologists from: Oncology Center—Prof. Franciszek Łukaszczyk Memorial Hospital in Bydgoszcz (Jowita Nowaczewska, Katarzyna Krzysiak, Anetta Słupicka, Mateusz Matusiak); Maria Skłodowska-Curie National Research Institute of Oncology in Kraków (Justyna Wajda, Dorota Lech, Kaja Majchrzyk); University Clinical Centre in Gdańsk (Wojciech Biernat, Wojciech Połom, Jakub Gondek, Tomasz Cwaliński, Grażyna Stęplewska, Elżbieta Wierszyło, Elżbieta Pietruszka, Grażyna Dombrowska, Paweł Górny, Małgorzata Derwis, Justyna Pietruszewska, Irena Pellowska, Michał Kunc, Aleksandra Korwat, Ewa Miłoszewska, Aleksandra Kaczor, Dawid Foltynowski); University Hospital in Cracow (Weronika Natkaniec, Magdalena Smolik, Małgorzata Molus, Jadwiga Gałek, Hieronim Strojniak, Adrian Głownia, Anna Janas, Iwona Zawadzka, Monika Cała, Joanna Ciężarek, Shymko Anzhela, Izabela Pabisz-Zarębska); and Specialist Hospital in Kościerzyna (Atanasiu Apostolis, Barbara Koenner, Michał Kujach, Renata Knuth). We thank Drs. Daniil Sarkisyan and Eva Tiensuu Janson for critical review of the manuscript.

Data Availability

We are not able to provide the access to our internal database MABData 2, as this is only available locally for the authorized researchers of our unit. However, we have prepared a minimal anonymized dataset in excel file with all the donors and clinical/medical data that were included in the paper. We provide this information as a supporting information file (S1 Table).

Funding Statement

This study was sponsored by the Foundation for Polish Science (FNP) under the International Research Agendas Program to J.P.D. and A.P., co-financed by the European Union under the European Regional Development Fund. Our biobank also received financing via the “Excellence Initiative - Research University'' program from Medical University of Gdansk. This project obtained further partial funding from The Swedish Cancer Society and Swedish Medical Research Council to J.P.D. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Frank C, Fallah M, Sundquist J, Hemminki A, Hemminki K. Population Landscape of Familial Cancer. Sci Rep. 2015;5:12891. Epub 2015/08/11. doi: 10.1038/srep12891 ; PubMed Central PMCID: PMC4530455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hemminki K, Sundquist J, Bermejo JL. How common is familial cancer? Ann Oncol. 2008;19(1):163–7. Epub 2007/09/07. doi: 10.1093/annonc/mdm414 . [DOI] [PubMed] [Google Scholar]
  • 3.Scheuner MT, McNeel TS, Freedman AN. Population prevalence of familial cancer and common hereditary cancer syndromes. The 2005 California Health Interview Survey. Genet Med. 2010;12(11):726–35. Epub 2010/10/06. doi: 10.1097/GIM.0b013e3181f30e9e . [DOI] [PubMed] [Google Scholar]
  • 4.Forsberg LA, Gisselsson D, Dumanski JP. Mosaicism in health and disease—clones picking up speed. Nat Rev Genet. 2017;18(2):128–42. Epub Dec 12. doi: 10.1038/nrg.2016.145 . [DOI] [PubMed] [Google Scholar]
  • 5.Ronowicz A, Janaszak-Jasiecka A, Skokowski J, Madanecki P, Bartoszewski R, Balut M, et al. Concurrent DNA Copy-Number Alterations and Mutations in Genes Related to Maintenance of Genome Stability in Uninvolved Mammary Glandular Tissue from Breast Cancer Patients. Hum Mutat. 2015;36(11):1088–99. Epub 2015/07/30. doi: 10.1002/humu.22845 . [DOI] [PubMed] [Google Scholar]
  • 6.Forsberg LA, Rasi C, Pekar G, Davies H, Piotrowski A, Absher D, et al. Signatures of post-zygotic structural genetic aberrations in the cells of histologically normal breast tissue that can predispose to sporadic breast cancer. Genome Res. 2015;25(10):1521–35. Epub 2015/10/03. doi: 10.1101/gr.187823.114 [pii]. ; PubMed Central PMCID: PMC4579338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gadaleta E, Fourgoux P, Pirro S, Thorn GJ, Nelan R, Ironside A, et al. Characterization of four subtypes in morphologically normal tissue excised proximal and distal to breast cancer. NPJ Breast Cancer. 2020;6:38. Epub 2020/09/05. doi: 10.1038/s41523-020-00182-9 ; PubMed Central PMCID: PMC7442642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Martincorena I, Roshan A, Gerstung M, Ellis P, Van Loo P, McLaren S, et al. High burden and pervasive positive selection of somatic mutations in normal human skin. Science. 2015;348(6237):880–6. doi: 10.1126/science.aaa6806 ; PubMed Central PMCID: PMC4471149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Yizhak K, Aguet F, Kim J, Hess JM, Kubler K, Grimsby J, et al. RNA sequence analysis reveals macroscopic somatic clonal expansion across normal tissues. Science. 2019;364(6444). Epub 2019/06/07. doi: 10.1126/science.aaw0726 ; PubMed Central PMCID: PMC7350423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Lee-Six H, Olafsson S, Ellis P, Osborne RJ, Sanders MA, Moore L, et al. The landscape of somatic mutation in normal colorectal epithelial cells. Nature. 2019;574(7779):532–7. Epub 2019/10/28. doi: 10.1038/s41586-019-1672-7 . [DOI] [PubMed] [Google Scholar]
  • 11.Lawson ARJ, Abascal F, Coorens THH, Hooks Y, O’Neill L, Latimer C, et al. Extensive heterogeneity in somatic mutation and selection in the human bladder. Science. 2020;370(6512):75–82. doi: 10.1126/science.aba8347 [DOI] [PubMed] [Google Scholar]
  • 12.Forsberg LA, Rasi C, Razzaghian H, Pakalapati G, Waite L, Stanton Thilbeault K, et al. Age-related somatic structural changes in the nuclear genome of human blood cells. Am J Hum Genet. 2012;90(2):217–28. doi: 10.1016/j.ajhg.2011.12.009 ; PubMed Central PMCID: PMC3276669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Forsberg LA, Rasi C, Malmqvist N, Davies H, Pasupulati S, Pakalapati G, et al. Mosaic loss of chromosome Y in peripheral blood is associated with shorter survival and higher risk of cancer. Nature Genetics. 2014;46(6):624–8. Epub 2014/04/30. doi: 10.1038/ng.2966 [pii]. . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Jaiswal S, Fontanillas P, Flannick J, Manning A, Grauman PV, Mar BG, et al. Age-related clonal hematopoiesis associated with adverse outcomes. N Engl J Med. 2014;371(26):2488–98. doi: 10.1056/NEJMoa1408617 ; PubMed Central PMCID: PMC4306669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Martincorena I, Fowler JC, Wabik A, Lawson ARJ, Abascal F, Hall MWJ, et al. Somatic mutant clones colonize the human esophagus with age. Science. 2018;362(6417):911–7. Epub 2018/10/20. doi: 10.1126/science.aau3879 ; PubMed Central PMCID: PMC6298579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Jacobs PA, Brunton M, Court Brown WM, Doll R, Goldstein H. Change of human chromosome count distribution with age: evidence for a sex differences. Nature. 1963;197:1080–1. Epub 1963/03/16. doi: 10.1038/1971080a0 . [DOI] [PubMed] [Google Scholar]
  • 17.Pierre RV, Hoagland HC. Age-associated aneuploidy: loss of Y chromosome from human bone marrow cells with aging. Cancer. 1972;30(4):889–94. Epub 1972/10/01. doi: [DOI] [PubMed] [Google Scholar]
  • 18.Dumanski JP, Rasi C, Lonn M, Davies H, Ingelsson M, Giedraitis V, et al. Smoking is associated with mosaic loss of chromosome Y. Science. 2015;347(6217):81–3. Epub 2014/12/06. doi: 1262092 [pii] science.1262092 [pii] doi: 10.1126/science.1262092 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Dumanski JP, Lambert JC, Rasi C, Giedraitis V, Davies H, Grenier-Boley B, et al. Mosaic Loss of Chromosome Y in Blood Is Associated with Alzheimer Disease. Am J Hum Genet. 2016;98(6):1208–19. doi: 10.1016/j.ajhg.2016.05.014 ; PubMed Central PMCID: PMC4908225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Dumanski J, Halvardson J, Davies H, Rychlicka-Buniowska E, Mattisson J, Torabi Moghadam B, et al. Immune cells lacking Y chromosome show dysregulation of autosomal gene expression. Cell Mol Life Sci. 2021. https://www.biorxiv.org/content/10.1101/673459v2 doi: 10.1007/s00018-021-03822-w . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Pekar G, Davies H, Lukacs AP, Forsberg L, Hellberg D, Dumanski J, et al. Biobanking multifocal breast carcinomas: sample adequacy with regard to histology and DNA content. Histopathology. 2016;68(3):411–21. doi: 10.1111/his.12758 . [DOI] [PubMed] [Google Scholar]
  • 22.Heer E, Harper A, Escandor N, Sung H, McCormack V, Fidler-Benaoudia MM. Global burden and trends in premenopausal and postmenopausal breast cancer: a population-based study. Lancet Glob Health. 2020;8(8):e1027–e37. Epub 2020/07/28. doi: 10.1016/S2214-109X(20)30215-1 . [DOI] [PubMed] [Google Scholar]
  • 23.Yu C, Mitchell JK. Non-randomness of the anatomical distribution of tumors. Cancer Converg. 2017;1(1):4. Epub 2017/01/01. doi: 10.1186/s41236-017-0006-7 ; PubMed Central PMCID: PMC5876694. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Cancer Genome Atlas N. Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012;487(7407):330–7. Epub 2012/07/20. doi: 10.1038/nature11252 ; PubMed Central PMCID: PMC3401966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Guinney J, Dienstmann R, Wang X, de Reynies A, Schlicker A, Soneson C, et al. The consensus molecular subtypes of colorectal cancer. Nature medicine. 2015;21(11):1350–6. Epub 2015/10/13. doi: 10.1038/nm.3967 ; PubMed Central PMCID: PMC4636487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Margenthaler JA, Dietz JR, Chatterjee A. The Landmark Series:Breast Conservation Trials (including oncoplastic breast surgery). Annals of Surgical Oncology. 2021;28(4):2120–7. doi: 10.1245/s10434-020-09534-y [DOI] [PubMed] [Google Scholar]
  • 27.Cook MB, McGlynn KA, Devesa SS, Freedman ND, Anderson WF. Sex disparities in cancer mortality and survival. Cancer Epidemiol Biomark Prev. 2011;20(8):1629–37. Epub 2011/07/14. doi: 10.1158/1055-9965.EPI-11-0246 ; PubMed Central PMCID: PMC3153584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Edgren G, Liang L, Adami HO, Chang ET. Enigmatic sex disparities in cancer incidence. Eur J Epidemiol. 2012;27(3):187–96. Epub 2012/01/04. doi: 10.1007/s10654-011-9647-5 . [DOI] [PubMed] [Google Scholar]
  • 29.Loftfield E, Zhou W, Yeager M, Chanock SJ, Freedman ND, Machiela MJ. Mosaic Y Loss Is Moderately Associated with Solid Tumor Risk. Cancer Res. 2019;79(3):461–6. doi: 10.1158/0008-5472.CAN-18-2566 ; PubMed Central PMCID: PMC6359954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Loftfield E, Zhou W, Graubard BI, Yeager M, Chanock SJ, Freedman ND, et al. Predictors of mosaic chromosome Y loss and associations with mortality in the UK Biobank. Sci Rep. 2018;8(1):12316. doi: 10.1038/s41598-018-30759-1 . [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Isaac Yi Kim

3 Dec 2021

PONE-D-21-28020Comprehensive cancer-oriented biobanking resource of human samples for studies of post-zygotic genetic variation involved in cancer predispositionPLOS ONE

Dear Dr. Filipowicz,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jan 17 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Isaac Yi Kim, MD, PhD, MBA

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. For descriptions of databases/biobanks, please provide details about how the data were curated, as well as plans for long-term database/biobank maintenance, growth, and stability. Please provide a direct link to the database/biobank hosting site from within the paper. See our submission guidelines for further details  at http://journals.plos.org/plosone/s/submission-guidelines#loc-methods-software-databases-and-tools

3. Thank you for stating the following in the Acknowledgments Section of your manuscript: 

"This study was sponsored by the Foundation for Polish Science (FNP) under the International Research Agendas Program (grant number MAB/2018/6) to JPD and AP, co-financed by the European Union under the European Regional Development Fund. Our collection also received financing via the “Excellence Initiative - Research University'' program from Medical University of Gdansk. This project obtained further partial funding from The Swedish Cancer Society and Swedish Medical Research Council to JPD"

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. 

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: 

"This study was sponsored by the Foundation for Polish Science (FNP) under the International Research Agendas Program to J.P.D. and A.P., co-financed by the European Union under the European Regional Development Fund. Our biobank also received financing via the “Excellence Initiative - Research University'' program from Medical University of Gdansk. This project obtained further partial funding from The Swedish Cancer Society and Swedish Medical Research Council to J.P.D."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf

4. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. 

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide

5. We note that Figure 1 in your submission contain copyrighted images. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission:

1. You may seek permission from the original copyright holder of Figure 1 to publish the content specifically under the CC BY 4.0 license. 

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission. 

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

2. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript describes large scale biobanking that can be utilized for genomic analysis for cancer predisposition.

- As the authors mentioned, prostatic adenocarcinoma is usually multifocal and difficult to identify macroscopically. How are the PT and UM tissues verified that they in fact contain tumor and non-tumor tissues, respectively?

- Similarly, post-treatment tumors are difficult to identify macroscopically. How are the tissues verified?

- The protocol described is dependent on the presumptive diagnoses and macroscopic analysis for tissue collection. At times, the final pathology reports will not match the presumptive diagnoses. The procedure to address this issue needs to be explained.

Reviewer #2: Reviewer Comments

The biobanking approach by the authors in not an innovative idea or approach. However, the authors did an excellent and very thorough job in demonstrating a useful model of specimen coordination and tissue collection from multiple satellite sites. Below are questions and comments that should be addressed in the main text of the manuscript, which will help clarify missing details of the biobanking processes.

- Please indicate how specific regions of tumor harvest were determined? For example, was MRI used to confirm lesion or no lesion regions for the prostate cancer patients? What was the percentage of total number of tumors actually collected compared to the total number of tumors attempted to be collected (for each tumor type)?

- What considerations were taken in collecting samples from tumors with high heterogeneity? For each fragment of collected solid tissue, it was mentioned that a control tissue taken for H&E? How were “uninvolved margin” regions confirmed?

- For tumors that were used for multiple techniques such as H&E, OCT, etc., was the tumor split from the original sample or was adjacent tumor regions taken? If the later, how do you address potentially heterogeneity concerns during collection?

- How are tissue handled that are not 100% cancer? For example, what were done with tumor cores that contained a percentage of non-cancer tissue or tumor cores that had non-continuous cancer? How did this impact the freezing/storage of the tissue pieces? Were the non-cancer regions dissected from the cancer regions before freezing/storage?

- What specific techniques (such as frozen sections) were used by the pathologists in determining non-tumor regions and cancer regions? Were frozen sections taken at the time of surgery or during tissue processing to confirm the cancer and non-cancer regions for every tissue sample collected including tumor, UM regions, and matched controls?

- For breast cancer tissue collection, it was mentioned that a subset of tissues were collected directly from the surgical operations. Was the pathology of these tissue pieces also confirmed by frozen sections before making organoid and/or primary cell lines?

- What procedures/protocols were used to determine if local mets would be collected? Was the collection of mets pre-planned? How was biobanking and patient care both satisfied during met collection? In general, how were the mets collected/processed (i.e., FFPE, OCT, etc.)?

- For bladder cancer, TURBT procedures can be used for clinical diagnosis/staging in order to determine effective treatment plans. Did you encounter any embargoing of the tissue from TURBT procedures especially for heterogeneous areas of the bladder? Were there any issues with tissue collection for the biorepository that was canceled due to the need for clinical diagnosis/staging? If not, please explain how this was avoided?

- Were the pathologists used for biobanking all sub-specialized? For examples, was a GU pathologist used for bladder and prostate tissue collection? Why was it or why not was it necessary to have such sub-specialists in place?

- It was mentioned that “…collection from 1711 donors sums up to about 2800 working hours or about 70 working weeks with full time effort.” During the tissue collection pipeline, how much extra time was added to the clinical pipelines of surgery and tissue processing? Can you breakdown the 2800 working hours by task? For example, how many pathologists’ hours were needed?

- “The collected samples can be made available to other research groups.” Who gets to use the tissues? Is there a system in place to share the collected tissue? How will tissue usage be prioritized?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Apr 7;17(4):e0266111. doi: 10.1371/journal.pone.0266111.r002

Author response to Decision Letter 0


16 Jan 2022

Re: revision of the manuscript PONE-D-21-28020 “Comprehensive cancer-oriented biobanking resource of human samples for studies of post-zygotic genetic variation involved in cancer predisposition”

General comment regarding answers to the below questions/comments from Reviewer #1 and Reviewer #2

There is a large overlap between the issues raised and we reply by cross-referring between our answers to both of the reviewers, to avoid unnecessary repetitions of the text. Therefore, we believe that all our answers should be shown to both reviewers.

Reviewers' comments and our answers:

Reviewer #1: The manuscript describes large scale biobanking that can be utilized for genomic analysis for cancer predisposition. -

1. As the authors mentioned, prostatic adenocarcinoma is usually multifocal and difficult to identify macroscopically. How are the PT and UM tissues verified that they in fact contain tumor and non-tumor tissues, respectively?

2. Similarly, post-treatment tumors are difficult to identify macroscopically. How are the tissues verified?

3. The protocol described is dependent on the presumptive diagnoses and macroscopic analysis for tissue collection. At times, the final pathology reports will not match the presumptive diagnoses. The procedure to address this issue needs to be explained.

Reviewer #1: Answer to questions 1-3:

These three questions are closely related and their primary relevance is prostatic adenocarcinoma, because of recurrent problems to macroscopically define the location of primary tumor. The second major diagnosis where these questions are also quite relevant is identification of multifocal breast cancer. In our past experience, UMs taken from areas of macroscopically normal breast tissue (close, or occasionally at considerable distances from the primary tumor; see PMIDs 26430163 and 26219265) may sometimes contain tumor cells. Therefore, it is obviously important to characterize histologically each tissue fragment (presumed macroscopically normal or suspected to contain tumor tissue) that is biobanked, which we did (and continue collecting in this way, as the biobanking project is not yet finished).

The tissues were collected according to the well-defined protocol to address the above issue, i.e., after macro-sectioning of the resected organ, small tissue fragments were selected and excised for biobanking. Subsequently, each fragment was cut in half: one portion was placed into a cryovial and fresh-frozen at -80℃, while the other one was fixed in formalin, embedded in paraffin and underwent standard processing, sectioning and H&E staining (FFPE). The latter FFPE tissue sectioning was done along the cut surface closest to the fresh-frozen biobanked piece of tissue, so that the FFPE section is as much as possible representative for the tissue in the frozen specimen.

For instance, for prostate specimens, a total of twelve tissue fragments were biobanked, each followed by the pathological report on the actual tumor content in the matching sample. Therefore, despite the degree of uncertainty/discrepancy in the macroscopic assessment in some situations (i.e. for prostatic adenocarcinoma, multifocal breast cancer, pancreatic adenocarcinoma with coexistent chronic pancreatitis, tumors after neoadjuvant therapy), every single biobanked tissue fragment has its matching FFPE tissue that undergoes pathological verification of the actual tumor content. The results of this assessment is every time included in the routine patient histopathological report.

The above questions are similar to some of questions/comments from reviewer #2. Because of these concerns, we have modified our manuscript to emphasize this issue (please see page 8 and 9) . Furthermore, it is important to mention that each time, the biobanked material is selected for further molecular analysis(es), the presence/absence of tumor/non-tumor tissue is being verified using the pathological reports from the participating hospitals. If necessary, an additional investigation of FFPE slides for both primary tumors and UMs that are archived, can be undertaken.

Reviewer #2: Reviewer Comments

The biobanking approach by the authors in not an innovative idea or approach. However, the authors did an excellent and very thorough job in demonstrating a useful model of specimen coordination and tissue collection from multiple satellite sites. Below are questions and comments that should be addressed in the main text of the manuscript, which will help clarify missing details of the biobanking processes.

1. Please indicate how specific regions of tumor harvest were determined? For example, was MRI used to confirm lesion or no lesion regions for the prostate cancer patients?

For most included patients and cancer types, the tumor is clearly identifiable macroscopically. However, as mentioned above for prostate cancer, the macroscopic assessment is sometimes unreliable. Therefore, pre-specified regions of the gland (central and peripheral areas of upper/central/lower part of each lobe, similarly to the mapping biopsy procedure) were harvested. Subsequently, all matching specimens underwent pathologic examination to verify the actual tumor content (see above answers to reviewer #1).

Whenever imaging analysis was performed prior to surgery, MRI and/or CT results were accessible for pathologists prior to collection tumor/non-tumor tissue from the prostate gland. We do not currently have data on how often MRI and/or CT results were available to the pathologists and how efficient these imaging data actually were in guiding pathologists during the macro-sectioning of the prostate. However, we confirm that the pathologists were using the MRI and/or CT results during macro-dissection of the prostate. This issue is actually an interesting future research question that could be addressed when we finish our collection of prostate cancers. We could then evaluate (using a much larger sample size) the effectiveness of targeting the primary tumor by the pathologist, when dissecting prostate.

Generally speaking, our protocol for prostate cancer sample collection was designed to cover most of the possible tumor locations, in order to sample both primary tumor and normal prostate tissue. Based on the presumable location of the tumor, we selected four likely tumor specimens (named PT1-4; see Figure 2C in the manuscript). Samples with presumably no lesions in the slice towards apex (LUM1-4) and towards base (UUM1-4) were analogically excised. All the specimens were further confirmed by post-operative histopathological examination. See also below answer to question 2.

2. What was the percentage of total number of tumors actually collected compared to the total number of tumors attempted to be collected (for each tumor type)?

Thank you for this comment, as this is an important aspect of our project. We presume that this question refers to the statistics of how often the presence of tumor tissue was verified by histopathological analysis of FFPE-sections in the samples that were macroscopically presumed to represent a tumor sample.

In response to this question, we performed such comparison for prostate cancer and colorectal cancer. As suspected, prostate cancer represents a challenge due to the lack of clear macroscopic demarcation of the tumor. The calculation on the cohort of 100 prostate cancer patients showed that out of 400 specimens collected presumably as tumorous tissue, 263 (65.7%) showed the presence of tumor cells. For the similar sample size of 100 colorectal cancer patients, 205 out of 205 (100%) presumed tumor samples were confirmed as containing tumorous tissue. It is again important to mention (see also below question 3) that each fragment selected for further molecular examination is verified with the histopathological report. We added the text summarizing the above comparison in the results section of the paper, on page 18.

We do not yet have similar statistics for other diagnoses.

3. What considerations were taken in collecting samples from tumors with high heterogeneity? For each fragment of collected solid tissue, it was mentioned that a control tissue taken for H&E? How were “uninvolved margin” regions confirmed?

Please see answers above. Each collected specimen has a matching FFPE fragment that undergoes the histological verification. This allows us to asses very well the heterogeneity of different pieces of the primary tumor and metastases.

4. For tumors that were used for multiple techniques such as H&E, OCT, etc., was the tumor split from the original sample or was adjacent tumor regions taken? If the later, how do you address potentially heterogeneity concerns during collection?

Please see above the answer to reviewer #1. Each collected sample was split in half and the splitting surface underwent histological assessment of the actual tissue content (i.e. each resected tissue fragment had two halves: biobanked fragment and reference one for FFPE). This approach provides only a general representation of the actual content of the biobanked material, leaving some uncertainty margin, as tissue areas 1 mm apart may differ. It is, however, the best way that it can be done. We cannot be 100% certain that the tissue used for DNA/RNA/protein extraction is identical to what is present on the histological slide.

5. How are tissue handled that are not 100% cancer? For example, what were done with tumor cores that contained a percentage of non-cancer tissue or tumor cores that had non-continuous cancer? How did this impact the freezing/storage of the tissue pieces? Were the non-cancer regions dissected from the cancer regions before freezing/storage?

It is safe to say that none of the tumor tissue specimens is containing 100% cancer, as cancer cells are always intermingled with a quite variable proportion of immune cells, fibroblasts, vessels etc. The histological verification of the tissue content was performed post-factum and there were no attempts to remove the non-neoplastic tissues. However, our samples can be used for future micro-dissection projects, if motivated by findings using bulk of the cells in a sample, to verify these results.

6. What specific techniques (such as frozen sections) were used by the pathologists in determining non-tumor regions and cancer regions? Were frozen sections taken at the time of surgery or during tissue processing to confirm the cancer and non-cancer regions for every tissue sample collected including tumor, UM regions, and matched controls?

We did not perform intraoperative pathological assessment (frozen sections) for the patients that were biobanked in our project, since this process is much more time consuming. We were aiming at shortening time between organ resection and snap freezing of the fragments (up to 2h), which is of importance for the quality of genetic material got later from the collected tissue fragments (especially RNA integrity). Moreover, intra-operative sampling decreases the amount of tumor tissue that is available for later diagnostic procedure using FFPE and material that can be biobanked. Using the approach of two halves of tissue fragments - biobanked and reference for FFPE (see above point 4), we could confidently confirm the content of biobanked material. The drawback of this approach is that the tissue fragments selected for harvesting based on the macroscopic assessment had to be verified after the surgery. Thus, as we mentioned above, each time, the frozen material is selected for further molecular analysis(es), the presence/absence of tumor/non-tumor tissue is being verified using the pathological reports from the participating hospitals.

7. For breast cancer tissue collection, it was mentioned that a subset of tissues were collected directly from the surgical operations. Was the pathology of these tissue pieces also confirmed by frozen sections before making organoid and/or primary cell lines?

The procedure for UM (uninvolved margin) and S (skin) collection for primary cell cultures was the only exception, where the fragments of tissue were collected directly at the operating theatre due to the necessity to keep the strictly sterile conditions. The morphology for those fragments were not confirmed by microscopical assessment. However, the UM fragment for patients undergoing BCT was collected from the most distant possible location of the breast, in comparison to the location of primary tumor. For mastectomy patients, UM and S fragments were always collected from the opposite quadrant of the breast.

8. What procedures/protocols were used to determine if local mets would be collected? Was the collection of mets pre-planned? How was biobanking and patient care both satisfied during met collection? In general, how were the mets collected/processed (i.e., FFPE, OCT, etc.)?

The collection of local metastases was optional, in case of breast and colorectal carcinoma patients. To ensure proper diagnosis and patient care, nodal metastases were only collected if they were clearly identifiable and large enough on gross examination. The metastases were processed in the same way as the other specimens. We added respective information about that on page 8 of the manuscript.

9. For bladder cancer, TURBT procedures can be used for clinical diagnosis/staging in order to determine effective treatment plans. Did you encounter any embargoing of the tissue from TURBT procedures especially for heterogeneous areas of the bladder? Were there any issues with tissue collection for the biorepository that was canceled due to the need for clinical diagnosis/staging? If not, please explain how this was avoided?

Collection of samples from TURBT procedures is the most challenging protocol considering the limited amount of tissue that is collected. The decision to enroll the patient into the biobanking project is taken by the surgeon and he/she takes into account several variables; e.g. size of the tumor and heterogeneity of the areas from which both tumor and a fragment(s) of normal tissue can be taken. However, the final decision to collect (or not) the fragments for biobanking is taken by the pathologist and this decision depends on the amount of tumor tissue that is available to safeguard the proper diagnostic procedure of the patient. The priority #1 is always the patient diagnosis.

We have a histopathological control for the normal tissue also in the case of this diagnosis (after TURBT procedure). However, it should be mentioned that the amount of normal tissue that is available strongly limits the number and type of molecular analyses that can be performed. As it is clearly visible in Figure 3D of the manuscript, the number of samples taken during TURBT procedure is usually three: one PT and two UMs (close and distant UM samples PT), with the minimal mandatory set of two samples (one PT and one UM) than can be included in our biobanking project.

In conclusion, there is no risk that the patient would suffer from problems regarding diagnosis, including the proper staging of the disease. If tumor sample is very small according to the experienced pathologist, we do not biobank material for such patient. If the patient is selected to biobank the tissue, the tumor sample as well as the fragment of normal tissue is divided into two “twin fragments”, one for FFPE and one for the biobank.

10. Were the pathologists used for biobanking all sub-specialized? For examples, was a GU pathologist used for bladder and prostate tissue collection? Why was it or why not was it necessary to have such sub-specialists in place?

In the larger cancer centers, where pathologists are diagnosing a very large number of patients, there usually is a degree of division of work involving specialization of pathologists for different diagnoses. This is valid only for physicians who already are specialist consultants in Pathology.

For the purpose of our biobanking project, there was no need to involve sub-specialization, as the biobanking protocol was fairly simple, although time consuming for some diagnoses. In each participating hospital, there was (is) a team of pathologists, who perform gross examination of the resected organs, biobanked samples and later were involved in the assessment of the tumor content. This team is processing all diagnoses for our project that are coming from any of the five collaborating hospitals. All the pathologists involved in our project prepared material according to detailed collection protocols and after training. Secondly, the workload necessary for this endeavor required the participation of the whole (or vast majority of) pathology teams (in smaller hospitals the whole team may be limited to two physicians) as the specimens sometimes arrived irregularly and had to be processed directly upon arrival. Strict sub-specialization is rarely practiced in Poland and almost only in hospitals dedicated to a narrow group of diseases, e.g., Institute of Tuberculosis and Lung Diseases or in reference Breast Cancer Units. Finally, sub-specialization is necessary for rare diagnoses or uncharacteristic entities and our study included common tumors, most of which had already been diagnosed in core needle biopsy or endoscopy specimens.

Occasional discrepancies and problems between the final diagnosis for the resected organ and the biopsy material may occur, but the assessment of the samples reflecting the biobanked tissues was always performed in parallel to the final pathologic diagnosis by the same pathologist, while any uncertainties were consulted with other team members.

11. It was mentioned that “… collection from 1711 donors sums up to about 2800 working hours or about 70 working weeks with full time effort.” During the tissue collection pipeline, how much extra time was added to the clinical pipelines of surgery and tissue processing? Can you breakdown the 2800 working hours by task? For example, how many pathologists’ hours were needed?

The below table is the summary of the requested details.

This information is a rough estimation of the extra time spent by the personnel on our biobanking project; that means the time added on top of what would be the time necessary to process a patient without biobanking. The presented numbers reflect the mean estimates for each task provided by the four largest partner hospitals:

Procedure Blood preparation Sample registration in the biobanking system Sample preparation, slide preparation and assessment in the pathology department Total

Breast-BCT 10 min 11 min 50 min 71 min

Breast-Mastectomy 10 min 14 min 81 min 105 min

Prostatectomy 10 min 16 min 128 min 154 min

Colectomy 10 min 15 min 65 min 90 min

Cystectomy 10 min 15 min 77 min 92 min

Hepatectomy 10 min 15 min 35 min 60 min

Pancreatectomy 10 min 15 min 85 min 110 min

The clinical pipeline of the surgery was longer only in two cases of our biobanking activity: while collecting the UM fragment from the glandular tissue of the breast for cell cultures directly at operating theatre (up to 5 min.) and during TURBT procedure (up to 5 min.).

We believe that this detailed calculation of time is not the most important aspect of the manuscript. Our aim was merely to provide an estimation of the necessary workload, that could be used by other investigators for planning purposes.

12. The collected samples can be made available to other research groups. ”Who gets to use the tissues? Is there a system in place to share the collected tissue? How will tissue usage be prioritized?”

Thank you for these questions, which point to an insufficient description of this aspect. The collected biological material and clinical details can be made available for other investigators after a request to both corresponding authors. The letter should outline the aim, number/type of requested samples and methodology of the proposed collaborative project. A sentence similar to the above one, has been added to the main text of the manuscript.

We obtained the research ethical approval, which is according to “the model of broad consent”, for the biobanking project outlined in our application for approval. This allows us to share, on the collaborative basis, the collected biological samples (in many formats; e.g. tissues, serum, blood and processed biological material such as DNA, RNA, etc.; all as pseudo-anonymized specimens) with investigators in Poland and outside for the purpose of specified collaboration project. Clinical details are available only to the authorized personnel of our unit and can be made available to outside investigators upon request.

The priority is determined by the availability of material, considering projects that are already ongoing, and avoiding competition for the same research questions.

Natalia Filipowicz

Jan Dumanski

Attachment

Submitted filename: Response_to_reviewers_PONE-D-21-28020.docx

Decision Letter 1

Isaac Yi Kim

15 Mar 2022

Comprehensive cancer-oriented biobanking resource of human samples for studies of post-zygotic genetic variation involved in cancer predisposition

PONE-D-21-28020R1

Dear Dr. Filipowicz,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Isaac Yi Kim, MD, PhD, MBA

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Isaac Yi Kim

30 Mar 2022

PONE-D-21-28020R1

Comprehensive cancer-oriented biobanking resource of human samples for studies of post-zygotic genetic variation involved in cancer predisposition

Dear Dr. Filipowicz:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Isaac Yi Kim

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Pseudonymized list of patients that were included in the statistics presented in the manuscript with the information on diagnosis, age, sex, ICD10, type of surgery where applicable, number of original samples and the status of smoking declared in the medical questionnaire.

    (XLSX)

    Attachment

    Submitted filename: Response_to_reviewers_PONE-D-21-28020.docx

    Data Availability Statement

    We are not able to provide the access to our internal database MABData 2, as this is only available locally for the authorized researchers of our unit. However, we have prepared a minimal anonymized dataset in excel file with all the donors and clinical/medical data that were included in the paper. We provide this information as a supporting information file (S1 Table).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES