Abstract
Background:
Accurate tumor detection and quantification are important for optimized therapy planning and evaluation. Total tumor burden is also an appealing biomarker for clinical trials. Manual examination and annotation of oncologic PET/CT is labor-intensive and demands a high level of expertise. A significant challenge is the risk of human error, which can lead to omission of tumors, in particular small tumors and tumors with low FDG uptake.
Purpose: In this study, we introduce an automated framework that uses a segmentation prior, generated with a tissue-wise multi-channel multi-angle projection approach, to enhance tumor segmentation in whole-body FDG-PET/CT.
Method: The proposed framework utilized a segmentation prior generated from tumor segmentations in tissue-wise multi-channel projections of the standardized uptake value (SUV) from PET. Projections were created from various angles and the tissues were identified based on their CT Hounsfield values. The resulting segmentation masks were subsequently backprojected into a unified 3D volume for creation of the segmentation prior. Finally, the segmentation prior was provided as an additional input channel along with the CT and SUV images to three variants of 3D segmentation networks (3D UNet, dynUNet, nnUNet) to enhance the overall tumor segmentation performance. All the methods were independently evaluated using 5-fold cross-validation on the autoPET dataset and subsequently tested on the U-CAN dataset.
Results:
Combining the segmentation prior with the original SUV and CT images significantly improved overall tumor segmentation performance compared to a baseline network. The increases in Dice coefficient for lymphoma, lung cancer, and melanoma across the different segmentation networks were: 3D UNet (, , ), dynUNet (, , ), and nnUNet (, , ), respectively (*, p-value < 0.05; ns, not significant).
Conclusion: The increased segmentation accuracy could be attributed to the segmentation prior generated from tissue-wise SUV projections, revealing information from various tissues that was useful for segmentation of tumors. The results from this study highlight the potential of the proposed method as a valuable future tool for time-efficient quantification of tumor burden in oncologic FDG-PET/CT.
Keywords: Whole-body tumor segmentation, Multi-channel multi-angled PET/CT projections, Backprojection, Segmentation prior
1. Introduction
According to the World Health Organization (WHO), cancer stands as one of the leading causes of death worldwide, surpassing all other health related disorders [1]. Each year, the number of individuals diagnosed with cancer continues to rise, emphasizing its escalating prevalence. While detecting the presence of cancer may not be overly challenging, accurate quantification of tumors at an early stage remains critical, especially identification of small and low contrast metastases emerging in different parts of the body. In clinical practice, diagnostic assessments, staging, and monitoring of certain cancer forms can be performed non-invasively using positron emission tomography combined with computed tomography (PET/CT) after injecting 18F-fluorodeoxyglucose (FDG) [2]. FDG is widely used in routine oncologic PET due to its sensitivity to the high glucose metabolism of malignant tumors.
Traditionally, tumor segmentation has relied solely on manual delineation of FDG-PET/CT images by radiologists. As a result, it has become labor-intensive, time-consuming, and susceptible to human errors [3]. Furthermore, there is a risk of the radiologists overlooking small lesions and lesions with low FDG uptake, which can have serious consequences as they can proliferate over time and spread. Therefore, early and precise lesion detection becomes vital for non-invasive tumor tracking as a step in streamlining the treatment planning and potentially improving patient outcome. Additionally, this is important for estimating the total metabolic tumor volume (TMTV), quantifying the total number of lesions and their locations in the body, detecting the presence of new lesions in follow-up scans, and assessing lesion-specific changes post-treatment. These are important prognostic factors in risk assessment, therapy optimization and evaluation [4]. They are also appealing biomarkers for clinical trials.
Several convolutional neural network (CNN) based architectures have been developed for image segmentation, with UNet [5] being most widely used. Expanding on UNet, Zhou et al. introduced nested and dense skip connections in their network called UNet++ [6] [7], aiming at reducing the semantic gap between the encoder and decoder for improved segmentation results. Recently, Isensee et al. developed the nnUNet [8] architecture, featuring a self-adapting framework for configuring various segmentation components automatically. DynUNet is another segmentation network provided by MONAI [9], an open source framework. It builds upon the foundations of nnUNet and delivers exceptional performance with ease of implementation.
Recent studies have made significant advancements in automated PET/CT tumor segmentation, including the development of deep transfer learning approaches and ISA-Net, which have shown effectiveness in quantifying molecular tumor burden quantification, risk stratification, and treatment response evaluation [10] [11]. The HECKTOR challenge at MICCAI 2020 further highlighted the progress in segmenting Gross Tumor Volume (GTV) in head and neck cancer using FDG-PET/CT, where top methods outperformed human inter-observer agreement [12]. Similarly, the autoPET challenge at MICCAI 2022 confirmed the feasibility of accurate automated segmentation of metabolically active tumors in whole-body PET/CT, with success largely dependent on data quality and quantity [13].
Despite significant advancements in the field of tumor segmentation from medical imaging [13] [14] [12] [15] [16] [17], some challenges persist including diverse tumor characteristics, anatomical misalignment between PET and CT, limited inter-operator agreement between radiologists during delineation and uncertainty in the annotation boundary. Among all, the most crucial challenge faced by many networks is the accurate segmentation of small and low FDG uptake lesions. Therefore, the main goal of this work was to develop an automated framework that effectively segments challenging lesions overlooked by conventional baseline networks, surpassing current state-of-the-art methods. The developed solution has the potential to assist radiologists by reducing their workload and minimizing the risk of overlooking critical information during diagnostics. Additionally, it can support longitudinal monitoring of cancer patients, contributing to improved patient care.
Segmentation prior-based tumor segmentation in whole-body PET/CT and PET/MRI datasets has previously been explored by our research group [18]. This includes segmentation of tumors from multiple 2D SUV maximum intensity projections (MIPs) in order to generate a segmentation prior, thereafter used as an independent input channel for tumor segmentation in 3D. In this case, the segmentation prior consists of a single channel, corresponding to the SUV MIP from all tissues, projected at multiple angles (i.e. a single-channel multi-angle approach). In segmentation prior-based methods, the effectiveness of the 3D tumor segmentation framework depends on the quality of the segmentation prior. Improving the reconstruction of such segmentation priors can result in enhanced tumor segmentation performance. By separating voxels from different tissues, tissue-wise SUV MIPs at multiple angles (i.e. a tissue-wise multi-channel multi-angle approach) can be obtained for increased information content in the projections, which potentially can assist in the tumor segmentation. Therefore, building upon previous work, our primary goal is to utilize a tissue-wise multi-channel multi-angle PET/CT projection-based approach [19] to improve the quality of the segmentation prior, aiming for state-of-the-art tumor segmentation results in whole-body FDG-PET/CT.
Previous methods have relied on training different variants of the UNet using extensive PET/CT and other datasets [20], [21], [22], [23], [24], [25], [26], [27], [28]. However, such methods may be clinically less relevant as different cancer types can exhibit heterogeneous imaging characteristics, necessitating disease-specific training for a more realistic approach. Hence, one of the secondary aims of this paper was to conduct disease-wise training, wherein independent neural networks were trained for different cancer types.
The main objectives of the paper can be summarized as follows:
1. Developing an automated tumor segmentation framework, using three different 3D segmentation networks, to evaluate the advantages of the proposed method with various architectures.
2. Applying a tissue-wise multi-channel PET/CT projection-based approach to enhance the quality of the segmentation prior.
3. Investigating the benefits of disease-specific training versus general training (the latter including all cancer types).
4. Comparing different models (baseline and proposed) through voxel-wise and lesion-wise analysis of segmentation metrics.
5. Independently testing various approaches (baseline, prior_1, prior_2) on an internal test set to assess their generalizability.
2. Methodology
2.1. Dataset
This study utilizes FDG-PET/CT images from the autoPET cohort [29] for the purpose of comprehensive tumor segmentation analysis and validation of the performance of the proposed method against the baseline method. It also uses an internal test set from the U-CAN cohort [30] to evaluate the generalizability of the developed models. Table 1 provides an overview of the key features of the datasets utilized in the study. Ethical approval was obtained from the Swedish Ethical Review Authority to conduct retrospective image analysis on both datasets.
Table 1.
Summary of FDG-PET/CT datasets.
| Parameters | autoPET | U-CAN |
|---|---|---|
| Medical imaging | FDG-PET/CT | FDG-PET/CT |
| Examinations | 501 | 68 |
| Cancer types | Lymphoma, Lung cancer, Melanoma | Diffuse large B cell lymphoma |
| Sex (Male/Female) | (290/209) | (51/37) |
| Avg. total metabolic tumor volume | 220 ml | 107 ml |
| Number of sites | single-site | multi-site |
| CT scanner | Siemens Biograph mCT | - |
| CT mAs | 200 mAs | - |
| CT Tube Voltage | 120 kV | - |
| CT Contrast Agent | Ultravist 370 | - |
| PET Radioactivity | 314.7 MBq | - |
| PET Acquisition Time per Bed Position | 2 minutes | - |
2.1.1. autoPET cohort
The autoPET dataset originated from a medical center in Germany. This dataset comprises three different cancer types: lymphoma (144 scans), lung cancer (167 scans), and melanoma (188 scans), as well as a negative control group (513 scans). The voxel size in each image is (2.04 x 2.04 x 3.00) mm³. All PET/CT images and their manual annotations are provided as 3D volumes, typically ranging from the head to the mid-thigh level, and in some cases, the entire body as per clinical relevance. The manual annotations were conducted by two expert radiologists with ten and five years of experience. The dataset is publicly available at TCIA (The Cancer Imaging Archive) [31].
2.1.2. U-CAN cohort
A subset of the U-CAN dataset, consisting of 65 whole-body FDG-PET/CT images of diffuse large B-cell lymphoma (DLBCL) patients, was used as a test set. The voxel size in each image is (2.04 x 2.04 x 3.00) mm³. Manual annotations were performed by a medical student under the supervision of a radiologist with 5 years of experience. All annotations were approved by the radiologist. The dataset belongs to the U-CAN consortium and is not publicly accessible.
2.2. Data pre-processing
The PET data underwent a standardization process by converting the voxel intensities to standardized uptake value (SUV) normalized by body weight. All CT and their corresponding SUV images were resampled to a common imaging resolution, ensuring uniform spacing. The voxel intensities were clipped between [-100, 250] for CT and [0, 15] for SUV, and thereafter normalized between [0, 1].
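For illustration, the sketch below outlines this pre-processing, assuming the PET volume has already been converted to activity concentration (Bq/mL) and resampled; decay correction and DICOM handling are omitted, and the volumes, body weight, and injected dose used in the example are placeholders rather than values from the datasets.

```python
import numpy as np

def suv_bw(activity_bq_per_ml: np.ndarray, injected_dose_bq: float, weight_kg: float) -> np.ndarray:
    """Body-weight-normalized SUV = activity concentration x body weight / injected dose."""
    return activity_bq_per_ml * (weight_kg * 1000.0) / injected_dose_bq

def clip_and_normalize(img: np.ndarray, lo: float, hi: float) -> np.ndarray:
    """Clip intensities to [lo, hi] and rescale linearly to [0, 1]."""
    return (np.clip(img, lo, hi) - lo) / (hi - lo)

# Placeholder volumes standing in for the resampled CT (HU) and PET activity images
ct = np.random.uniform(-1000, 1500, size=(128, 128, 200))
pet = np.random.uniform(0, 20000, size=(128, 128, 200))        # Bq/mL

suv = suv_bw(pet, injected_dose_bq=314.7e6, weight_kg=75.0)    # dose as in Table 1; weight is illustrative
ct_norm = clip_and_normalize(ct, -100.0, 250.0)                # CT clipped to [-100, 250] HU
suv_norm = clip_and_normalize(suv, 0.0, 15.0)                  # SUV clipped to [0, 15]
```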
2.3. Overview of the proposed tumor segmentation framework
The overall workflow for the automated tumor segmentation comprised four steps: (a) Generation of tissue-wise multi-channel inputs from SUV and CT images, (b) Segmentation of tumors in 2D projections using multi-channel multi-angled SUV MIPs, (c) Generation of segmentation prior, and (d) 3D Tumor segmentation. These steps are illustrated in Fig. 1 and described in detail in subsections [a] - [d] below.
Figure 1.
Overview of the proposed framework for automated tumor segmentation from whole-body FDG-PET/CT: (a) Generation of tissue-wise multi-channel inputs from SUV and CT images, (b) Segmentation of tumors in 2D projections using multi-channel multi-angled SUV MIPs, (c) Generation of segmentation prior, (d) 3D Tumor segmentation.
2.3.1. Tissue-wise multi-channel PET/CT generation
In the first step, all original CT images were categorized into different tissues: bone, lean tissue, adipose tissue and air, according to Eqs. (1)-(4) [32] [33]. This process resulted in the creation of multi-channel binary CT masks, hereafter referred to as “bone”, “lean”, “adipose”, and “air”.
$$\text{bone}(i)=\begin{cases}1, & i \in \big[\mathrm{HU}_{\min}^{\text{bone}},\ \mathrm{HU}_{\max}^{\text{bone}}\big]\\ 0, & \text{otherwise}\end{cases} \tag{1}$$

$$\text{lean}(i)=\begin{cases}1, & i \in \big[\mathrm{HU}_{\min}^{\text{lean}},\ \mathrm{HU}_{\max}^{\text{lean}}\big]\\ 0, & \text{otherwise}\end{cases} \tag{2}$$

$$\text{adipose}(i)=\begin{cases}1, & i \in \big[\mathrm{HU}_{\min}^{\text{adipose}},\ \mathrm{HU}_{\max}^{\text{adipose}}\big]\\ 0, & \text{otherwise}\end{cases} \tag{3}$$

$$\text{air}(i)=\begin{cases}1, & i \in \big[\mathrm{HU}_{\min}^{\text{air}},\ \mathrm{HU}_{\max}^{\text{air}}\big]\\ 0, & \text{otherwise}\end{cases} \tag{4}$$
where i represents the voxel intensity in Hounsfield units (HU) and the tissue-specific HU intervals follow [32] [33]. These CT masks were utilized to derive tissue-specific multi-channel CT and SUV images by voxel-wise multiplication between the masks and the corresponding CT and SUV images, respectively. This approach effectively isolated the respective tissues. As a result, several multi-channel inputs were obtained: CT, SUV, CT_bone, CT_lean, CT_adipose, CT_air, SUV_bone, SUV_lean, SUV_adipose, and SUV_air (as shown in Fig. 1 [a]). To ensure consistency and comparability among the different channels, all multi-channel inputs, except for the original CT and SUV, were subsequently normalized within the range of [0, 1]. The data pre-processing step mentioned in Section 2.2 was applied to the original CT and SUV.
The different ranges of CT Hounsfield units (HUs), shown in Eqs. (1)-(4), have previously been used in medical imaging to visualize specific tissues of interest [32] [33]. For example, a bone window can be used to enhance contrast in bone tissue for easier identification of lesions and abnormalities within the skeletal system, and a lean tissue window can visualize soft tissue lesions present in muscles, organs, and vessels, aiding in the detection and characterization of such abnormalities.
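The sketch below illustrates how such tissue-wise channels can be derived; the HU intervals shown are commonly used illustrative values, not necessarily the exact thresholds of Eqs. (1)-(4), which follow [32], [33].

```python
import numpy as np

# Illustrative HU intervals only; the exact thresholds of Eqs. (1)-(4) follow [32], [33]
HU_RANGES = {
    "air":     (-np.inf, -191),
    "adipose": (-190, -30),
    "lean":    (-29, 150),
    "bone":    (151, np.inf),
}

def tissue_channels(ct_hu: np.ndarray, suv: np.ndarray) -> dict:
    """Binary CT masks per tissue and the corresponding masked CT/SUV channels."""
    channels = {"CT": ct_hu, "SUV": suv}
    for tissue, (lo, hi) in HU_RANGES.items():
        mask = ((ct_hu >= lo) & (ct_hu <= hi)).astype(np.float32)
        channels[f"CT_{tissue}"] = ct_hu * mask      # tissue-specific CT channel
        channels[f"SUV_{tissue}"] = suv * mask       # tissue-specific SUV channel
    return channels  # the masked channels are subsequently normalized to [0, 1]
```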
2.3.2. Tumor segmentation from multi-channel multi-angled SUV projections
MIPs were generated from the SUV volume as well as from the different tissue-wise SUV channels, with respect to the axial direction within the range of angles [-90°, 90°) and with consecutive projections created at 10° intervals [34]. A total of 18 predetermined projections were generated for each patient, resulting in 9018 different projections (from 501 scans) obtained from the autoPET cohort (according to [18]). Each projection contained 5 different channels (SUV MIP and the bone, lean, adipose, and air MIPs). The combination of all multi-channel projections with multiple angles is referred to as multi-channel multi-angled projections. Additionally, the ground truth segmentation labels corresponding to the MIPs were generated following the same multi-angle approach, which resulted in 18 different ground truth masks per scan. The ground truth labels were 2D binary masks where the foreground pixels represented tumors in each of the multi-channel projections while background pixels corresponded to non-tumor regions.
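A minimal sketch of the multi-angle MIP generation is shown below, assuming a volume stored as a NumPy array in (x, y, z) order with z along the cranio-caudal axis; the choice of rotation and projection axes is an assumption about the implementation, and the same procedure would be applied to the label volume to obtain the corresponding 2D ground truth masks.

```python
import numpy as np
from scipy.ndimage import rotate

def multi_angle_mips(volume: np.ndarray, step_deg: int = 10) -> dict:
    """Maximum intensity projections of a 3D volume (x, y, z) for angles in [-90, 90)."""
    mips = {}
    for angle in range(-90, 90, step_deg):                 # 18 predetermined angles at 10 degree steps
        rotated = rotate(volume, angle, axes=(0, 1), reshape=False, order=1)
        mips[angle] = rotated.max(axis=1)                  # project along the (rotated) y-axis
    return mips
```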
A 2D UNet++ [6] [7] was employed for segmenting the tumor regions from these multi-channel multi-angled projections (as shown in Fig. 1 [b]). The network uses the 5-channel projection image as input and is optimized to segment any existing lesion. Due to variations in the field of view, the image size of the projections differs significantly among patients within this cohort. For this reason, a batch size of 1 was utilized during training to accommodate the varying projection sizes. To optimize the training process, a combination of Dice and Focal loss functions was used [9] [35] [36] [37]. The Adam optimizer was chosen, with a learning rate of 1e-4. Additionally, a weight decay of 1e-5 was employed to regulate the weight magnitudes and mitigate potential overfitting. To introduce regularization, a dropout rate of 0.20 was used.
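The configuration below sketches this training setup with MONAI's DiceFocalLoss and the stated optimizer settings; MONAI's BasicUNet is used here only as a stand-in for the 2D UNet++ actually employed, and the training loop is reduced to a single step for brevity.

```python
import torch
from monai.losses import DiceFocalLoss
from monai.networks.nets import BasicUNet

# BasicUNet (2D) stands in for the UNet++ used in the paper; 5 input channels, 1 output channel
model = BasicUNet(spatial_dims=2, in_channels=5, out_channels=1, dropout=0.2)
loss_fn = DiceFocalLoss(sigmoid=True)                              # combined Dice + Focal loss
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4, weight_decay=1e-5)

def train_step(x: torch.Tensor, y: torch.Tensor) -> float:
    """One optimization step on a single multi-channel projection (batch size 1)."""
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)                                    # x: (1, 5, H, W), y: (1, 1, H, W)
    loss.backward()
    optimizer.step()
    return loss.item()
```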
Potentially, most of the tumor-related information is available in the SUV MIP channel itself. Therefore, we investigated whether inclusion of the additional four channels (bone, lean, adipose, air) was of use during tumor segmentation from the 2D projections. To analyze this, we trained two different networks (see Table 2): one using only the SUV MIP as input and a second using the 5-channel image (SUV MIP and the bone, lean, adipose, and air MIPs) as input.
Table 2.
Results of 2D tumor segmentation using multi-channel multi-angled projections.
| Disease | MIP | Bone | Lean | Adipose | Air | Dice |
|---|---|---|---|---|---|---|
| Lymphoma | ✓ | | | | | 0.6587 |
| Lung Cancer | ✓ | | | | | 0.7356 |
| Melanoma | ✓ | | | | | 0.5824 |
| Lymphoma | ✓ | ✓ | ✓ | ✓ | ✓ | 0.6869 |
| Lung Cancer | ✓ | ✓ | ✓ | ✓ | ✓ | 0.7667 |
| Melanoma | ✓ | ✓ | ✓ | ✓ | ✓ | 0.6148 |
To optimize the results, two distinct network combinations were trained: disease-specific training and general training. In the disease-specific training approach, three separate networks were trained on the three cancer types, allowing them to learn disease-specific features. In the general training approach, a single network was trained on all disease types, enabling it to capture more general tumor-related features. The rationale behind this approach was to benefit from the strengths of both network configurations. By training disease-specific networks, each model can specialize in learning features relevant to its respective cancer type, potentially enhancing its ability to detect characteristics unique to that disease. Conversely, the general training approach benefits from a larger and more diverse training set (from all three cancer types), which enables the network to learn more general tumor-related features that may be applicable across different cancer types. Finally, an ensemble of the above two network combinations was employed for the final prediction. Here, the objective was to maximize the detection of independent lesions (present in the projection), while effectively filtering out apparent false positives.
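The exact fusion rule of this ensemble is not detailed above; the sketch below shows one possible rule (a union of the binarized disease-specific and general predictions), which favors lesion detection, and should be read as an assumption rather than the implementation used in the paper.

```python
import numpy as np

def ensemble_predictions(prob_specific: np.ndarray, prob_general: np.ndarray, thr: float = 0.5) -> np.ndarray:
    """Fuse disease-specific and general 2D predictions.

    One possible rule: the union of the binarized masks, so a lesion found by either network is kept.
    """
    return ((prob_specific >= thr) | (prob_general >= thr)).astype(np.uint8)
```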
2.3.3. Reconstruction of segmentation prior using backprojection
After segmenting the lesions from the 2D projections, a backprojection algorithm [38] [39] was employed to reconstruct a volumetric representation of the lesion locations, called the “segmentation prior” (as shown in Fig. 1 [c]). This approach involved combining the 2D segmentation predictions obtained from the 18 predetermined 2D projections for a given patient. The information from different angles was aligned and fused to trace the foreground pixels, corresponding to the predicted lesions, back to their original locations in 3D [40] [41]. This process was performed for all 18 projections, resulting in 18 distinct 3D volumes, each associated with a specific 2D segmentation mask. Subsequently, all 18 volumes were combined into a single 3D volume by summing them together, followed by multiplication with the corresponding SUV volume. This aggregation enhanced the contrast of overlapping regions, intensifying their representation compared to non-overlapping regions. The resulting 3D backprojected volume is the segmentation prior. It aims to provide a comprehensive and enriched representation of the lesions present in the whole body, incorporating information from different tissues and multiple angles. This can facilitate the understanding of the spatial distribution and characteristics of the lesions in a 3D context, surpassing the limitations of the initial 2D segmentations. The segmentation prior is essentially a 3D volume with the same image resolution and matrix size as the original PET/CT image, containing prior information about tumor characteristics such as size, shape, and location. The intensity values of the segmentation prior were normalized to the range [0, 1]. Furthermore, the lowest 5th percentile of intensity values, which primarily corresponded to noise, was removed.
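A simplified sketch of this backprojection step is given below, assuming the 2D masks were obtained from MIPs along the y-axis of an (x, y, z) volume rotated with the same convention as in the projection step; the interpolation order and the noise-suppression details are assumptions.

```python
import numpy as np
from scipy.ndimage import rotate

def backproject_prior(masks_2d: dict, suv: np.ndarray) -> np.ndarray:
    """Reconstruct a 3D segmentation prior from 2D masks predicted at multiple angles.

    masks_2d maps each projection angle (degrees) to a 2D binary mask of shape (X, Z),
    obtained from MIPs projected along the y-axis of the (X, Y, Z) SUV volume.
    """
    accum = np.zeros_like(suv, dtype=np.float32)
    for angle, mask in masks_2d.items():
        smeared = np.repeat(mask[:, None, :].astype(np.float32), suv.shape[1], axis=1)  # smear along projection axis
        accum += rotate(smeared, -angle, axes=(0, 1), reshape=False, order=0)           # rotate back to original frame
    prior = accum * suv                                                  # weight overlapping regions by SUV
    prior = (prior - prior.min()) / (prior.max() - prior.min() + 1e-8)   # normalize to [0, 1]
    prior[prior < np.percentile(prior, 5)] = 0.0                         # suppress lowest 5th percentile (noise)
    return prior
```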
During the reconstruction process, it is crucial to ensure that as many lesions as possible, ideally all of them, are highlighted in the resulting segmentation prior, thus providing a comprehensive and accurate depiction of the lesion locations. Segmenting tumors from tissue-wise multi-channel multi-angled 2D projections aims not only to enhance the visibility of the maximum number of lesions within the segmentation prior but also to minimize false positives. Two different types of segmentation priors were created. The first, referred to as “segmentation prior 1”, was generated using only the SUV MIP as input (further details provided in [18]). The second, referred to as “segmentation prior 2”, was created using the multi-channel multi-angled 2D projections (SUV MIP and the bone, lean, adipose, and air MIPs), as previously discussed in Section 2.3.2.
2.3.4. 3D tumor segmentation
In the final step, the goal was to perform whole-body tumor segmentation in 3D (as shown in Fig. 1 [d]). For this, three 3D UNet models were evaluated, all with the same network architecture, except for the number of input channels. The first model, referred to as 3D UNet (baseline), utilized two input channels: the CT and the SUV. The second model, named 3D UNet (prior_1), utilized three input channels: CT, SUV, and “segmentation prior 1.” The third model, named 3D UNet (prior_2), also used three input channels: CT, SUV, and “segmentation prior 2.” The evaluation aimed at studying the efficacy of different segmentation priors in improving tumor segmentation performance when compared to the baseline model. To evaluate the effectiveness of incorporating the segmentation priors, the study also included training different variants of the 3D segmentation network architecture, such as dynUNET [9] and nnUNET [8], in a benchmarking study alongside the 3D UNet model, using corresponding input channels. The standard architectures were used without any modifications. Throughout the paper, the term “baseline” referred to the 3D segmentation network with two input channels, while “prior_1” and “prior_2” referred to the 3D segmentation networks with three input channels, based on the specific segmentation priors utilized. To summarize, the study focused on evaluating three independent network architectures, 3D UNet, dynUNet, and nnUNet, on three different cancer types: lung cancer, lymphoma, and melanoma. Within each of the network architectures, three variants of the segmentation models were trained, referred to as baseline, prior_1, and prior_2. The variations in these models were limited to the number and types of input channels, allowing for a comprehensive assessment of the impact of segmentation priors.
To ensure consistency and unbiased evaluation, all models were independently assessed using five-fold cross-validation, employing the same training-validation split throughout the process. Stratification based on sex was applied to each of the five folds to maintain the same distribution of males and females in each fold. To achieve optimal performance, disease-specific training was conducted for all the models. During the training phase, the networks dedicated to lymphoma were trained from scratch without any pre-training, whereas for the other cancer types, the lymphoma network served as initialization for pre-training. All models were trained for 300 epochs to ensure consistency, except the nnUNet models, which were trained for 1000 epochs. During training, a patch size of (160, 160, 160) voxels was used, with patches extracted through a sliding window approach with an overlap of 0.25 between consecutive patches. The training process employed the Dice focal loss function [36] [37], optimized using the Adam optimizer, with a learning rate of 1e-4, weight decay of 1e-5, dropout rate of 0.20, and a batch size of 1. The experiments were conducted on a machine equipped with 32 GB of internal RAM and an Nvidia RTX 3090 Ti GPU with 24 GB of memory.
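The sketch below illustrates the prior_2 input configuration and patch-based inference using MONAI's sliding_window_inference with a (160, 160, 160) window and 0.25 overlap; BasicUNet again stands in for the actual 3D UNet, dynUNet, and nnUNet architectures.

```python
import torch
from monai.inferers import sliding_window_inference
from monai.networks.nets import BasicUNet

# BasicUNet stands in for the 3D UNet / dynUNet / nnUNet backbones used in the paper
model_3d = BasicUNet(spatial_dims=3, in_channels=3, out_channels=1, dropout=0.2)
model_3d.eval()

def segment_whole_body(ct: torch.Tensor, suv: torch.Tensor, prior: torch.Tensor) -> torch.Tensor:
    """Whole-body inference with (160, 160, 160) patches and 0.25 overlap (prior_2 setup)."""
    x = torch.stack([ct, suv, prior], dim=0).unsqueeze(0)        # (1, 3, X, Y, Z)
    with torch.no_grad():
        logits = sliding_window_inference(x, roi_size=(160, 160, 160),
                                          sw_batch_size=1, predictor=model_3d, overlap=0.25)
    return (torch.sigmoid(logits) > 0.5).squeeze()               # binary tumor mask
```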
2.4. Evaluation metrics
The performance of the different tumor segmentation models was evaluated using the following metrics, as presented in equations (5), (6), (7), (8), and (9): Dice coefficient, Hausdorff distance (HD95), average surface distance (ASD), lesion-wise recall, and lesion-wise precision. Lesion-wise precision is the ratio of the number of independent lesions correctly detected by the network to the total number of lesions predicted by the network. Lesion-wise recall is the ratio of the number of independent lesions correctly detected by the network to the total number of lesions present in the ground truth. Lesion-wise precision and recall were estimated by extracting the total number of independent lesions using connected component analysis. Clusters of connected components were found using a 27-connected neighborhood in 3D, and only connected components with volumes greater than 0.3 ml were considered for the analysis.
If G represents the ground truth label, P represents the prediction by the network and TP, FN, FP correspond to the independent true positive, false negative, false positive lesions, then the above metrics can be defined as follows:
$$\text{Dice}(G,P)=\frac{2\,|G\cap P|}{|G|+|P|} \tag{5}$$

$$\text{HD95}(G,P)=\operatorname{P}_{95}\!\left(\left\{\min_{p\in S(P)}\lVert g-p\rVert : g\in S(G)\right\}\cup\left\{\min_{g\in S(G)}\lVert p-g\rVert : p\in S(P)\right\}\right) \tag{6}$$

$$\text{ASD}(G,P)=\frac{\sum_{g\in S(G)}\min_{p\in S(P)}\lVert g-p\rVert+\sum_{p\in S(P)}\min_{g\in S(G)}\lVert p-g\rVert}{|S(G)|+|S(P)|} \tag{7}$$

$$\text{Recall}=\frac{TP}{TP+FN} \tag{8}$$

$$\text{Precision}=\frac{TP}{TP+FP} \tag{9}$$

where $S(\cdot)$ denotes the set of surface voxels and $\operatorname{P}_{95}$ the 95th percentile.
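The lesion-wise metrics can be computed as sketched below; the criterion that any voxel overlap between a predicted and a ground truth component counts as a detection is an assumption, as is the handling of cases without any lesions.

```python
import numpy as np
from scipy.ndimage import label

def lesion_wise_scores(gt: np.ndarray, pred: np.ndarray, voxel_volume_ml: float, min_volume_ml: float = 0.3):
    """Lesion-wise precision and recall using 27-connected components larger than 0.3 ml."""
    structure = np.ones((3, 3, 3))                        # 27-connected neighborhood
    min_voxels = int(np.ceil(min_volume_ml / voxel_volume_ml))

    def components(mask):
        labeled, n = label(mask, structure=structure)
        return [labeled == i for i in range(1, n + 1)
                if np.count_nonzero(labeled == i) >= min_voxels]

    gt_lesions, pred_lesions = components(gt), components(pred)
    tp_gt = sum(1 for g in gt_lesions if any(np.logical_and(g, p).any() for p in pred_lesions))
    tp_pred = sum(1 for p in pred_lesions if any(np.logical_and(p, g).any() for g in gt_lesions))

    recall = tp_gt / len(gt_lesions) if gt_lesions else 1.0
    precision = tp_pred / len(pred_lesions) if pred_lesions else 1.0
    return precision, recall
```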
2.5. Statistics
A two-sided Wilcoxon signed-rank test was conducted to study potential differences in performance between the three models. A p-value less than 0.05 was considered statistically significant.
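A minimal example of this test on paired per-patient Dice scores is shown below; the score arrays are illustrative placeholders.

```python
from scipy.stats import wilcoxon

# Per-patient Dice scores for two models (illustrative placeholder values)
dice_baseline = [0.70, 0.65, 0.72, 0.60, 0.74, 0.68]
dice_prior_2  = [0.74, 0.69, 0.73, 0.66, 0.78, 0.70]

stat, p_value = wilcoxon(dice_prior_2, dice_baseline, alternative="two-sided")
print(f"p = {p_value:.4f}, significant: {p_value < 0.05}")
```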
3. Results
3.1. 2D tumor segmentation using multi-channel multi-angled SUV projections
Table 2 presents the results from five-fold cross-validation on three cancer types using multi-channel multi-angled SUV projections. The table provides a comparison of the 2D tumor segmentation performance using all five SUV input channels against using only the SUV MIP channel. The results show better performance with all five input channels than with a single channel, suggesting the superiority of segmentation prior 2 over segmentation prior 1. Additionally, both methods effectively segmented tumors from one or more projection angles.
3.2. 3D tumor segmentation
Table 3 presents the results from five-fold cross-validation across all three cancer types with different methodologies (baseline, prior_1, prior_2) using the 3D UNet model. Table 4 and Table 5 present similar results with different network architectures (dynUNet and nnUNet). In general, the models based on prior_2 outperformed those based on the baseline and prior_1 by clear margins in terms of the Dice coefficient and lesion-wise precision and recall.
Table 3.
Results from 3D tumor segmentation using the 3D UNet model across different cancer types. Dice, HD95, and ASD are voxel-level metrics, while Precision and Recall are lesion-wise metrics. Statistical tests were conducted using the Wilcoxon signed-rank test between the baseline and prior_2 models as well as between the prior_1 and prior_2 models. Here, “*” indicates statistical significance, and “ns” indicates non-significance.
| Method | Disease | Dice | HD95 | ASD | Precision | Recall |
|---|---|---|---|---|---|---|
| 3D UNet (baseline) | Lymphoma | 0.70 ± 0.24 (*) | 25.71 ± 43.24 | 6.56 ± 19.62 | 0.78 ± 0.31 | 0.74 ± 0.28 |
| | Lung Cancer | 0.76 ± 0.15 (*) | 33.51 ± 40.02 | 7.52 ± 11.31 | 0.83 ± 0.23 | 0.62 ± 0.25 |
| | Melanoma | 0.59 ± 0.28 (*) | 52.54 ± 66.36 | 8.54 ± 17.71 | 0.79 ± 0.31 | 0.75 ± 0.31 |
| 3D UNet (prior_1) | Lymphoma | 0.71 ± 0.25 (*) | 31.14 ± 54.84 | 8.32 ± 34.43 | 0.81 ± 0.25 | 0.77 ± 0.25 |
| | Lung Cancer | 0.76 ± 0.17 (ns) | 33.38 ± 40.96 | 7.39 ± 15.48 | 0.81 ± 0.24 | 0.71 ± 0.24 |
| | Melanoma | 0.64 ± 0.28 (*) | 50.58 ± 67.82 | 11.59 ± 30.40 | 0.80 ± 0.26 | 0.79 ± 0.28 |
| 3D UNet (prior_2) | Lymphoma | 0.74 ± 0.22 | 23.58 ± 50.96 | 6.50 ± 35.02 | 0.83 ± 0.25 | 0.84 ± 0.23 |
| | Lung Cancer | 0.78 ± 0.15 | 28.63 ± 41.76 | 7.23 ± 16.34 | 0.87 ± 0.20 | 0.74 ± 0.24 |
| | Melanoma | 0.70 ± 0.24 | 43.83 ± 68.44 | 6.44 ± 17.25 | 0.82 ± 0.24 | 0.86 ± 0.22 |
Table 4.
Results from 3D tumor segmentation using the dynUNet model across different cancer types. Dice, HD95, and ASD are voxel-level metrics, while Precision and Recall are lesion-wise metrics. Statistical tests were conducted using the Wilcoxon signed-rank test between the baseline and prior_2 models as well as between the prior_1 and prior_2 models. Here, “*” indicates statistical significance, and “ns” indicates non-significance.
| Method | Disease | Dice | HD95 | ASD | Precision | Recall |
|---|---|---|---|---|---|---|
| dynUNet (baseline) | Lymphoma | 0.70 ± 0.23 (*) | 33.96 ± 58.36 | 6.59 ± 28.56 | 0.74 ± 0.29 | 0.80 ± 0.25 |
| | Lung Cancer | 0.76 ± 0.16 (*) | 40.45 ± 51.16 | 5.20 ± 6.60 | 0.74 ± 0.26 | 0.71 ± 0.22 |
| | Melanoma | 0.62 ± 0.28 (*) | 60.68 ± 70.61 | 7.55 ± 23.22 | 0.73 ± 0.31 | 0.81 ± 0.27 |
| dynUNet (prior_1) | Lymphoma | 0.70 ± 0.23 (*) | 35.92 ± 84.56 | 7.25 ± 23.84 | 0.75 ± 0.28 | 0.80 ± 0.29 |
| | Lung Cancer | 0.77 ± 0.16 (*) | 35.76 ± 42.39 | 5.71 ± 13.25 | 0.76 ± 0.25 | 0.75 ± 0.23 |
| | Melanoma | 0.67 ± 0.28 (*) | 51.17 ± 74.16 | 5.05 ± 15.72 | 0.78 ± 0.27 | 0.85 ± 0.24 |
| dynUNet (prior_2) | Lymphoma | 0.75 ± 0.22 | 22.95 ± 49.81 | 7.32 ± 35.17 | 0.84 ± 0.24 | 0.82 ± 0.24 |
| | Lung Cancer | 0.80 ± 0.14 | 31.56 ± 41.95 | 5.47 ± 13.26 | 0.81 ± 0.24 | 0.76 ± 0.23 |
| | Melanoma | 0.70 ± 0.23 | 42.07 ± 63.80 | 6.46 ± 18.19 | 0.82 ± 0.26 | 0.86 ± 0.23 |
Table 5.
Results from 3D tumor segmentation using the nnUNet model across different cancer types. Dice, HD95, and ASD are voxel-level metrics, while Precision and Recall are lesion-wise metrics. Statistical tests were conducted using the Wilcoxon signed-rank test between the baseline and prior_2 models as well as between the prior_1 and prior_2 models. Here, “*” indicates statistical significance, and “ns” indicates non-significance.
| Method | Disease | Dice | HD95 | ASD | Precision | Recall |
|---|---|---|---|---|---|---|
| nnUNet (baseline) | Lymphoma | 0.74 ± 0.23 (*) | 25.65 ± 44.53 | 6.32 ± 30.48 | 0.77 ± 0.26 | 0.85 ± 0.23 |
| | Lung Cancer | 0.80 ± 0.13 (ns) | 27.23 ± 44.53 | 5.89 ± 12.34 | 0.77 ± 0.22 | 0.83 ± 0.18 |
| | Melanoma | 0.65 ± 0.27 (*) | 48.04 ± 65.32 | 6.87 ± 19.43 | 0.71 ± 0.27 | 0.87 ± 0.22 |
| nnUNet (prior_1) | Lymphoma | 0.73 ± 0.25 (*) | 25.88 ± 45.68 | 6.76 ± 32.53 | 0.76 ± 0.24 | 0.84 ± 0.23 |
| | Lung Cancer | 0.80 ± 0.14 (ns) | 26.15 ± 42.93 | 5.19 ± 13.63 | 0.76 ± 0.23 | 0.83 ± 0.20 |
| | Melanoma | 0.65 ± 0.28 (*) | 50.15 ± 67.62 | 6.54 ± 18.43 | 0.67 ± 0.28 | 0.86 ± 0.24 |
| nnUNet (prior_2) | Lymphoma | 0.76 ± 0.23 | 22.58 ± 46.54 | 6.89 ± 34.76 | 0.81 ± 0.23 | 0.86 ± 0.22 |
| | Lung Cancer | 0.80 ± 0.15 | 27.63 ± 43.54 | 5.14 ± 13.57 | 0.77 ± 0.23 | 0.83 ± 0.21 |
| | Melanoma | 0.68 ± 0.24 | 44.69 ± 67.85 | 6.32 ± 17.53 | 0.73 ± 0.26 | 0.87 ± 0.22 |
Table 6 presents the performance of the segmentation networks (3D UNet, dynUNet, nnUNet) on an internal test set from the U-CAN cohort using different methodologies (baseline, prior_1, prior_2). The nnUNet (prior_2) method demonstrated the best performance among all the models evaluated.
Table 6.
Results of 3D tumor segmentation on the internal test set from the U-CAN dataset using different networks and input data. Since the U-CAN cohort contained only DLBCL cases, networks (3D UNet, dynUNet, nnUNet) trained only on lymphoma cases were used for testing.
| Model | CT | SUV | Prior_1 | Prior_2 | Dice |
|---|---|---|---|---|---|
| 3D UNet (lymphoma) | ✓ | ✓ | | | 0.4491 |
| | ✓ | ✓ | ✓ | | 0.4652 |
| | ✓ | ✓ | | ✓ | 0.5165 |
| dynUNet (lymphoma) | ✓ | ✓ | | | 0.5042 |
| | ✓ | ✓ | ✓ | | 0.5132 |
| | ✓ | ✓ | | ✓ | 0.5367 |
| nnUNet (lymphoma) | ✓ | ✓ | | | 0.5483 |
| | ✓ | ✓ | ✓ | | 0.5368 |
| | ✓ | ✓ | | ✓ | 0.5632 |
Fig. 2 illustrates the comparison of tumor segmentation accuracy (Dice) between the baseline and prior_2 methods, using 3D UNet, dynUNet, and nnUNet, across different metabolic tumor volume (MTV) groups, for all three cancer types. Models based on prior_2 enhanced the tumor segmentation performance compared to those based on the baseline across most of the MTV groups. A similar comparison between baseline and prior_2 models across different SUVmean groups is provided in Fig. A.1 of the appendix section.
Figure 2.
Lesion-wise comparison of tumor segmentation accuracy (Dice) between baseline and prior_2 methods for the three networks (3D UNet, dynUNet, and nnUNet) across different metabolic tumor volume (MTV) groups, categorized as V1, V2, V3, V4, and V5. For lymphoma: V1 (≤ 1 ml), V2 (1-2 ml), V3 (2-4 ml), V4 (4-14 ml), V5 (≥ 14 ml); for lung cancer: V1 (≤ 2 ml), V2 (2-3.5 ml), V3 (3.5-7.3 ml), V4 (7.3-23 ml), V5 (≥ 23 ml); for melanoma: V1 (≤ 1 ml), V2 (1-1.5 ml), V3 (1.5-3 ml), V4 (3-8 ml), V5 (≥ 8 ml). Statistical comparison of the Dice coefficient between the baseline and prior_2 methods, using the Wilcoxon signed-rank test, is also shown across the different MTV groups. Here, “*” corresponds to a p-value within the range 0.001-0.05 and “**” corresponds to a p-value less than 0.001.
Figure A.1.
Lesion-wise comparison of tumor segmentation accuracy (Dice) between baseline and prior_2 methods for the three networks (3D UNet, dynUNet, and nnUNet) across different SUVmean groups, categorized as S1, S2, S3, S4, and S5. For lymphoma: S1 (1.95-3.84), S2 (3.84-4.52), S3 (4.52-5.25), S4 (5.25-6.33), S5 (6.33-18.42); for lung cancer: S1 (1.18-2.38), S2 (2.38-2.72), S3 (2.72-3.15), S4 (3.15-4.04), S5 (4.04-13.86); for melanoma: S1 (1.82-3.96), S2 (3.97-4.81), S3 (4.81-5.67), S4 (5.67-7.10), S5 (7.12-31.06). Statistical comparison of the Dice coefficient between the baseline and prior_2 methods, using the Wilcoxon signed-rank test, is also shown across the different SUVmean groups. Here, “*” corresponds to a p-value between 0.001 and 0.05 and “**” corresponds to a p-value less than 0.001.
Tables 7 [a]-[f] display the confusion matrices illustrating the comparison between the “baseline” and “prior_2” methods as well as between the “prior_1” and “prior_2” methods using a 3D UNet model, focusing on the total FN lesion count. The results indicate that the “prior_2” method identifies a larger number of lesions across all cancer types, some of which are missed by the “baseline” or “prior_1” method. On the contrary, the number of lesions detected by the “baseline” or “prior_1” method but missed by the “prior_2” method is relatively small. Similar comparisons using dynUNet and nnUNet models are shown in Table A.1, Table A.2 of the appendix section.
Table 7.
Comparison of the total number of individual false negative lesions between the “baseline” and “prior_2” methods as well as between the “prior_1” and “prior_2” methods using 3D UNet across different cancer types. Here, “Yes” represents the number of detected lesions and “No” represents the number of undetected lesions. In the case of melanoma, one patient was excluded from the calculations due to the presence of an exceptionally large number of lesions.
Table 8 gives a brief overview of the tumor segmentation results in whole-body PET/CT from several methods that participated in the autoPET grand challenge 2022.
Table 8.
Overview of the 5-fold cross validation (CV) results for whole-body PET/CT tumor segmentation from the autoPET Grand Challenge 2022, as reported by other researchers.
| Method | Model | Description | Dice | Comments |
|---|---|---|---|---|
| 1 [20] | UNet | 2D UNet based tumor segmentation with 5-fold CV strategy | 0.69 | Outperformed by our method. |
| 2 [21] | UNet | Network takes PET and CT as input and outputs 8 channels, one of which is the true segmentation mask and others are auxiliary channels. 40 images were set aside for validation. | 0.80 | CV results are not available. Results reported on a set aside test set. |
| 3 [22] | nnUNet + Swin UNetR | A 5-fold cross-validation was employed with stratification based on sex and diagnosis, and late fusion was applied to enhance the overall Dice. | 0.72 | Outperformed by our method. |
| 4 [24] | nnUNet | Introduced a false positive reduction network for enhanced segmentation performance. | 0.93 | CV results are not available. Only results from preliminary test set reported. |
| 5 [25] | UNet | Simple UNet based training and validation was done with an input size of (192, 192, 192). 103 images were set aside for validation. | 0.75 | CV results are not available. Results reported on a set aside validation set. |
| 6 [26] | nnUNet | Proposed a joint (2D-3D models) whole-body lesion segmentation approach with a patch size of (128, 128, 128). | 0.79 | Performed 5-fold CV but reported results solely on the best performing fold 1 and 2. |
| 7 [27] | nnUNet | Proposed a 2 step approach: first, generating a prior using the normal appearance autoencoder, and second, incorporating this prior into the segmentation network. | 0.70 | Outperformed by our method. |
| 8 [28] | nnUNet | Proposed to use nnUNet with Graph convolutional network (GCN) refinement. 30 images were set aside for validation. | 0.76 | CV results are not available. Results reported on a set aside validation set. |
| 9 [18] | 3D UNet | Proposed to use a segmentation prior- based approach for enhanced tumor segmentation. | 0.70 | Outperformed by our method. |
| Ours | nnUNet | Described in section 3 of this paper. | 0.74 | - |
Fig. 3 [a] - [c] shows the visualization of patients with tumors difficult to segment (especially small and low FDG-uptake ones) that are missed by the nnUNet (baseline) model but are detected by the nnUNet (prior_2) model.
Figure 3.
Visualization of tumor prediction results that are missed by the nnUNet (baseline) model but are picked up by the nnUNet (prior_2) model. TPs (True positives) are shown in green, FNs (false negatives) in red, FPs (false positives) in blue. Figures shown in [a], [b], [c] are examples of tumors that are difficult to segment, from the autoPET cohort, because of their small size or low FDG uptake.
Detailed comparison between 3D UNet, dynUNet, and nnUNet is provided in Table A.3 of the appendix section to assess their complexity and computational requirements.
A detailed comparison between the baseline, prior_1, and prior_2 methods, in terms of computational costs and other technical parameters, is provided in Table A.4 of the appendix section.
Finally, we investigated the optimal patch size for the 3D tumor segmentation task by training a 3D UNet using various patch sizes. A patch size of (160, 160, 160) gave the best results compared to the other patch sizes, as shown in Table A.5 in the appendix section.
4. Discussion
In this study, we have introduced a tissue-wise multi-channel projection-based approach to reconstruct a segmentation prior dedicated for automated tumor segmentation in whole-body FDG-PET/CT. In our proposed method, the segmentation prior was used as an additional input channel to enhance the overall segmentation performance. We have demonstrated the effectiveness of our proposed approach (prior_2) in significantly improving the tumor segmentation performance across various cancer types in the autoPET cohort, compared to a baseline and a previously published prior_1 [18] method. See Table 3, Table 4, Table 5 for details. In addition, the prior_2 method showed equal or superior performance compared to the baseline and prior_1 method for all three networks evaluated, with nnUNet as the best performing network overall. However, the cross-validation using nnUNet was observed to be computationally expensive, requiring higher RAM capacity compared to 3D UNet and dynUNet (see Table A.3 of the appendix section).
In an internal test set (U-CAN cohort with DLBCL cases), the prior_2 method outperformed the baseline and prior_1 methods in terms of Dice coefficient across different segmentation networks (3D UNet, dynUNet, nnUNet), as shown in Table 6. This signifies that the prior_2 method has superior generalization performance on a previously unseen dataset, and that the additional information provided by the prior_2 method enhances the segmentation performance compared to the baseline and prior_1 methods.
Our approach involves segmentation of tumors from multi-channel multi-angled SUV projections to create the segmentation priors, with the CT information used to generate the multi-channel SUV projections. A key advantage of this approach lies in the simplicity of training a 2D segmentation network to estimate the approximate location of lesions, compared to the more complex task of 3D segmentation (without the segmentation prior). This is especially helpful for extremely small lesions that otherwise are difficult to segment, as small lesions are accentuated relative to normal tissue in MIPs due to the projection process. As a result, there is a notable reduction in class imbalance between tumor and non-tumor pixels, as well as less ambiguity at tumor boundaries. This characteristic makes the 2D segmentation network more adept at accurately distinguishing tumors from background. Moreover, since we are explicitly segmenting the same lesions from multiple angles, the network's tendency to overlook a lesion from one angle is compensated by its detection capability from other angles. The ultimate goal is to segment the maximum number of lesions from multiple directions. When applying backprojection, regions with higher overlapping tumor regions between projections receive higher weighting compared to less overlapping tumor regions. By using the segmentation prior, the 3D segmentation network can capture extremely small and difficult tumor regions more effectively.
In the context of 2D tumor segmentation, the SUV MIP channel is potentially the primary source of tumor-related contrast. However, our hypothesis suggested that inclusion of multi-channel projections, such as bone, lean tissue, adipose tissue, and air, alongside the SUV MIP, could significantly improve the 2D tumor segmentation. The rationale behind this hypothesis originates from the observation that the inclusion of additional channels could offer supplementary tumor-related information, thereby complementing the data provided by the SUV MIP channel. Also, it is important to acknowledge that not all information from the 3D SUV image can be preserved within a single MIP channel due to the projection process. Therefore, by extracting supplementary information in the form of the bone, lean, adipose, and air MIPs, additional tumor-related information, unavailable in the SUV MIP, could be provided. This additional information enables the 2D network to effectively segment the challenging lesions. Consequently, the utilization of multi-channel inputs improves the overall 2D segmentation accuracy by complementing the information obtained from the SUV MIP and capturing tumor-related features that may be obscured in the SUV MIP alone. The results in Table 2 clearly demonstrate the improved performance achieved by employing multi-channel inputs compared to using the SUV MIP channel alone, across all cancer types. As a result, the integration of multi-channel MIPs leads to a more robust reconstruction of the segmentation prior, thereby enhancing the performance of 3D tumor segmentation. Table 3, Table 4, and Table 5 demonstrate the superiority of the “prior_2” method (multi-channel input) over “prior_1” (SUV MIP alone), across all cancer types. While prior_2 proved to be beneficial, particularly for 3D UNet and dynUNet, the advantages were less pronounced for nnUNet.
While Table 3, Table 4, Table 5 highlight the increased Dice coefficients achieved by the proposed method, the clinical relevance needs to be emphasized. The improved segmentation accuracy could directly be beneficial for automated quantification of tumor burden, a known prognostic factor for assessing disease progression and an appealing biomarker for clinical trials. In a more long-term perspective, accurate identification and quantification of individual tumors could improve and streamline cancer diagnostics for positive effects on patient management and outcome.
In general, the 3D baseline method has limited performance in detecting very small and low FDG uptake lesions due to higher class imbalance and low contrast. By incorporating a segmentation prior, the network can learn to focus on highlighted regions of probable tumors, resulting in more reliable segmentation masks. This approach proves particularly effective in detecting small and low FDG uptake lesions, as demonstrated in Fig. 2 and Fig. A.1 of the appendix section. Fig. 2 demonstrates significant improvement in tumor segmentation Dice across various MTV groups, particularly for lesions with low MTV (such as V1), in all cancer types. Similarly, Fig. A.1 in the appendix section demonstrates significant improvement in tumor segmentation Dice across various SUVmean groups (here SUVmean corresponds to the mean SUV of the individual lesions), especially for lesions with lower SUVmean (S1), in all cancer types. This is important in a clinical setting, where the detection of challenging tumors, especially small tumors and tumors with low FDG uptake is crucial as they are more likely to be overlooked by radiologists during manual evaluation. Neglecting these tumors could lead to their spread to healthy tissues, undermining the effectiveness of treatment.
Table 7 provides a quantitative overview of the number of lesions detected by the prior_2 method but missed by the baseline or prior_1 methods, and vice versa, across all cancer types using the 3D UNet model. The majority of FN lesions escaping detection by the baseline method were successfully detected by the prior_2 method (see Table 7, [a] Lymphoma = 293; [c] Lung cancer = 323; [e] Melanoma = 245). However, some of the FN lesions missed by the prior_2 method were captured by the baseline method (see Table 7, [a] Lymphoma = 76; [c] Lung cancer = 42; [e] Melanoma = 56). This discrepancy can be attributed to uncertainties in the overall optimization process. Nevertheless, in general, the prior_2 method demonstrated superior performance compared to the baseline method and was able to segment a greater number of individual lesions without any additional FPs (see Table 3, Table 4, Table 5). Furthermore, there were a few instances where both methods struggled to accurately segment the target lesions (see Table 7, [a] Lymphoma = 304; [c] Lung cancer = 400; [e] Melanoma = 234). A similar comparison between the prior_2 and prior_1 methods is shown in Table 7 [b], [d], [f]. Additionally, quantitative analysis of the number of detected lesions using the dynUNet and nnUNet models is provided in Tables A.1 [a]-[f] and A.2 [a]-[f] in the appendix section. In general, prior_2 demonstrated superior performance in terms of additional lesions detected compared to prior_1 and the baseline models, both for 3D UNet and dynUNet. However, the improvement was smaller in the case of nnUNet.
During a follow-up analysis, we found that certain FP predictions by the prior_2 method corresponded to actual lesions previously missed by the human annotator (FP predictions determined by a radiologist with 5 years of experience). Manual detection of these additional FN lesions would require a significant investment of time and labor for re-assessment of the entire cohort. However, the proposed framework can provide valuable assistance to radiologists in expediting the comprehensive manual evaluation process, thereby increasing the likelihood of avoiding any lesion oversight. The segmentation prior can also inform about the tumor-related importance of different body regions as it resembles a probability distribution for tumors. It can be used by radiologists during cancer screening or follow-up analysis and can be adjusted manually, saving valuable time. It also directs the deep learning network's focus towards highlighted regions, reducing overall uncertainty in the optimization and leading to improved segmentation masks.
The autoPET grand challenge [29] has led to a surge in the development of automated whole-body PET/CT tumor segmentation methods. A comprehensive summary of tumor segmentation results from state-of-the-art methods from the grand challenge is summarized in Table 8 [20] [21] [22] [24] [25] [26] [27] [28]. Overall, our proposed method demonstrated superior performance compared to most methods. However, it is worth noting that a few methods cannot be directly compared to ours due to the absence of 5-fold cross-validation results.
While our proposed method demonstrated superior performance compared to the baseline method, it is important to consider the computational cost and training time for practical implementation in clinical settings. The proposed method requires higher computational resources compared to the baseline, particularly in terms of RAM capacity, during the reconstruction of the segmentation prior (see Table A.4 in the appendix section). However, it is noteworthy that while the proposed method delivers improved segmentation results, it does not increase the model complexity. As a result, the training times remain comparable to those of the baseline method.
Integrating the proposed tumor segmentation framework into clinical workflows requires a diverse dataset for robust model training and validation. It also presents several challenges, such as ensuring compatibility with existing systems like PACS and maintaining data privacy and security. Successful integration into clinical practice necessitates collaboration with healthcare professionals to gain clinical validation and acceptance through rigorous trials. Additionally, obtaining regulatory approval is essential to ensure the framework's safety, efficacy, and compliance with healthcare standards. Overcoming these challenges requires an iterative development process, ongoing validation to meet clinical standards, and ensuring that the framework is scalable and generalizable across various patient populations and healthcare facilities.
In future work, we intend to explore the use of iterative reconstruction with backprojection techniques to enhance the quality of the segmentation prior and investigate their impact on the overall tumor segmentation. We also intend to investigate the application of different smoothing based transfer functions, like sigmoid functions, as an alternative to hard thresholding on the CT HUs. Future plans also include application of the tissue-wise multi-channel PET/CT projections to predict clinical outcomes such as overall survival.
5. Limitations
In rare instances, our 2D segmentation network failed completely to segment the tumors using the multi-directional 2D projections. In such cases, the segmentation prior did not provide any valuable additional information. Consequently, integrating such priors into the 3D segmentation network did not yield any improvement compared to the baseline network; the Dice coefficient remained identical for the baseline and prior networks in these cases.
6. Conclusion
We have introduced a multi-channel, multi-angled projection-based approach for the reconstruction of a segmentation prior for tumors in FDG-PET/CT images. Inclusion of the segmentation prior enhanced 3D tumor segmentation accuracy, outperforming the baseline across three cancer types, particularly improving detection of small and low-FDG uptake lesions often missed by radiologists. This highlights the potential of the proposed framework as a valuable tool for the radiologist to perform automated quantification of tumor volume with future potential to streamline lesion-wise monitoring and enable faster and more reliable follow-up evaluations.
7. Abbreviations
| PET | Positron Emission Tomography |
| CT | Computed Tomography |
| FDG | 18F-fluorodeoxyglucose |
| SUV | Standardized uptake value |
| HU | Hounsfield Units |
| WHO | World Health Organization |
| CNN | Convolutional neural network |
| MIP | Maximum intensity projection |
| TCIA | The Cancer Imaging Archive |
| DL | Dice loss |
| FL | Focal loss |
| HD95 | Hausdorff distance (95th percentile) |
| ASD | Average surface distance |
| TMTV | Total metabolic tumor volume |
| TP | True positive |
| FN | False negative |
| FP | False positive |
Ethics approval and consent to participate
Ethical approval to conduct retrospective image analysis on the autoPET and U-CAN datasets was obtained from the Swedish Ethical Review Authority with reference number Dnr 2023-02312-02. The study was conducted in accordance with relevant guidelines and regulations, including the Declaration of Helsinki.
Consent for publication
Not Applicable.
Funding
This study was supported by the Swedish Cancer Society (201303 PjF 01 H), Lions Cancer Fund Uppsala and Makarna Eriksson foundation.
CRediT authorship contribution statement
Sambit Tarai: Writing – original draft, Visualization, Validation, Methodology, Investigation, Formal analysis, Conceptualization. Elin Lundström: Writing – review & editing, Validation, Supervision, Project administration, Investigation, Formal analysis. Nouman Ahmad: Writing – review & editing, Visualization, Formal analysis. Robin Strand: Writing – review & editing, Supervision, Formal analysis. Håkan Ahlström: Writing – review & editing, Validation, Supervision, Funding acquisition, Data curation. Joel Kullberg: Writing – review & editing, Validation, Supervision, Methodology, Investigation, Funding acquisition, Formal analysis.
Declaration of Competing Interest
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests:
Håkan Ahlström reports that financial support was provided by the Swedish Cancer Society. Joel Kullberg and Håkan Ahlström report a relationship with Antaros Medical AB that includes employment and equity or stocks. Sambit Tarai reports a relationship with Antaros Medical AB that includes employment. Elin Lundström, Nouman Ahmad, and Robin Strand report no competing interests.
Acknowledgements
We would like to acknowledge the significant help provided by Cemine Starrost (Medical student) and Alexander Korenyushkin (Radiologist) in creating the ground truth segmentation masks for the DLBCL cases.
Appendix A. Additional results
Table A.1.
Comparison of the total number of individual false negative lesions between the “baseline” and “prior_2” methods as well as between the “prior_1” and “prior_2” methods using dynUNet across different cancer types. Here, “Yes” represents the number of detected lesions and “No” represents the number of undetected lesions. In the case of melanoma, one patient scan was excluded from the calculations due to the presence of an exceptionally large number of lesions.
Table A.2.
Comparison of the total number of individual false negative lesions between the “baseline” and “prior_2” methods as well as between the “prior_1” and “prior_2” methods using nnUNet across different cancer types. Here, “Yes” represents the number of detected lesions and “No” represents the number of undetected lesions. In the case of melanoma, one patient scan was excluded from the calculations due to the presence of an exceptionally large number of lesions.
Table A.3.
Comparison of different segmentation networks (3D UNet, dynUNet, nnUNet) across various criteria. In all cases, UNet was used as the backbone architecture.
| Comparison criteria | 3D UNet | dynUNet | nnUNet |
|---|---|---|---|
| [1] Total trainable parameters | 4807345 | 62839746 | 49339250 |
| [2] Model size (MB) | 57.9 | 377.2 | 250 |
| [3] Training time per epoch (seconds) | 210 | 220 | 260 |
| [4] Validation time per epoch (seconds) | 60 | 80 | 70 |
| [5] Inference time per subject (seconds) | 4 | 6 | 60 |
| [6] Maximum batch size with current hardware | 4 | 1 | 1 |
| [7] Internal RAM usage | low | medium | high |
Table A.4.
Comparison of different segmentation approaches (baseline, prior_1, prior_2) across various criteria. In all cases, 3D UNet was used as the backbone architecture.
| Comparison criteria | baseline | prior_1 | prior_2 |
|---|---|---|---|
| [1] Total trainable parameters | 4,807,345 | 4,808,209 | 4,808,209 |
| [2] Model size (MB) | 57.9 | 58.2 | 58.2 |
| [3] Training time per epoch (seconds) | 210 | 210 | 210 |
| [4] Validation time per epoch (seconds) | 60 | 60 | 60 |
| [5] Segmentation prior | No | Yes | Yes |
| [6] Segmentation prior generation (RAM usage) | n/a | high | high |
| [7] Performance | Good | Better | Best |
Table A.5.
Comparison of segmentation accuracy using different input patch sizes with the 3D UNet as the architecture. Data from all cancer types (lymphoma, lung cancer, melanoma) in the autoPET cohort were consolidated, with 80% used for training and the remaining 20% for validation. The Dice coefficient is reported for the validation subset.
| CT | SUV | prior_2 | Patch size | Network | Dice |
|---|---|---|---|---|---|
| ✓ | ✓ | ✓ | (96, 96, 96) | 3D UNet | 0.6213 |
| ✓ | ✓ | ✓ | (128, 128, 128) | 3D UNet | 0.6285 |
| ✓ | ✓ | ✓ | (160, 160, 160) | 3D UNet | 0.6618 |
| ✓ | ✓ | ✓ | (208, 208, 208) | 3D UNet | 0.6427 |
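The patch sizes in Table A.5 refer to the spatial size of the 3D crops processed by the network. The sketch below illustrates, under assumed settings, how a chosen patch size could be applied at inference time with MONAI [9] via sliding-window inference over a whole-body volume; the network configuration, volume shape, and overlap value are illustrative assumptions rather than the training setup used in this study.

```python
import torch
from monai.networks.nets import UNet
from monai.inferers import sliding_window_inference

PATCH_SIZE = (160, 160, 160)  # best-performing setting in Table A.5

# Illustrative 3-channel input: CT, SUV, and the prior_2 segmentation prior,
# stacked along the channel dimension (shape: batch, channel, x, y, z).
volume = torch.rand(1, 3, 192, 192, 192)

model = UNet(
    spatial_dims=3,
    in_channels=3,
    out_channels=2,
    channels=(16, 32, 64, 128, 256),
    strides=(2, 2, 2, 2),
    num_res_units=2,
).eval()

# Whole-body volumes rarely fit in GPU memory at once, so inference is run
# patch-wise and the overlapping predictions are blended back together.
with torch.no_grad():
    logits = sliding_window_inference(
        inputs=volume,
        roi_size=PATCH_SIZE,
        sw_batch_size=1,
        predictor=model,
        overlap=0.25,
    )

tumor_mask = logits.argmax(dim=1)  # binary tumor segmentation mask
```

In practice, the roi_size used at inference is typically matched to the patch size used during training, so the memory/context trade-off reported in Table A.5 applies to both stages.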
Data availability
The autoPET dataset is publicly available at The Cancer Imaging Archive (TCIA). The U-CAN dataset is not publicly available but can be made available upon request and approval. The code for the automated framework is available in the following GitHub repository: https://github.com/sambittarai/Tumor-segmentation-from-PET-CT-followed-by-clinical-parameter-estimation.
References
- 1. Sung H., Ferlay J., Siegel R.L., Laversanne M., Soerjomataram I., Jemal A., Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2021;71(3):209–249. doi: 10.3322/caac.21660.
- 2. Hu C., Liu C.-P., Cheng J.-S., Chiu Y.-L., Chan H.-P., Peng N.-J. Application of whole-body FDG-PET for cancer screening in a cohort of hospital employees. Medicine. 2016;95(44). doi: 10.1097/MD.0000000000005131.
- 3. Boellaard R., Delgado-Bolton R., Oyen W.J., Giammarile F., Tatsch K., Eschner W., Verzijlbergen F.J., Barrington S.F., Pike L.C., Weber W.A., et al. FDG PET/CT: EANM procedure guidelines for tumour imaging: version 2.0. Eur. J. Nucl. Med. Mol. Imaging. 2015;42:328–354. doi: 10.1007/s00259-014-2961-x.
- 4. Vercellino L., Cottereau A.-S., Casasnovas O., Tilly H., Feugier P., Chartier L., Fruchart C., Roulin L., Oberic L., Pica G.M., et al. High total metabolic tumor volume at baseline predicts survival independent of response to therapy. Blood. 2020;135(16):1396–1405. doi: 10.1182/blood.2019003526.
- 5. Ronneberger O., Fischer P., Brox T. U-Net: convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015: 18th International Conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III, vol. 18. Springer; 2015. pp. 234–241.
- 6. Zhou Z., Rahman Siddiquee M.M., Tajbakhsh N., Liang J. UNet++: a nested U-Net architecture for medical image segmentation. In: Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 20, 2018, Proceedings, vol. 4. Springer; 2018. pp. 3–11.
- 7. Zhou Z., Siddiquee M.M.R., Tajbakhsh N., Liang J. UNet++: redesigning skip connections to exploit multiscale features in image segmentation. IEEE Trans. Med. Imaging. 2019;39(6):1856–1867. doi: 10.1109/TMI.2019.2959609.
- 8. Isensee F., Jaeger P.F., Kohl S.A., Petersen J., Maier-Hein K.H. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods. 2021;18(2):203–211. doi: 10.1038/s41592-020-01008-z.
- 9. Cardoso M.J., Li W., Brown R., Ma N., Kerfoot E., Wang Y., Murrey B., Myronenko A., Zhao C., Yang D., et al. MONAI: an open-source framework for deep learning in healthcare. arXiv preprint arXiv:2211.02701; 2022. doi: 10.48550/arXiv.2211.02701.
- 10. Leung K., Rowe S., Sadaghiani M., Leal J., Mena E., Choyke P., Du Y., Pomper M. Fully automated whole-body tumor segmentation on PET/CT using deep transfer learning; 2024.
- 11. Huang Z., Zou S., Wang G., Chen Z., Shen H., Wang H., Zhang N., Zhang L., Yang F., Wang H., et al. ISA-Net: improved spatial attention network for PET-CT tumor segmentation. Comput. Methods Programs Biomed. 2022;226. doi: 10.1016/j.cmpb.2022.107129.
- 12. Oreiller V., Andrearczyk V., Jreige M., Boughdad S., Elhalawani H., Castelli J., Vallieres M., Zhu S., Xie J., Peng Y., et al. Head and neck tumor segmentation in PET/CT: the HECKTOR challenge. Med. Image Anal. 2022;77. doi: 10.1016/j.media.2021.102336.
- 13. Gatidis S., Früh M., Fabritius M., Gu S., Nikolaou K., La Fougère C., Ye J., He J., Peng Y., Bi L., et al. The autoPET challenge: towards fully automated lesion segmentation in oncologic PET/CT imaging; 2023. doi: 10.21203/rs.3.rs-2572595/v1.
- 14. Pedrosa J., Aresta G., Ferreira C., Atwal G., Phoulady H.A., Chen X., Chen R., Li J., Wang L., Galdran A., et al. LNDb challenge on automatic lung cancer patient management. Med. Image Anal. 2021;70. doi: 10.1016/j.media.2021.102027.
- 15. Kaluva K.C., Vaidhya K., Chunduru A., Tarai S., Nadimpalli S.P.P., Vaidya S. An automated workflow for lung nodule follow-up recommendation using deep learning. In: Image Analysis and Recognition: 17th International Conference, ICIAR 2020, Póvoa de Varzim, Portugal, June 24–26, 2020, Proceedings, Part II, vol. 17. Springer; 2020. pp. 369–377.
- 16. Ghaffari M., Sowmya A., Oliver R. Automated brain tumor segmentation using multimodal brain scans: a survey based on models submitted to the BraTS 2012–2018 challenges. IEEE Rev. Biomed. Eng. 2019;13:156–168. doi: 10.1109/RBME.2019.2946868.
- 17. Yousefirizi F., Klyuzhin I.S., Harsini S., Tie X., Shiri I., Shin M., Lee C., Cho S.Y., Bradshaw T.J., Zaidi H., et al. TMTV-Net: fully automated total metabolic tumor volume segmentation in lymphoma PET/CT images – a multi-center generalizability analysis. Eur. J. Nucl. Med. Mol. Imaging. 2024:1–18. doi: 10.1007/s00259-024-06616-x.
- 18. Tarai S., Lundström E., Sjöholm T., Jönsson H., Korenyushkin A., Ahmad N., Pedersen M.A., Molin D., Enblad G., Strand R., et al. Improved automated tumor segmentation in whole-body 3D scans using multi-directional 2D projection-based priors. Heliyon. 2024. doi: 10.1016/j.heliyon.2024.e26414.
- 19. Tarai S., Lundström E., Öfverstedt J., Jönsson H., Ahmad N., Ahlström H., Kullberg J. Prediction of total metabolic tumor volume from tissue-wise FDG-PET/CT projections, interpreted using cohort saliency analysis. In: Medical Imaging with Deep Learning; 2024.
- 20. Zhong S., Mo J., Liu Z. AutoPET challenge 2022: automatic segmentation of whole-body tumor lesion based on deep learning and FDG PET/CT. arXiv preprint arXiv:2209.01212; 2022. doi: 10.48550/arXiv.2209.01212.
- 21. Liu Z., Zhong S., Mo J. AutoPET challenge 2022: step-by-step lesion segmentation in whole-body FDG-PET/CT. arXiv preprint arXiv:2209.09199; 2022. doi: 10.48550/arXiv.2209.09199.
- 22. Heiliger L., Marinov Z., Ferreira A., Fragemann J., Murray J., Kersting D., Stiefelhagen R., Kleesiek J. AutoPET challenge: combining nnU-Net with Swin UNETR augmented by maximum intensity projection classifier. arXiv preprint arXiv:2209.01112; 2022. doi: 10.48550/arXiv.2209.01112.
- 23. Ahmad N., Strand R., Sparresäter B., Tarai S., Lundström E., Bergström G., Ahlström H., Kullberg J. Automatic segmentation of large-scale CT image datasets for detailed body composition analysis. BMC Bioinform. 2023;24(1):346. doi: 10.1186/s12859-023-05462-2.
- 24. Peng Y., Kim J., Feng D., Bi L. Automatic tumor segmentation via false positive reduction network for whole-body multi-modal PET/CT images. arXiv preprint arXiv:2209.07705; 2022. doi: 10.48550/arXiv.2209.07705.
- 25. Ye J., Wang H., Huang Z., Deng Z., Su Y., Tu C., Wu Q., Yang Y., Wei M., Niu J., et al. Exploring vanilla U-Net for lesion segmentation from whole-body FDG-PET/CT scans. arXiv preprint arXiv:2210.07490; 2022. doi: 10.48550/arXiv.2210.07490.
- 26. Zhang J., Huang Y., Zhang Z., Shi Y. Whole-body lesion segmentation in 18F-FDG PET/CT. arXiv preprint arXiv:2209.07851; 2022. doi: 10.48550/arXiv.2209.07851.
- 27. Bendazzoli S., Astaraki M. PriorNet: lesion segmentation in PET-CT including prior tumor appearance information. arXiv preprint arXiv:2210.02203; 2022. doi: 10.48550/arXiv.2210.02203.
- 28. Xue H., Fang Q., Yao Y., Teng Y. 3D PET-CT tumor lesion segmentation via GCN refinement. arXiv preprint arXiv:2302.12571; 2023. doi: 10.48550/arXiv.2302.12571.
- 29. Gatidis S., Hepp T., Früh M., La Fougère C., Nikolaou K., Pfannenberg C., Schölkopf B., Küstner T., Cyran C., Rubin D. A whole-body FDG-PET/CT dataset with manually annotated tumor lesions. Sci. Data. 2022;9(1):601. doi: 10.1038/s41597-022-01718-3.
- 30. Glimelius B., Melin B., Enblad G., Alafuzoff I., Beskow A., Ahlström H., Bill-Axelson A., Birgisson H., Björ O., Edqvist P.-H., et al. U-CAN: a prospective longitudinal collection of biomaterials and clinical information from adult cancer patients in Sweden. Acta Oncol. 2018;57(2):187–194. doi: 10.1080/0284186X.2017.1337926.
- 31. Clark K., Vendt B., Smith K., Freymann J., Kirby J., Koppel P., Moore S., Phillips S., Maffitt D., Pringle M., et al. The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository. J. Digit. Imag. 2013;26:1045–1057. doi: 10.1007/s10278-013-9622-7.
- 32. Jönsson H., Ekström S., Strand R., Pedersen M.A., Molin D., Ahlström H., Kullberg J. An image registration method for voxel-wise analysis of whole-body oncological PET-CT. Sci. Rep. 2022;12(1). doi: 10.1038/s41598-022-23361-z.
- 33. Kullberg J., Hedström A., Brandberg J., Strand R., Johansson L., Bergström G., Ahlström H. Automated analysis of liver fat, muscle and adipose tissue distribution from CT suitable for large-scale studies. Sci. Rep. 2017;7(1). doi: 10.1038/s41598-017-08925-8.
- 34. Angermann C., Haltmeier M., Steiger R., Pereverzyev S., Gizewski E. Projection-based 2.5D U-Net architecture for fast volumetric segmentation. In: 2019 13th International Conference on Sampling Theory and Applications (SampTA). IEEE; 2019. pp. 1–5.
- 35. Milletari F., Navab N., Ahmadi S.-A. V-Net: fully convolutional neural networks for volumetric medical image segmentation. In: 2016 Fourth International Conference on 3D Vision (3DV). IEEE; 2016. pp. 565–571.
- 36. Sudre C.H., Li W., Vercauteren T., Ourselin S., Cardoso M.J. Generalised Dice overlap as a deep learning loss function for highly unbalanced segmentations. In: Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: Third International Workshop, DLMIA 2017, and 7th International Workshop, ML-CDS 2017, Held in Conjunction with MICCAI 2017, Québec City, QC, Canada, September 14, 2017, Proceedings, vol. 3. Springer; 2017. pp. 240–248.
- 37. Lin T.-Y., Goyal P., Girshick R., He K., Dollár P. Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision; 2017. pp. 2980–2988.
- 38. Herman G.T. Fundamentals of Computerized Tomography: Image Reconstruction from Projections. Springer Science & Business Media; 2009.
- 39. Mistretta C.A., Wieben O., Velikina J., Block W., Perry J., Wu Y., Johnson K., Wu Y. Highly constrained backprojection for time-resolved MRI. Magn. Reson. Med. 2006;55(1):30–40. doi: 10.1002/mrm.20772.
- 40. Pan C., Schoppe O., Parra-Damas A., Cai R., Todorov M.I., Gondi G., von Neubeck B., Böğürcü-Seidel N., Seidel S., Sleiman K., et al. Deep learning reveals cancer metastasis and therapeutic antibody targeting in the entire body. Cell. 2019;179(7):1661–1676. doi: 10.1016/j.cell.2019.11.013.
- 41. Kim K.-S., Oh S.J., Lee J.H., Chung M.J. 3D unsupervised anomaly detection and localization through virtual multi-view projection and reconstruction: clinical validation on low-dose chest computed tomography. arXiv preprint arXiv:2206.13385; 2022. doi: 10.48550/arXiv.2206.13385.