Table 2.
Five-fold testing: quantitative performance evaluation of the cell segmentation module (DL/IDL) compared to state-of-the-art methods.
| Detection metrics | Semantic metric | Speed metrics | |||||||
|---|---|---|---|---|---|---|---|---|---|
| mIoU (%) | F1 (%) | Accuracy (%) | Precision (%) | Recall (%) | Dice (%) | Train epochs | Inference (s) | ||
| Cellpose17 | 500 | ||||||||
| Stardist16 | 94.80±0.15 | 400 | |||||||
| Ours | Att-UNet+LSTM | 40 | |||||||
| Att-UNet (XAI) | 73.55±1.41 | 84.53±1.55 | 20 | ||||||
| UNet+LSTM | 86.90±1.80 | 77.72±2.53 | 94.04 ± 0.28 | 40 | |||||
| UNet (XAI) | 20 | 0.136±0.0069 | |||||||
| BiONet+LSTM | 40 | ||||||||
| BiONet (XAI) | 20 | ||||||||
The reported results are the , computed over 5-fold testing. The best metrics (per column) are highlighted in bold, and the second-best metrics are underlined. Instance-level segmentation (detection) evaluations were used to assess performance with different metrics (per cell mask). The mIoU (mean Intersection over Union) is calculated as the sum of IoU (cell mask-wise) of the predicted cell masks divided by the ground-truth cell count. To report these metrics, we used between ground truth and predicted masks to compute TP, FP, and FN. The F1 score is defined as , while the , , and . We utilized a semantic segmentation metric (i.e., Dice coefficient) to quantify the foreground/background pixel-wise separation, defined as , where gt is the ground truth mask and pred is the predicted mask (). The training epochs refer to the number of epochs needed to complete the training phase. Inference time (on the test set) per image was computed using the following hardware: an 8-core i7 9700K CPU, 16GB RAM, NVIDIA MSI 2080.