A Multi-Stage Feature Aggregation and Structure Awareness Network for Concrete Bridge Crack Detection

. 2024 Feb 28;24(5):1542. doi: 10.3390/s24051542

Symbols	Description
$ℝ$	real number space
$X$	input tensor
$A_{m a s k}^{k}$	attention mask map of the kth stage
$S_{s i d e}^{k}$	side output of the kth stage
$Γ$	tensor concatenation operation
$σ$	Sigmoid activation function
$U p_{H \times W}$	upsampling to the input image size $H \times W$
$S_{f u s e}$	Output prediction map after multi-stage fusion, i.e., the final prediction result map
$\otimes$	convolution operation
$\oplus$	element-wise addition operation
$⊙$	element-wise multiplication operation
P	the predicted segmentation result
G	the ground truth binary label
N	the total number of pixels in a crack image
$α$	predicting the error weights
$β$	predicting the correct weights
$\| Y^{+} \|$	the number of positive samples in a crack image
$\| Y^{-} \|$	the number of negative samples in a crack image
$λ$	the loss ratio to balance the positive and negative samples
$L_{B W C E} (W)$	the balanced weighted cross-entropy loss
$α_{1}$	false positive weights
$β_{1}$	false negative weights
$L_{T v e r s k y} (W)$	Tversky loss
$L (W)$	$L_{B W C E} (W)$ and $L_{T v e r s k y} (W)$ weighted losses
$η$	the weight of loss $L_{B W C E} (W)$
$L_{t o t a l} (W)$	The final total loss function
$w_{s i d e}^{k}$	the loss weight of the kth stage
$w_{f u s e}$	the loss weight of the final fusion stage
$\sum$	Summation