|
Symbols
|
Description
|
|
|
real number space |
|
|
input tensor |
|
|
attention mask map of the kth stage |
|
|
side output of the kth stage |
|
|
tensor concatenation operation |
|
|
Sigmoid activation function |
|
|
upsampling to the input image size
|
|
|
Output prediction map after multi-stage fusion, i.e., the final prediction result map |
|
|
convolution operation |
|
|
element-wise addition operation |
|
|
element-wise multiplication operation |
| P |
the predicted segmentation result |
| G |
the ground truth binary label |
| N |
the total number of pixels in a crack image |
|
|
predicting the error weights |
|
|
predicting the correct weights |
|
|
the number of positive samples in a crack image |
|
|
the number of negative samples in a crack image |
|
|
the loss ratio to balance the positive and negative samples |
|
|
the balanced weighted cross-entropy loss |
|
|
false positive weights |
|
|
false negative weights |
|
|
Tversky loss |
|
|
and weighted losses |
|
|
the weight of loss
|
|
|
The final total loss function |
|
|
the loss weight of the kth stage |
|
|
the loss weight of the final fusion stage |
|
|
Summation |