Fig. 11.

Qualitative evaluation with GradCAM attribution heatmaps. We visually compare the original and the AttributionScanner-improved models for hair color and bird category classification, respectively. In each sub-figure, the first row refers to the original model (baseline), and the second row refers to the improved model (ours). We can observe AttributionScanner suppresses the spurious correlations by using the correct features for predictions.