Block diagram of the proposed method, where and represents the foreground features extracted from jth pooling layers () at whole-level and part-level, respectively. Similarly, and represents the background features extracted from the jth pooling layer at whole-level and part-level, respectively