Algorithm 1: Few-Shot Model Compression.

Input: the base class dataset, the validation dataset, and the novel class dataset; the teacher network and the student network; the temperature parameter τ; and the loss-weighting hyperparameters.
Output: the predicted labels of the query samples from the novel class dataset.
Stage 1: Teacher network pre-training
While epoch ≤ maximum number of iterations:
    Randomly select a batch of images from the base class dataset.
    Feed the images into the backbone of the teacher network to extract features.
    Obtain the base class and rotation class probability values.
    Pre-train the teacher network according to Equation (4).
Stage 2: Few-shot model compression
While epoch ≤ maximum number of iterations:
    Randomly select a batch of images from the base class dataset.
    Feed the images separately into the backbones of the teacher network and the student network to extract features.
    Obtain the base class probability values from the teacher network and the student network, respectively.
    Calibrate the feature error distribution between the student network and the teacher network according to Equation (16).
    Calculate the knowledge distillation loss for the intermediate features according to Equation (17).
    Calculate the KL divergence-based loss between the predicted outputs of the student network and the teacher network according to Equation (20).
    Calculate the cross-entropy loss of the student network according to Equation (21).
    Train the student network according to Equation (22).
Stage 3: Few-shot model testing
While epoch ≤ maximum number of iterations:
    Process images from the novel class dataset through the feature extractor to obtain features.
    Train the classifier for the novel classes on the support set.
    Test on the query set of the novel class dataset.
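
To make the three stages more concrete, the sketches below illustrate one possible realization of each stage. Stage 1 pre-trains the teacher with an auxiliary rotation-prediction head; the minimal PyTorch-style sketch below assumes a shared backbone with two linear heads and an equal-style weighting `lam` standing in for the trade-off in the paper's Equation (4), so the function names and the weighting are illustrative, not the paper's exact loss.

```python
import torch
import torch.nn.functional as F

def rotate_batch(images):
    """Create 4 rotated copies (0/90/180/270 degrees) of each image
    and the corresponding rotation-class labels."""
    rotated = [torch.rot90(images, k, dims=(2, 3)) for k in range(4)]
    rot_labels = torch.arange(4).repeat_interleave(images.size(0)).to(images.device)
    return torch.cat(rotated, dim=0), rot_labels

def pretrain_step(backbone, base_head, rot_head, images, labels, optimizer, lam=1.0):
    """One Stage-1 update: classify base classes and rotation classes.
    `lam` is a hypothetical weight standing in for the Eq. (4) trade-off."""
    images_rot, rot_labels = rotate_batch(images)
    labels_rep = labels.repeat(4)          # base label is unchanged by rotation
    feats = backbone(images_rot)           # shared teacher feature extractor
    loss = F.cross_entropy(base_head(feats), labels_rep) \
         + lam * F.cross_entropy(rot_head(feats), rot_labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```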
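For Stage 2, the following hedged sketch shows how the calibrated feature-distillation loss, the temperature-scaled KL loss, and the cross-entropy loss could be combined into a single student update. The `return_features=True` interface, the adapter-plus-normalization form of the calibration, and the weights `alpha`, `beta`, `tau` are assumptions for illustration; Equations (16), (17), and (20)-(22) in the paper give the exact definitions.

```python
import torch
import torch.nn.functional as F

def calibrated_feature_loss(f_s, f_t, adapter):
    """Sketch of Eqs. (16)-(17): map the student feature toward the teacher's
    feature space with a small adapter, normalize both, and penalize the gap.
    The adapter module and the MSE penalty are illustrative assumptions."""
    f_s = F.normalize(adapter(f_s).flatten(1), dim=1)
    f_t = F.normalize(f_t.flatten(1), dim=1)
    return F.mse_loss(f_s, f_t)

def distillation_step(student, teacher, adapter, images, labels,
                      optimizer, tau=4.0, alpha=0.5, beta=1.0):
    """One Stage-2 update combining feature KD, logit KD, and cross-entropy.
    alpha/beta/tau are hypothetical weights standing in for Eq. (22)."""
    with torch.no_grad():
        f_t, logits_t = teacher(images, return_features=True)   # assumed interface
    f_s, logits_s = student(images, return_features=True)

    loss_feat = calibrated_feature_loss(f_s, f_t, adapter)       # Eq. (17)
    loss_kl = F.kl_div(F.log_softmax(logits_s / tau, dim=1),     # Eq. (20)
                       F.softmax(logits_t / tau, dim=1),
                       reduction="batchmean") * tau * tau
    loss_ce = F.cross_entropy(logits_s, labels)                  # Eq. (21)
    loss = loss_ce + alpha * loss_kl + beta * loss_feat          # Eq. (22)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```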
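Stage 3 can be illustrated with a standard episode-level evaluation loop: freeze the compressed student backbone, fit a simple classifier on the support features of each novel-class task, and score the query set. The choice of scikit-learn logistic regression below is an assumption; the listing only states that a classifier is trained for the novel classes.

```python
import numpy as np
import torch
from sklearn.linear_model import LogisticRegression

@torch.no_grad()
def evaluate_episode(backbone, support_x, support_y, query_x, query_y):
    """Fit a linear classifier on support features and report query accuracy.
    Logistic regression is an illustrative choice of novel-class classifier."""
    backbone.eval()
    z_support = backbone(support_x).flatten(1).cpu().numpy()
    z_query = backbone(query_x).flatten(1).cpu().numpy()
    clf = LogisticRegression(max_iter=1000).fit(z_support, support_y.cpu().numpy())
    preds = clf.predict(z_query)
    return float(np.mean(preds == query_y.cpu().numpy()))
```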