Fault Detection for Wind Turbine Blade Bolts Based on GSG Combined with CS-LightGBM

. 2022 Sep 7;22(18):6763. doi: 10.3390/s22186763

Algorithm 1 GSG Pseudo-code

Input:

Training Set(D) = {Set(min), set(max)}

| Set (\min) | = N

, | Set (\max) | = M

# N is the number of fault class samples.

# M is the number of normal class samples.

Output:

A training Set (D′) with more fault class samples than the original training Set(D).

r = N / M

# Calculate the class imbalance rate of the original dataset.

2: b = random(r,1)

3: a = int(M*b-N) # Calculate the number of samples to be expanded in Set(min).

4: C = BIC(Set(min)) # Determine an optimal number of components.

5: G_i = GMM(Set(min), C) # Use GMM to cluster Set(min) into C clusters.

6: for i ←1 to C do

7: Set(G″_i) = {} # Create C empty sample sets.

8: if

| Set {(G^{″}}_{i}) | < a * (| G_{i} | / N)

then

Set (x_new) = SMOTE (G_{i}, int (a * (| G_{i} | / N)))

# Use SMOTE to synthesize new samples based on the G_i cluster samples.

10: G′_I = GMM(Set(min) + Set(x_{new_k}), C)

# Use GMM again to cluster Set(min) and Set(x_{new_k}) into C clusters.

11: for k ←1 to int(a* num(G_i)/N) do

12: if x_{new_k} in G′_i then
# The x_{new_k} is the k-th sample in Set(x_new).

13: Add the x_{new_k} to Set(G″_i)
# Add the x_{new_k} sample to Set(G″i).

14: if x_{new_k} not in G′_i then

15: Remove the x_{new_k}

16: end if

17: end if

18: end for

19: end if

20: Set(N′) = concatenate (Set(G″_i))

21: end for

22: Set(D′) = concatenate (Set(max), Set(min), Set(N′))

23: return Set(D′)