Skip to main content
Computational Intelligence and Neuroscience logoLink to Computational Intelligence and Neuroscience
. 2022 Apr 25;2022:2698498. doi: 10.1155/2022/2698498

Intelligent Monitoring System Based on Noise-Assisted Multivariate Empirical Mode Decomposition Feature Extraction and Neural Networks

Le Fa Zhao 1,, Shahin Siahpour 2, Mohammad Reza Haeri Yazdi 3, Moosa Ayati 3, Tian Yu Zhao 4,
PMCID: PMC9061033  PMID: 35510053

Abstract

Because of the nonlinearity and nonstationarity in the vibration signals of some rotating machinery, the analysis of these signals using conventional time- or frequency-domain methods has some drawbacks, and the results can be misleading. In this paper, a couple of features derived from multivariate empirical mode decomposition (MEMD) are introduced, which overcomes the shortcomings of the traditional features. A wind turbine gearbox and its bearings are investigated as rotating machinery. In this method, two types of feature structures are extracted from the decomposed signals resulting from the MEMD algorithm, called intrinsic mode function (IMF). The first type of feature vector element is the energy moment of effective IMFs. The other type of vector elements is amplitudes of a signal spectrum at the characteristic frequencies. A correlation factor is used to detect effective IMFs and eliminate the redundant IMFs. Since the basic MEMD algorithm is sensitive to noise, a noise-assisted extension of MEMD, NA-MEMD, is exploited to reduce the effect of noise on the output results. The capability of the proposed feature vector in health condition monitoring of the system is evaluated and compared with traditional features by using a discrimination factor. The proposed feature vector is utilized in the input layer of the classical three-layer backpropagation neural network. The results confirm that these features are appropriate for intelligent fault detection of complex rotating machinery and can diagnose the occurrence of early faults.

1. Introduction

With the advent of new era of Industry 4.0, the human and machine interaction has dramatically changed [1]. The improvement and advancement in intelligent systems have paved the way for the better use of smart devices. This shifts traditional human-machine interactions (HMI) toward intelligent human-machine interactions. The application of intelligent HMI ranges from medical scenarios to industrial applications [25] (e.g., robotics, energy, maintenance, and semiconductor manufacturing). Among the key drivers of the transition from traditional to intelligent HMI, progress in machine learning and intelligent algorithms constitutes the main portion of importance [69].

Monitoring the condition of rotating machinery plays an important role in the engineering industries [10, 11]. To detect early faults and fully inspect the health condition of rotating systems, a condition monitoring structure is required to operate as soon as possible [12, 13]. The main objective of exploiting condition monitoring systems is to improve accuracy by lowering costs. The extraction of fault characteristics from these types of systems is a key step in the process of fault detection and condition monitoring [14].

Signals from complex rotating machinery are usually nonstationary and nonlinear, and extracting features that lead to a desirable outcome has become a challenging process. Features are the parameters that are derived from signals to indicate the characteristics of systems. So far, various features that can be extracted from vibration signals have been investigated [1517]. Signal processing to extract fault features is divided into three main domains: time domain, frequency domain, and time-frequency domain. Some conventional time-domain methods are skewness and kurtosis [18] or root mean square (RMS) and peak value of a signal [19]. Frequency analysis mostly contains Fourier spectra of a time series signal, cepstrum analysis, or envelope analysis [20, 21]. These features are in the time or frequency domain and are mostly extracted from raw vibration signals. In the presence of nonlinearity and nonstationarity in the signal, traditional features cannot have an accurate distinction between system conditions [22]. Because of these problems, time-frequency analysis of complex signals is introduced as an application of feature extraction. Time-frequency methods, such as the short-time Fourier transform [23], wavelet transform [24], empirical mode decomposition (EMD) [25], or Wigner–Ville [26], analyze signals in both time and frequency domains. Therefore, features can contain more comprehensive information of signals.

With the advent of a new time-frequency method, named Hilbert–Huang transform (HHT) [27], many studies have been conducted using this method in the field of signal processing [2830]. HHT is a powerful algorithm useful for nonlinear and nonstationary signals, performing an adaptive decomposition operation called empirical mode decomposition (EMD). The decomposed signals, named intrinsic mode functions (IMFs), are almost monocomponents which satisfy Hilbert transform terms. Each IMF covers a small range of frequency scales. This characteristic of IMFs makes them a suitable tool for the analysis of complex systems. EMD algorithm is sensitive to noise. When signals are noisy, the mode-mixing phenomenon can occur in IMFs [31]. In this situation, either a single IMF carries a signal of a widely disparate scale, or a single mode (or scaling) exists in more than one IMF. To overcome this phenomenon, Ensemble EMD (EEMD) is proposed [32].

When the system contains many components and has comprehensive information from all over the system, multiple sensors are located on different parts of the system. In this condition, the signals obtained from the sensors are a kind of multivariate signals. If the EMD algorithm is used on each signal individually, joint information will be wasted [33]. Furthermore, the same group of IMFs may have different characteristic information [34]. To overcome these problems, Riling et al. [35] proposed bivariate EMD. In this method, by mapping the bivariate signal in different directions, the local mean of the signal is calculated. To continue this idea, in 2010, Rehman and Mandic [36] proposed an empirical mode decomposition algorithm for trivariate signals. After that, they proposed an extension to their method and introduced multivariate EMD (MEMD) to deal with multidimensional signals [37]. This method allows us to analyze multidimensional signals simultaneously and covers the problem of using the EMD method for these kinds of signals. Zhao et al. [38] employ multivariate EMD method to extract some health condition information of the studied system. In their study, they used full spectrum based condition monitoring for rotating machinery. Lv et al. [33] used multivariate EMD as an application to investigate the health conditions of the patients.

Each IMF order resulting from the MEMD algorithm has the same frequency characteristic. This capability makes the MEMD algorithm a suitable method for feature extraction to diagnose faults in rotating systems. Some of the IMFs are spurious and need to be eliminated from the calculation to speed up the process of feature extraction and make the feature vector smaller without losing accuracy. Some IMFs are high-frequency ones, which can be regarded as noisy IMFs. In contrast, some IMFs contain low-frequency characteristics that exist due to the stopping criteria of the EMD algorithm and do not have physical meaning. Effective IMFs can be detected by user experience, but to make the process faster, a criterion or factor must be used. Ricci et al. [39] introduced a merit index that automatically selects the effective IMFs and eliminates the spurious ones. This index is based on the symmetrical and periodic IMF specifications. In [38], a sensitivity factor which is based on mutual information is proposed. In [33], a correlation factor is introduced to detect the most effective IMFs and, as is obvious from the name of the factor, it is based on the correlation between the signal and each IMF.

The features derived from the signals can be implemented as input for an artificial neural network (ANN) system [40] or can be used for a support vector machine (SVM) [41] to analyze the conditions of the system intelligently and automatically. Yang et al. [42] extract bearing health characteristics using the energy of decomposed IMFs. They compare the output results from a simple ANN while the features are derived from wavelet analysis. Bin et al. [43] used a combined method of wavelet packet decomposition (WPD) and EMD to extract fault features from a bearing mechanism as rotating machinery. In their study, the energy moment from the IMFs is used as the feature vector. WPD is used to denoise and preprocess a signal.

To address the aforementioned issues and challenges, an intelligent feature extraction is proposed. The following are the main novelties and contributions of this study:

  1. The NA-MEMD algorithm is used as a feature extraction method.

  2. Correlation analysis is used to detect effective IMFs.

  3. In addition to the energy moment of effective IMFs, an amplitude factor in the frequency domain is introduced as a complementary element for the feature vector.

  4. To show the capability of the proposed features in the diagnosis of system conditions, a discrimination criterion is exploited to make the comparison tangible. Features are then used for a backpropagation (BP) neural network input layer.

  5. The proposed algorithm can be used for analyzing the features of the data from the athletes and the fault analysis of the key mechanical components in the sport field. This paper focuses on the analysis of bearings used in the key components in the sport field.

This paper is organized as follows. In Section 2, the proposed signal processing and feature extraction procedure are explained. Section 3 is dedicated to the structure and design configuration of the neural network. In Section 4, the rotation system is introduced. In Section 5, the implementation of the proposed method on the studied system is investigated, and the results are discussed. The conclusion is presented in Section 6.

2. Feature Extraction Using Multivariate EMD

2.1. Fundamentals of Multivariate EMD

In standard EMD [27], the local mean can be calculated by interpolating the upper and lower envelope of a univariate signal. However, when dealing with multivariate signals, it is rather confusing to determine IMFs, because the value of local minima and maxima cannot be directly defined. Rehman and Mandic [37] introduce a multivariate EMD algorithm to overcome these issues. In this method, multivariate (n-variate) signals are considered as n-dimensional time series. Some appropriate direction vectors are chosen, and multivariate signals are projected on the selected direction vectors. All envelopes of these projected signals are calculated, and by averaging the envelopes, the local mean of the multivariate signal is determined. Therefore, the sifting process [31] (which is used in standard EMD) can be implemented to calculate IMF groups.

The process of local mean calculation can be considered as an approximation of the integral of all envelopes along with the multiple directions in the n-dimensional space. The accuracy of this calculation depends on the uniformity of the chosen direction vectors. To generate a set of uniformly distributed points, quasi-Monte Carlo-based low-discrepancy sequences can be utilized. The Halton sequence family is exploited as a convenient way to generate a low-discrepancy sequence.

Let x1,…, xn be the first n prime numbers, and the ith sample of a one-dimensional Halton sequence, denoted by rix, is given by

rix=a0x+a1x2++asxs+1, (1)

where the base-x representation of i is given by

i=a0+a1x++asxs. (2)

Starting from i=0, the ith sample of Halton sequence then becomes

rix1,rix2,,rixn. (3)

The Hammersley sequence is used when the total number of samples, n, is known a priori; in this case, the ith sample within the Hammersley sequence is calculated as

in,rix1,rix2,,rixn1. (4)

By using Halton and Hammersley sequences, a suitable set of direction vectors on the n-sphere is generated. Henceforth, projections of signals on this direction vector will be calculated. In the following paragraph, multivariate EMD will be explained briefly.

Let X(t)=[x1(t), x2(t),…, xn(t)] be an n-dimensional signal and Dk={d1k, d2k,…, dnk} correspond to the kth direction vector in a direction set D. The multivariate EMD algorithm is described as follows:

  • (1)

    Choose a suitable set of direction vectors, D.

  • (2)

    Calculate the kth projection, pk(t) of X along the kth direction, where k=1,2,…, K and K is a total number of direction vectors.

  • (3)

    Find the time instants, tik, corresponding to the maxima of projected signals.

  • (4)

    Interpolate [tik, X(tik)] to determine multidimensional envelopes, Ek(t).

  • (5)
    Calculate the mean by
    Mt=1lk=0KEkt. (5)
  • (6)

    Calculate the residual component R(t) = X(t) − M(t). If D(t) satisfies the stopping criterion explained in the previous section, then consider R(t) as an IMF and then repeat the algorithm until it meets the criterion.

2.2. Effect of Noise on IMFs

EMD method is sensitive to noise. In [44], an investigation is conducted on the sensitivity of MEMD to noise. It can be inferred from this study that the MEMD algorithm is sensitive to noise and mode-mixing problems that can happen in this method. An extension to MEMD is proposed to cover the problem. The extension is named noise-assisted multivariate empirical mode decomposition (NA-MEMD). NA-MEMD algorithm tries to eliminate noise interference in EEMD and reduce mode mixing in EMD and MEMD methods. The general algorithm in NA-MEMD is the same as in MEMD. The difference is that the input multivariate signal consists of input data and noise in separate channels. After the implementation of the MEMD algorithm on the new multivariate signal, the resulting noise-related IMFs will be discarded. This method is demonstrated briefly as follows:

  1. Construct l-channel of uncorrelated Gaussian white noise time series which have the same length as that of the input (l ≥ 1).

  2. Add noise channels, created in the previous step, to the input signals; therefore, the new input signal is (n + l)-channel.

  3. Process the (n + l)-channel multivariate signal using MEMD algorithm to obtain IMFs.

  4. Discard l-channels corresponding to the noise from (n + l)-variate IMFs and get n-channel IMFs corresponding to the original signal.

2.3. The Criterion for Choosing IMFs

To extract fault features from the signal, suitable IMFs must be selected. A suitable IMF is an IMF which has a meaningful frequency scale. The choice of IMF is usually based on experience and is done manually. This process is slow and time-consuming. To make this procedure faster and relatively automatic, an index or coefficient is needed to be introduced. One way to determine the suitability of an IMF is to calculate the correlation between the IMF and the original signal [45]. The IMF, which has a small correlation coefficient, is regarded as a redundant or noise component. With the help of the correlation coefficient, it is possible to accurately determine and eliminate the noise component and evaluate the effective IMFs to extract fault features from them.

In dealing with the MEMD algorithm, the resulting IMFs are a set of IMF groups, and some calculation must be done to identify the effective IMFs. Hence, a fault correlation factor (FCF) has been proposed [33] to conduct the analysis. Suppose that the input signal is n-variate signal and there exist n groups for mth IMFs corresponding to each signal. The multivariate signal can be organized as a matrix as follows:

St=S1t,S2t,,Snt. (6)

The kth IMF on n groups corresponds to each input signal and constitutes a matrix in the form of

Ct=c1kt,c2kt,,cnkt. (7)

A simplified form of the correlation coefficient is as follows:

λxy=n=1Nxtctn=1Nx2tn=1Nc2t, (8)

where t is the time and N is the total number of sampling points. λik is defined as the FCF of ith IMF of C(t) (7) and can be calculated by conducting correlation analysis on this IMF with each n-variate signal, respectively, and averaging all correlation factors. λik indicates the correlation between this IMF and the original signal. To make a comparison between each order of IMFs, the FCF of IMFs with the same order must be calculated. It can be achieved by averaging all vector correlations since each order of IMFs contains almost the same features.

λk=i=1nλikn. (9)

When the value of λk is large, it means that the degree of correlation of the fault characteristic between the kth order IMF of the n IMF groups and the original signal is higher. Based on the criterion of Pearson Correlation Coefficients, when the value of the correlation coefficient is higher than 0.3, it can be assumed that the signals are relevant. Therefore with this approach, effective IMFs can be determined.

2.4. Feature Selection

The idea of extracting features for the diagnosis of rotating machinery faults is a critical task. Features must be selected wisely, because some features may be futile in extracting fault characteristics of a signal, although these parameters are useful for other vibration signals. To choose the most effective features, a scientific criterion, which relates the features to the system condition, can be used. To achieve this purpose, in this paper, a discrimination criterion, denoted as J, is applied [46]. This criterion is based on the ratio between inter- and intra-variance. Suppose N features are extracted for a vibration signal with K class of system conditions. If rk,n is the nth feature of the kth class, the intraclass and interclass variance matrix of the average dispersion coefficients are given as follows:

Sintra=1K×Nk=1Kn=1Nr¯k,nμ¯kr¯k,nμ¯kt,Sinter=1Kk=1Kn=1Nr¯kμ¯cr¯kμ¯ct, (10)

while the mean of feature vectors of the kth class is defined by μ¯k=1/Nn=1Nr¯k,n and the total mean of feature vectors of all classes is μ¯c=1/Nk=1Kμ¯k.

Finally, J is computed as follows:

J=traceSintra1Sinter. (11)

According to the criterion, for the features with a high value of J, the effect of the corresponding feature on the diagnosis of a specific fault becomes greater.

2.5. Traditional Features

Traditional fault features are simple and can easily be implemented in signals [47]. In Table 1, some of these traditional features are represented in the frequency and time domain. When a fault occurs in the rotating machinery, the time-domain signal may change both its amplitude and distribution. Moreover, the frequency spectrum may encounter some deviation from the normal condition. Usually, with the help of these features, some faults can be determined in the system.Note. xn is vibration signal with n=1,…, N; N is the number of data points; sk is the frequency spectrum of xn; K is number of spectral lines; and fk is frequency value of kth spectral line.

Table 1.

Traditional feature set parameters.

Time-domain features Frequency-domain features
Root mean square pt1=1/Nn=1Nxn2 Frequency barycenter pf 1=(∑k=1Kfksk/∑k=1Ksk)
Peak pt 2=max(|xn|) Root mean square frequency pf2=k=1Kfk2sk/k=1Ksk
Square mean root pt3=1/Nn=1Nxn2 Standard deviation frequency pf3=k=1Kfkpf12sk/k=1Ksk
Absolute mean pt 4=(1/N)∑n=1N(|xn|) Frequency spectrum mean pf 4=(1/N)∑k=1Ksk
Kurtosis pt 5=(1/N)∑n=1Nxn4 Frequency spectrum deviation pf 5=(1/K − 1)∑k=1K(skpf4)2
Crest factor pt 6=(pt1)/(pt2) Frequency spectrum entropy pf6=k=1Ksk/Kpf4logsk/Kpf4

2.6. Feature Extraction from Decomposed IMFs

In addition to the traditional features mentioned earlier, the MEMD algorithm is used to extract some other features to form a more reliable and almost more robust feature vector.

Standard EMD is designed to process univariate signals. When signals from multiple sensors (or conditions) are individually processed by the EMD algorithm, there might be two main drawbacks in the results. The first drawback is the loss of joint information. The main reason for collecting information from multiple sensors (or conditions) is to have a more comprehensive understanding of the system. By implementation of EMD algorithm individually on each signal, the idea of multiple sensors would be vain. The second drawback is about the features of the same order of IMFs in each signal. IMFs in the same order corresponding to each signal that resulted from the EMD algorithm may have different features [34]. This makes it difficult to determine the effective IMFs.

MEMD algorithm overcomes these two problems. The IMFs, resulting from the MEMD algorithm, not only contain comprehensive information about the system, but also, in the same order of IMFs, almost consist of the same feature information. These two advantages of MEMD, in addition to the benefits of the EMD method, make this algorithm an ideal choice for extracting features contributing to multivariate signals.

As was mentioned before, each order of IMFs calculated by noise-assisted MEMD contains a small frequency scale. This characteristic paves the way for analysis and feature extraction in the frequency domain for each order of IMFs. When a fault occurs in a rotating component of a system, a natural frequency (or meshing frequency for contacting components, e.g., gearboxes) is excited, which results in a burst of energy at this frequency. To identify the fault, it is necessary to detect the frequency occurrence of these high-energy bursts. Since each IMF order is composed of a small range of frequencies, by performing frequency-domain analysis, the amplitude of the signal in characteristic frequencies can be determined. FCF is a suitable index to eliminate redundant IMFs or specifically redundant frequency bands. This amplitude can be regarded as a fault feature for implementation in smart analysis.

To clarify what was mentioned above, the procedure is implemented on the synthetic signal. The multivariate synthetic signal is given as follows:

x1=sin2πf1t+0.5  cos2πf2t+0.9  sin2πf3t,x2=0.7  sin2πf1t+cos2πf2t+0.4  cos2πf2t,x3=0.9  sin2πf1t+0.6  cos2πf2t+cos2πf2t, (12)

where f1=20 Hz, f2=50 Hz, and f3=90 Hz. The sampling point is N=1000, and the sampling frequency is fs=1000 Hz. White Gaussian noise is added to each signal. Noise signals are white Gaussian signals and the corresponding power is −10 dBW.

Since noise is added to the multivariate signal, to prevent the phenomenon of mode mixing, the NA-MEMD algorithm is implemented. Figure 1 shows the calculated IMFs by using NA-MEMD. From this figure, it is verified that each order of IMFs has the same frequency characteristics. IMF3 to IMF5 consist of the main frequencies of component signals. The remaining IMFs are redundant ones, either high-frequency IMFs which are regarded as noise or low-frequency IMFs which are due to the stopping criterion and do not have physical meaning.

Figure 1.

Figure 1

Decomposition results by using NA-MEMD on the synthetic multivariate signal.

FCF is used to determine effective IMFs and to detect which IMFs contain frequency features. In Table 2 the calculated results for FCF are shown. According to the criterion of Pearson Correlation Coefficients, since IMF3 to IMF5 have FCF higher than 0.3, they can be assumed to be relevant IMFs, which is acceptable for the manual estimation of these IMFs. Therefore, the process of selecting suitable IMFs converts to a relatively automatic procedure.

Table 2.

Fault correlation factor for synthetic multivariate signal.

IMF order 1 2 3 4 5 6 7 8 9 10
FCF 0.2131 0.1965 0.6491 0.4125 0.6600 0.0224 0.0208 0.0366 0.0376 0.0191

The amplitude of frequency spectrum of IMFs in characteristic frequencies is an ideal feature for fault detection of rotating machinery. In the dominant IMFs in the studied synthetic signal, there exist peaks in the propinquity of characteristic frequencies. The amplitude of these peaks is going to be used as a feature for the input of an artificial neural network, because the amplitude of these peaks changes when the system operates under different conditions. Therefore, this characteristic can make a distinction for different health conditions in the system.

To have accurate and reliable results from the neural network, features as the input of the neural network must contain the detailed information of the studied system. Vibration signals from rotating machinery are usually nonlinear and nonstationary. This specification of vibration signal, which changes the energy of the signal, is in some frequency bands. IMF components contain information corresponding to a frequency band; thus, the IMF energy can be used to characterize a signal. Instead of using energy [42] or the energy entropy of the signal [16], the energy moment [43] is used as part of the proposed characteristic vector. In this method, the time feature is used for the calculation of energy; thus, it can be a complementary feature extraction method in addition to the proposed frequency-domain method. The energy moment can distinguish signal features more accurately compared to the classical energy method when the signal is nonlinear or nonstationary, which will be explained in the following paragraphs.

The energy moment for each IMF can be calculated as

Ei=t.cit2dt, (13)

and for continuous calculation and discrete analysis,

Ei=k=1nkΔtcik.Δt2, (14)

where n is the total number of sampling points, Δt is the period of samples, and k is the number of the sample points.

Energy moment can form a feature vector as follows:

T=E1,E2,,En. (15)

Because the energy moment has a high value, T can be adjusted using normalization. Assume E=∑i=1nEi; then,

Tn=E1E,E2E,,EnE, (16)

where Ti is normalized energy moment for signal ci. As is clear from (13) and (14), the moment energy contains both the signal energy and the signal distribution in the time domain (because of the term t in the equations), indicating the advantage of the moment energy over the calculation of the classical energy [43].

3. Neural Network Structure

A BP neural network is designed to intelligently diagnose faults in rotating machinery. To do so, a neural model of BP must be structured. A typical BP neural network structure is illustrated in Figure 2. This network has one hidden layer. In the field of rotating machinery fault detection, the input layer contains features extracted from the original signal, and the output layer is the system health conditions (i.e., being healthy or having a specific fault type).

Figure 2.

Figure 2

A typical BP neural network.

The number of hidden layer cells cannot be defined accurately. If the hidden layer nodes are too high, the connection between nodes increases, and as a result, the number of connection weights increases, making the neural network training process more complex. If the hidden layer nodes are too small, the accuracy of the output results cannot be guaranteed. For a three-layer network (one hidden layer), there is an empirical and experimental relationship that relates the number of hidden layer nodes k to the number of input layer nodes n [43]. The relationship is given as follows:

k=2n+i,0i8. (17)

Note that even in this relationship, k is not definite and can be changed.

In Figure 3, an overview of smart fault detection of rotating machinery is illustrated schematically.

Figure 3.

Figure 3

Intelligent fault detection flowchart.

4. System Description

To explain the proposed method, this paper investigates the transmission system in the wind turbine system (gearbox and bearing), as a rotating machinery. The vibration data from the experiment were provided by the National Renewable Energy Laboratory (NREL). The system is depicted in Figure 4. As is indicated in the figure, the main sections rotate at the three speed stages, i.e., the low-speed stage (LSS), the intermediate-speed stage (ISS), and the high-speed stage (HSS). The test drive is designed for the wind turbine with rated power of 750 kW. The overall ratio for the gearbox system is 1 : 81.491. In Table 3, more details on the description of the gearbox are shown [48].

Figure 4.

Figure 4

Wind turbine planetary gearbox system (courtesy of NREL).

Table 3.

Dimensions and mechanical details of the gear element [48].

Gear Elements No. of teeth Mate teeth Root diameter (mm) Helix angle Face width (mm) Ratio
Ring gear 99 39 1047 7.5 L 230
Planet gear 39 99 372 7.5 L 227.5
Sun gear 21 39 186 7.5 R 220 5.71
Intermediate gear 82 23 678 14 R 170
Intermediate pinion 23 82 174 14 L 186 3.57
High-speed gear 88 22 440 14 L 110
High-speed pinion 22 88 100 14 R 120 4.0
Overall: 81.49

To obtain vibration data from the gearbox system, accelerometers are mounted on the top of the gearbox. Data are collected at a rate of 40 kHz per channel using a National Instruments PXI-4472B high-speed data acquisition system (DAQ). Eight sensors are located in different places of the system to obtain comprehensive information from the gearbox system.

As was mentioned in the previous section, the feature vectors contain some components which are related to the amplitude of the frequency spectrum with the characteristic frequency. Characteristic frequency encompasses not only the rotating frequencies of the components but also the meshing frequencies of linked components. The studied system in faulty condition corresponds to three major fault types. The formulation for the calculation of the main characteristic frequencies is briefly illustrated in Table 4.

Table 4.

Characteristic frequencies formulations.

Component Characteristic frequency Formulation
Fixed-axis gearbox Meshing frequency f m =f1N1=f2N2

Planetary stage Planet frequency [49] f p =((NpNR)Ns/(NR+Ns)Np)fs
Carrier frequency [49] f c =(Ns/NR+Ns)fs
Meshing frequency [49] f mp=(fsfc)Ns=(NRNS/NR+NS)fs

Bearing Ball pass frequency, outer race [50] BPFO=(nfr/2){1 − (d/D)cosϕ}
Ball pass frequency, inner race [50] BPFI=(nfr/2){1+(d/D)cosϕ}
Fundamental train frequency (cage speed) [50] FTF=(fr/2){1 − d/Dcosϕ}
Ball (roller) spin frequency [50] BSF(RSF)=(D/2  d){1 − ((d/D)cosϕ)}2

For the gearbox of fixed axis, f1, f2, N1, and N2 are the frequency of the pinion, the frequency of the gear, the number of teeth in the pinion, and the number of teeth in the gear, respectively. For the planetary stage, fs, Ns, NR, and Np are the sun frequency, the number of suns, the ring gear, and the teeth of the planet, respectively. For the bearing, fr, n, ϕ, d, and D are the shaft speed, the number of rolling elements, the angle of the load from the radial plane, the rolling element diameter, and the bearing average diameter, respectively. In Figure 5, the main dominant characteristic frequencies are shown schematically. These frequencies are high-speed shaft (HSS) frequency, planetary gear mesh frequency (PLTGM), high-speed shaft bearing B (Figure 4), high-speed shaft gear mesh (HSGM), and its second and third harmonics.

Figure 5.

Figure 5

The characteristic frequency of the gearbox.

5. Method Implementation on System and Discussion

5.1. Feature Extraction for the System

As was mentioned, the input layer in the neural network is a vector constructed from fault features. Some elements are composed of normalized energy moments. First, a windowing process is implemented on the input signal to construct as many signals as possible for the input of the NA-MEMD algorithm as the input of the neural network. The signals provided by NREL are made up of 10 signals of 60 s duration. Each signal contains 2400000 data samples. A window is a section of each signal with 240000 data samples without overlapping which divides the corresponding signal into 10 subsignals. Thus, for each condition of the system, 100 features can be constructed. Windowing increases the feature vectors, increasing the accuracy of neural network operation. Subsections are now considered as input to the MEMD algorithm to obtain IMFs. To avoid the mode-mixing phenomenon, NA-MEMD is used instead of the MEMD algorithm. 3 white Gaussian noises with a variance of 0.1 are added as 3 new channels to the multivariate input signal. In Figure 6, the resulting IMFs for one channel of the multivariate faulty signal are shown. 20 IMFs are extracted from the NA-MEMD while some of them are spurious and must be omitted from the consideration. In Table 5, FCF values calculated for the IMFs are shown. IMFs of orders three to eight have an FCF higher than 0.3; thus, these IMF groups are considered as effective IMFs for the calculation of energy moment.

Figure 6.

Figure 6

Decomposition results by using NA-MEMD on the synthetic multivariate signal.

Table 5.

Correlation coefficient factor for each IMF.

IMF order/FCF
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
0.1084 0.2254 0.6318 0.8279 0.4050 0.4152 0.3245 0.3014 0.0214 0.0212 0.0368 0.0039 0.0006 0.0001 0.0002 0.0002 0.0001 0.0001 0.0001 0.0002

The feature selection algorithm is applied to the proposed features. For the system studied, two classes of system conditions are considered (K=2) and 30 characteristics are extracted (N = 30). The resultant discrimination criterion is shown in Table 6. According to the table, the values of J for most of the MEMD characteristics are greater than the traditional characteristics except for the value of pt1 (that is, the root mean square). This shows that the proposed features can be suitable for detecting faults in the wind turbine gearbox studied. Therefore, the feature vector can be constructed as follows:

F=E3,E4,E5,E6,E7,E8,AF6,AF8,AF11, (18)

where Ei′ and AFi are normalized energy moment and amplitude factor (AF) for the ith IMF order, respectively. It should be noted that these features are selected based on the studied dataset; however, the feature selection practice for all similar datasets is the same. It means the features with highest FCF value should be selected for the input of any machine learning method.

Table 6.

Discrimination criterion for the proposed features.

Feature AF 6 AF 8 E 3 E 4 pt 1 E 5 E 6 E 7 AF 11 E 8 pt i , pf1, pfi
J criterion's value 1.932 1.812 1.720 1.600 1.541 1.021 0.952 0.741 0.603 0.402 ≤0.1

In Table 7 a feature vector as a sample is depicted. It can be seen from the table that the input vector is composed of nine features. Although the output layer of the neural network contains two conditions (i.e., healthy and faulty), it is worth noting that the faulty condition encompasses three different faults. Since the data provided consist of two conditions, inevitably two output conditions are chosen for the neural network. The trend of fault feature vectors is constructed to detect faults individually. However, in this paper, because of the limitation of data, faults are detected simultaneously in one condition label.

Table 7.

Neural network input and output vector.

Feature vector System condition
[0.0826, 07469,0.0877, 0.0200, 0.0261, 0.0367, 0.0477, 0.0409, 0.0535] Healthy
[0.2125, 0.5985, 0.0347, 0.0664, 0.0052, 0.0826, 0.5308, 0.7147, 0.0659] Faulty

5.2. The Design Neural Network for the System

The main step in designing a neural network is to train the network based on the training samples. As mentioned earlier, the feature vectors in the input layer contain nine components. The number of nodes in the output layer is considered to be 2, corresponding to two conditions of system. 100 feature vectors are constructed for each system condition. 80% of data are considered as the training data and 20% as the testing data. In this study, a three-layer neural network is constructed for the intelligent fault diagnosis procedure. Therefore, according to (17) and network training conditions, the number of hidden layer nodes (k) is 18–26. In this study, k=18, since the differences between the output errors for the different values of k are in the same order (1e -5) while the learning rate is 0.001. The training function is TRAINLM to update the weight and bias values based on the Levenberg–Marquardt optimization method, and the activation function between the hidden layer and output layer is Sigmoid function. Note that the setting is similar for all experiments and all experiments have been done using MATLAB platform. Diagnosis rate for both training data and test data is 100%. This shows that the features and the network configuration are successfully selected.

6. Conclusion

In this paper, the MEMD algorithm is applied for extracting features from rotating machinery. To investigate the capacity of the proposed method, vibration signals from a wind turbine gearbox system as a rotating machinery system are utilized. When the rotating system is complex and consists of many faults, multiple sensors are exploited to obtain comprehensive information from the system. MEMD algorithm has the advantage of dealing with multivariate signals simultaneously. Usually, when the system is nonstationary and there are nonlinearity and multiple faults, using traditional features may be abortive. Features derived from the MEMD algorithm are based on the time and frequency domain, which compensate for the problem of using traditional features. To validate the effectiveness of the proposed features, a discrimination criterion is introduced. This criterion is based on the relativity of features to the fault classes.

The basic MEMD algorithm is sensitive to noise. In this study, an extension of MEMD called NA MEMD is implemented on multivariate signals to overcome the noise sensitivity of MEMD. MEMD algorithm decomposes signals into some signals named IMFs. Some of these IMFs are spurious and need to be eliminated from the calculation. A correlation factor is introduced to achieve this purpose. With the help of this factor, the number of redundant features is reduced. Two types of features are extracted from the IMFs. From the point of view of time-domain analysis, the energy moment of IMFs is a suitable feature, since it contains the time characteristics of signals. Therefore, this can be helpful when the signal is nonstationary. The other feature is in the frequency domain, and it relates to the amplitude of frequency spectrum in the characteristic frequencies. Because each IMF order encompasses a small frequency range, frequency analysis of IMFs is an effective way of highlighting characteristics.

Based on the results, designing a neural network using the proposed features yields acceptable output results. The network is successfully trained using the training data, and the diagnostic rate is 100% not only for the training data, but also for the test data. It should be mentioned that the proposed algorithm is applied to real experimental data; however, by increasing the number of classes, the performance may decrease.

It should be noted that intelligent feature extraction using the proposed NA-MEMD method provides comprehensive information on the health status of the system. The proposed methodology gives higher explainability of the features compared to other similar methods. However, recently, deep learning-based methods have been successfully implemented in industrial datasets to automatically extract features. In spite of the effectiveness of these methods, they require high computation resources compared with the proposed method.

Acknowledgments

This study was funded by the Key R&D Plan of China for Winter Olympics (2021YFF0306401) and the Key Special Project of the National Key Research and Development Program “Technical Winter Olympics” (2018YFF0300502 and 2021YFF0306400).

Contributor Information

Le Fa Zhao, Email: larry2012@syty.edu.cn.

Tian Yu Zhao, Email: zhaotianyu@mail.neu.edu.cn.

Data Availability

The NREL wind turbine data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  • 1.Gorecky D., Schmitt M., Loskyll M., Zühlke D. Human-machine-interaction in the industry 4.0 era. Proceedings of the 2014 12th IEEE International Conference on Industrial Informatics; 2014; Porto Alegre, Brazil. pp. 289–294. [DOI] [Google Scholar]
  • 2.Zhang W., Li X., Ma H., Luo Z., Li X. Transfer learning using deep representation regularization in remaining useful life prediction across operating conditions. Reliability Engineering & System Safety . 2021;211:p. 107556. doi: 10.1016/j.ress.2021.107556. [DOI] [Google Scholar]
  • 3.Mousavi M., Alzgool M., Towfighian S. Autonomous shock sensing using bi-stable triboelectric generators and mems electrostatic levitation actuators. Smart Materials and Structures . 2021;30(6) doi: 10.1088/1361-665x/abf72c.065019 [DOI] [Google Scholar]
  • 4.Zhang W., Li X. Federated Transfer Learning for Intelligent Fault Diagnostics Using Deep Adversarial Networks with Data Privacy . IEEE/ASME Transactions on Mechatronics; 2021. [Google Scholar]
  • 5.Jebellat I., Pishkenari H. N., Jebellat E. Training microrobots via reinforcement learning and a novel coding method. Proceedings of the 2021 9th RSI International Conference on Robotics and Mechatronics (ICRoM); 2021; Tehran, Iran. IEEE; pp. 105–111. [DOI] [Google Scholar]
  • 6.Zhang W., Li X., Ma H., Luo Z., Li X. Open-set domain adaptation in machinery fault diagnostics using instance-level weighted adversarial learning. IEEE Transactions on Industrial Informatics . 2021;17(11):7445–7455. doi: 10.1109/tii.2021.3054651. [DOI] [Google Scholar]
  • 7.Siahpour S., Li X., Lee J. A novel transfer learning approach in remaining useful life prediction for incomplete dataset. IEEE Transactions on Instrumentation and Measurement . 2022;71 doi: 10.1109/tim.2022.3162283. [DOI] [Google Scholar]
  • 8.Li X., Zhang W., Ma H., Luo Z., Li X. Degradation Alignment in Remaining Useful Life Prediction Using Deep Cycle-Consistent Learning . IEEE Transactions on Neural Networks and Learning Systems; 2021. [DOI] [PubMed] [Google Scholar]
  • 9.Fazlollahi V., Shirazi F. A., Taghizadeh M., Siahpour S. Robust wake steering control design in a wind farm for power optimisation using adaptive learning game theory (algt) method. International Journal of Control . 2021:1–17. doi: 10.1080/00207179.2021.2009558. [DOI] [Google Scholar]
  • 10.Ainapure A., Siahpour S., Li X., Majid F., Lee J. Intelligent robust cross-domain fault diagnostic method for rotating machines using noisy condition labels. Mathematics . 2022;10(3):p. 455. doi: 10.3390/math10030455. [DOI] [Google Scholar]
  • 11.Zhang W., Li X., Ma H., Luo Z., Li X. Universal domain adaptation in fault diagnostics with hybrid weighted deep adversarial learning. IEEE Transactions on Industrial Informatics . 2021;17(12):7957–7967. doi: 10.1109/tii.2021.3064377. [DOI] [Google Scholar]
  • 12.Siahpour S., Khakiani F. N., Fazlollahi V., Golozar A., Shirazi F. A. Morphing omni-directional panel mechanism: a novel active roof design for improving the performance of the wind delivery system. Energy . 2021;217:p. 119400. doi: 10.1016/j.energy.2020.119400. [DOI] [Google Scholar]
  • 13.Shen C., Wang D., Kong F., Tse P. W. Fault diagnosis of rotating machinery based on the statistical parameters of wavelet packet paving and a generic support vector regressive classifier. Measurement . 2013;46(4):1551–1564. doi: 10.1016/j.measurement.2012.12.011. [DOI] [Google Scholar]
  • 14.Li B., Zhang Y. Supervised locally linear embedding projection (sllep) for machinery fault diagnosis. Mechanical Systems and Signal Processing . 2011;25(8):3125–3134. doi: 10.1016/j.ymssp.2011.05.001. [DOI] [Google Scholar]
  • 15.Lei Y., He Z., Zi Y. A new approach to intelligent fault diagnosis of rotating machinery. Expert Systems with Applications . 2008;35(4):1593–1600. doi: 10.1016/j.eswa.2007.08.072. [DOI] [Google Scholar]
  • 16.Ben Ali J., Fnaiech N., Saidi L., Chebel-Morello B., Fnaiech F. Application of empirical mode decomposition and artificial neural network for automatic bearing fault diagnosis based on vibration signals. Applied Acoustics . 2015;89:16–27. doi: 10.1016/j.apacoust.2014.08.016. [DOI] [Google Scholar]
  • 17.de Azevedo H. D. M., de Arruda Filho P. H. C., Araújo A. M., Bouchonneau N., Rohatgi J. S., de Souza R. M. C. Vibration monitoring, fault detection, and bearings replacement of a real wind turbine. Journal of the Brazilian Society of Mechanical Sciences and Engineering . 2017;39(10):3837–3848. doi: 10.1007/s40430-017-0853-2. [DOI] [Google Scholar]
  • 18.Barszcz T., Randall R. B. Application of spectral kurtosis for detection of a tooth crack in the planetary gear of a wind turbine. Mechanical Systems and Signal Processing . 2009;23(4):1352–1365. doi: 10.1016/j.ymssp.2008.07.019. [DOI] [Google Scholar]
  • 19.Igba J., Alemzadeh K., Durugbo C., Eiriksson E. T. Analysing rms and peak values of vibration signals for condition monitoring of wind turbine gearboxes. Renewable Energy . 2016;91:90–106. doi: 10.1016/j.renene.2016.01.006. [DOI] [Google Scholar]
  • 20.Inalpolat M., Kahraman A. A theoretical and experimental investigation of modulation sidebands of planetary gear sets. Journal of Sound and Vibration . 2009;323(3-5):677–696. doi: 10.1016/j.jsv.2009.01.004. [DOI] [Google Scholar]
  • 21.Badaoui M. E., Guillet F., Danière J. New applications of the real cepstrum to gear signals, including definition of a robust fault indicator. Mechanical Systems and Signal Processing . 2004;18(5):1031–1046. doi: 10.1016/j.ymssp.2004.01.005. [DOI] [Google Scholar]
  • 22.Li C. J., Wu S. On-line detection of localized defects in bearings by pattern recognition analysis. Journal of Manufacturing Science and Engineering disseminates . 1989 [Google Scholar]
  • 23.Satish L. Short-time fourier and wavelet transforms for fault detection in power transformers during impulse tests. IEE Proceedings - Science, Measurement and Technology . 1998;145(2):77–84. doi: 10.1049/ip-smt:19981576. [DOI] [Google Scholar]
  • 24.Lin J., Qu L. Feature extraction based on morlet wavelet and its application for mechanical fault diagnosis. Journal of Sound and Vibration . 2000;234(1):135–148. doi: 10.1006/jsvi.2000.2864. [DOI] [Google Scholar]
  • 25.Fan X., Zuo M. J. Machine fault feature extraction based on intrinsic mode functions. Measurement Science and Technology . 2008;19(4) doi: 10.1088/0957-0233/19/4/045105.045105 [DOI] [Google Scholar]
  • 26.Staszewski W. J., Worden K., Tomlinson G. R. Time-frequency analysis in gearbox fault detection using the wigner-ville distribution and pattern recognition. Mechanical Systems and Signal Processing . 1997;11(5):673–692. doi: 10.1006/mssp.1997.0102. [DOI] [Google Scholar]
  • 27.Huang N. E., Shen Z., Long S. R., et al. The empirical mode decomposition and the hilbert spectrum for nonlinear and non-stationary time series analysis. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences . 1998;454:903–995. [Google Scholar]
  • 28.Yu D., Cheng J., Yang Y. Application of emd method and hilbert spectrum to the fault diagnosis of roller bearings. Mechanical Systems and Signal Processing . 2005;19(2):259–270. doi: 10.1016/s0888-3270(03)00099-2. [DOI] [Google Scholar]
  • 29.Flandrin P., Rilling G., Goncalves P. Empirical mode decomposition as a filter bank. IEEE Signal Processing Letters . 2004;11(2):112–114. doi: 10.1109/lsp.2003.821662. [DOI] [Google Scholar]
  • 30.Fan X., Zuo M. J. Gearbox fault detection using empirical mode decomposition. ASME International Mechanical Engineering Congress and Exposition . 2004;47160:37–45. doi: 10.1115/imece2004-59349. [DOI] [Google Scholar]
  • 31.Antoniadou I., Manson G., Staszewski W. J., Barszcz T., Worden K. A time-frequency analysis approach for condition monitoring of a wind turbine gearbox under varying load conditions. Mechanical Systems and Signal Processing . 2015;64-65:188–216. doi: 10.1016/j.ymssp.2015.03.003. [DOI] [Google Scholar]
  • 32.Wu Z., Huang N. E. Ensemble empirical mode decomposition: a noise-assisted data analysis method. Advances in Adaptive Data Analysis . 2009;01(01):1–41. doi: 10.1142/s1793536909000047. [DOI] [Google Scholar]
  • 33.Lv Y., Yuan R., Song G. Multivariate empirical mode decomposition and its application to fault diagnosis of rolling bearing. Mechanical Systems and Signal Processing . 2016;81:219–234. doi: 10.1016/j.ymssp.2016.03.010. [DOI] [Google Scholar]
  • 34.Looney D., Mandic D. P. Multiscale image fusion using complex extensions of emd. IEEE Transactions on Signal Processing . 2009;57(4):1626–1630. doi: 10.1109/tsp.2008.2011836. [DOI] [Google Scholar]
  • 35.Rilling G., Flandrin P., Gonçalves P., Lilly J. M. Bivariate empirical mode decomposition. IEEE Signal Processing Letters . 2007;14(12):936–939. doi: 10.1109/lsp.2007.904710. [DOI] [Google Scholar]
  • 36.ur Rehman N., Mandic D. P. Empirical mode decomposition for trivariate signals. IEEE Transactions on Signal Processing . 2009;58(3):1059–1068. [Google Scholar]
  • 37.Rehman N., Mandic D. P. Multivariate empirical mode decomposition. Proceedings of the Royal Society A: Mathematical, Physical & Engineering Sciences . 2010;466(2117):1291–1302. doi: 10.1098/rspa.2009.0502. [DOI] [Google Scholar]
  • 38.Zhao X., Patel T. H., Zuo M. J. Multivariate emd and full spectrum based condition monitoring for rotating machinery. Mechanical Systems and Signal Processing . 2012;27:712–728. doi: 10.1016/j.ymssp.2011.08.001. [DOI] [Google Scholar]
  • 39.Ricci R., Pennacchi P. Diagnostics of gear faults based on emd and automatic selection of intrinsic mode functions. Mechanical Systems and Signal Processing . 2011;25(3):821–838. doi: 10.1016/j.ymssp.2010.10.002. [DOI] [Google Scholar]
  • 40.Siahpour S., Li X., Lee J. Deep learning-based cross-sensor domain adaptation for fault diagnosis of electro-mechanical actuators. International Journal of Dynamics and Control . 2020;8(4):1054–1062. doi: 10.1007/s40435-020-00669-0. [DOI] [Google Scholar]
  • 41.Vakharia V., Gupta V. K., Kankar P. K. Efficient fault diagnosis of ball bearing using relieff and random forest classifier. Journal of the Brazilian Society of Mechanical Sciences and Engineering . 2017;39(8):2969–2982. doi: 10.1007/s40430-017-0717-9. [DOI] [Google Scholar]
  • 42.Yu Y., Dejie Y., Junsheng C. A roller bearing fault diagnosis method based on emd energy entropy and ann. Journal of Sound and Vibration . 2006;294(1-2):269–277. doi: 10.1016/j.jsv.2005.11.002. [DOI] [Google Scholar]
  • 43.Bin G. F., Gao J. J., Li X. J., Dhillon B. S. Early fault diagnosis of rotating machinery based on wavelet packets-Empirical mode decomposition feature extraction and neural network. Mechanical Systems and Signal Processing . 2012;27:696–711. doi: 10.1016/j.ymssp.2011.08.002. [DOI] [Google Scholar]
  • 44.ur Rehman N., Park C., Huang N. E., Mandic D. P. Emd via memd: multivariate noise-aided computation of standard emd. Advances in Adaptive Data Analysis . 2013;05(02) doi: 10.1142/s1793536913500076.1350007 [DOI] [Google Scholar]
  • 45.Qu G., Hariri S., Yousif M. A new dependency and correlation analysis for features. IEEE Transactions on Knowledge and Data Engineering . 2005;17(9):1199–1207. doi: 10.1109/tkde.2005.136. [DOI] [Google Scholar]
  • 46.Tlig L., Sayadi M., Fnaiech F. A new fuzzy segmentation approach based on s-fcm type 2 using lbp-gco features. Signal Processing: Image Communication . 2012;27(6):694–708. doi: 10.1016/j.image.2012.03.001. [DOI] [Google Scholar]
  • 47.Yuan H., Wang X., Sun X., Ju Z. Compressive sensing-based feature extraction for bearing fault diagnosis using a heuristic neural network. Measurement Science and Technology . 2017;28(6) doi: 10.1088/1361-6501/aa6a07.065018 [DOI] [Google Scholar]
  • 48.Sheng S. Wind Turbine Gearbox Condition Monitoring Round Robin Study-Vibration Analysis . Golden, CO (United States): National Renewable Energy Lab.(NREL); 2012. [DOI] [Google Scholar]
  • 49.Mabie H. H., Reinholtz C. F. Mechanisms and Dynamics of Machinery . John Wiley & Sons; 1991. [Google Scholar]
  • 50.Randall R. B., Antoni J. Rolling element bearing diagnostics-A tutorial. Mechanical Systems and Signal Processing . 2011;25(2):485–520. doi: 10.1016/j.ymssp.2010.07.017. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The NREL wind turbine data used to support the findings of this study are included within the article.


Articles from Computational Intelligence and Neuroscience are provided here courtesy of Wiley

RESOURCES