Skip to main content
PLOS One logoLink to PLOS One
. 2020 Feb 4;15(2):e0228324. doi: 10.1371/journal.pone.0228324

A novel method of combining generalized frequency response function and convolutional neural network for complex system fault diagnosis

Lerui Chen 1,*, Zerui Zhang 1, Jianfu Cao 1
Editor: Jie Zhang2
PMCID: PMC6999895  PMID: 32017780

Abstract

To solve the problem of low accuracy in traditional fault diagnosis methods, a novel method of combining generalized frequency response function(GFRF) and convolutional neural network(CNN) is proposed. In order to accurately characterize system state information, this paper proposed a variable step size least mean square (VSSLMS) adaptive algorithm to calculate the second-order GFRF spectrum values under normal and fault states; In order to improve the ability of fault feature extraction, a convolution neural network (CNN) with gradient descent learning rate and alternate convolution layer and pooling layer is designed to extract the fault features from GFRF spectrum. In the proposed method, the second-order GFRF spectrum of each state of Permanent Magnet Synchronous Motor (PMSM) is obtained by VSSLMS; Then, the two-dimension GFRF spectrum, which is regarded as the gray value of the image,will be further transformed into image. Finally, the CNN is trained with learning rate by gradient descent way to realize the fault diagnosis of PMSM. Experimental results indicate that the accuracy of proposed method is 98.75%, which verifies the reliability of the proposed method in application of PMSM fault diagnosis.

1 Introduction

Permanent magnet synchronous motor (PMSM) is widely used in industrial robot drive system because of its small size, light weight and simple structure. Because PMSM works in continuous and heavy load environment, it is easy to suffer from abnormal fever, rotor poor lubrication and rotor magnetic leakage[1], so it is very important to monitor and diagnose the fault of PMSM.

At present, with the deepening research of motor, there are many researches on motor fault diagnosis. Most of them are based on monitoring and acquisition of motor state-related signal, such as voltage, speed, current and vibration signal, and then using feature extraction, information fusion and pattern recognition methods to achieve fault diagnosis.Ref. [2] considered stator current as fault signal feature to detect the fault of planetary gearbox of motor at different rotational speeds; Ref. [3] proposed a feature extraction method of smofs-22,which based on acoustic signal to diagnose stator, rotor and bearing faults of single-phase induction motor; Ref. [4] treated current signal as fault features, and convert the current signal by continuous wavelet transform (CWT) to realize the diagnosis of broken rotor bars in squirrel cage induction motor; Ref. [5] utilized the vibration signal in frequency domain to realize the motor bearing fault diagnosis by one-dimension fusion neural network; Ref. [6] applied discrete wavelet transform (DWT) to the stator current signal to realize the fault diagnosis of motor rotor. These methods above only analyzed single signal, such as current or vibration signal, which is difficult to extract fault characteristics effectively; Ref.[7] proposed a multi-source health state information fusion method based on DS theory, which can achieve better information fusion for automobile motor from X-axis, Y-axis and Z-axis and get good prediction results of health status; Ref. [8] proposed a fault signal classification method based on support vector machine (SVM) and short-time Fourier transform (STFT) for sensor data fusion, which combined vibration, current, voltage, sound and temperature information to detect and identify motor faults; Ref. [9] fused the vibration signal collected from two-direction of the motor, then the fused vibration signal is decomposed into multiple scales by continuous wavelet transform (CWT), and the first scale is used to represent the degradation of bearing. Although these methods above make up for the limitation of single signal by fusing two or more signal, the fault features were extracted from the output signal of the system without considering the influence of the input signal on the system, so it cannot fully reflect the non-linear characteristics of the system. Nonlinear spectrum analysis based on generalized frequency response function (GFRF) considers the influence of input on the system characteristics and can characterize the dynamic characteristics of the nonlinear system. Nonlinear frequency spectrum can well explain the characteristics of system, the accuracy of fault diagnosis can be improved by using it to characterize the state information, and achievements have been made in the field of fault diagnosis which can be shown in Refs. [10]-[15].However, up to now, it is very rare to study the fault diagnosis for permanent magnet synchronous motor (PMSM) based on GFRF spectrum.

In recent years, with the increasing number of equipment monitoring system, the monitoring system has obtained a large number of data, so the fault diagnosis method based on data-driven has been favored by scholars. Among many data-driven methods, deep learning has advantages in image classification with its powerful learning and recognition ability, and also has been applied in the field of fault diagnosis; Ref.[16] proposed a motor fault diagnosis method based on stacked de-noising auto-encoder, which can extract features adaptively. Compared with traditional learning methods, it can extract fault features more deeply and obtain higher diagnosis accuracy; Ref. [17] proposed deep belief networks (DBN), which can learn the characteristics of vibration signal from the frequency distribution to characterize the working state of induction motors. Combining feature extraction and classification, the intelligent fault diagnosis is realized; Ref.[18] combined stacked de-noising auto encoders (SDAE) and nonlinear output frequency response function(NOFRF) spectrum to realize the fault diagnosis of permanent magnet synchronous motor (PMSM). Ref.[19] combined convolution neural network (CNN) and short-time Fourier transform(SFT) to realize motor fault diagnosis. In order to solve the problem of fault feature extraction is not deep enough and fault diagnosis accuracy is low, a new fault diagnosis method for permanent magnet synchronous motor(PMSM) combining non-linear spectrum based on generalized frequency response function (GFRF) and convolution neural network(CNN) is proposed. In this method, CNN network has strong ability of feature extraction and classification, but the input network is required to be the form of image, of all nonlinear spectrum based on Volterra kernel, only the second-order GFRF spectrum can meet the input requirement of CNN. In addition, the second-order GFRF spectrum is essentially two dimensional, which does not need to generate two dimensional data through the truncation and stacking of one dimensional data, thus it can avoid the loss of nonlinear characteristics[20]-[21]. Therefore, the second-order GFRF spectrum selected in this paper as CNN input retains the nonlinear characteristics of the data to the greatest extent. At the same time, according to the characteristics of GFRF spectrum, CNN network is designed, in which the convolution layer and the pooling layer alternately appear, and the learning rate of gradient descent is used to train the network to improve the accuracy of network output.

The rest of this paper is organized as follows: Section II will present the theory of GFRF spectrum and CNN; In Section Ⅲ, the process of the proposed method will be shown. Section IV will show the result of experiments and discuss the reasons why different methods produce vary recognition rates. Finally, a concise conclusion of this work will be presented in Section Ⅴ.

2. Methodology

2.1 The definition of GFRF spectrum

The input and output relationship of continuous time-invariant nonlinear dynamic system can be expressed by Volterra series as shown in Eq (1)[18], [22]-[24]:

yn(t)=n=1Nhn(τ1,τ2,,τn)i=1nu(tτi)dτi (1)

where, u(t) is the input of system, yn(t) is the n-th output of system, hn(τ1,τ2,⋯,τn) is the n-th Volterra time domain kernel, then we make a multi-dimension Fourier transformation for hn(τ1,τ2,⋯,τn),which can get the result as Eq (2):

H(jω1,jω2,,jωn)=hn(τ1,τ2,,τn)ej(ω1τ1+ω2τ2++ωnτn)i=1ndτi (2)

As shown in Eq (2), H(1,2,⋯,n) is called the n-th Volterra frequency domain kernel or generalized frequency response function (GFRF). A large number of experiments show that the GFRF spectrum of the system varies greatly in different states. According to this conclusion, we can characterize the spectrum of GFRF as features for fault diagnosis.

2.2 The calculation of GFRF spectrum

At present, there are two methods for GFRF calculating: the recursive method and the identification method. The former method calculates the GFRF spectrum values of each order by recursive equation, which is suitable for the system with clear mathematical model; the latter method evaluates the GFRF spectrum values of each order by system identification based on the input and output of the system, which is not only suitable for the system with clear mathematical model, but also suitable for the complex system whose mathematical model is unclear, so the latter method has stronger applicability.

According to the GFRF definition, the output of Volterra model y˜(k) can be expressed as Eq (3):

y˜(k)=hv(k)UvT(k) (3)

where, UvT(k) is the input vector, hv(k) is the Volterra kernel vector. From the analysis above, it can be seen that the identification of GFRF is essentially a least squares problem, In order to calculate the second GFRF spectrum accurately,this paper adopted a variable step size least mean square (VSSLMS) adaptive algorithm, which is shown as Eq (4).

hv(k+1)=hv(k)+μke(k)UvT(k) (4)

where, e(k) is the error between y˜(k) and y(k),which equals y˜(k)y(k). UvT(k) is input of system. μk is the step size, which can be expressed as Eq (5).

μk={μmax,μkμmaxμmin,μkμminμk,other (5)

where μk = γμk−1+λ|e(k−1)|2,0<γ<1,λ>0; μmax is the upper limit of μk; μmin is the lower limit of μk.

In Eq (5),the step size μk is decided by error e(k),namely, when the error e(k) is large, μk is also large, which will accelerate the convergence speed of algorithm; when the error e(k) is small, μk is also small, which will reduce the steady-state error of the algorithm. From the analysis above, it can be found that the adaptive identification algorithm as proposed can take into account both convergence speed and estimation error. The specific identification process is shown as Fig 1.

Fig 1. The process of GFRF calculation.

Fig 1

The specific steps of VSSLMS adaptive algorithm are as follows:

Step 1: Parameters initialization, letting hv(k) = [0,0,…,0]T, μk = μmax;

Step 2: Collecting the k input data UvT(k) and k output data y(k);

Step 3: Calculating the estimated output y˜(k) by (3),then calculating the error e(k), e(k)=y˜(k)y(k); if |e(k)|≪ε, the identification is over; else,it jump to step 4.

Step 4: Calculating the next Volterra series hv(k+1) by (3).

Step 5: Calculating the step size μk+1, making k = k+1,and then jump to Step 3.

Step 6: when the algorithm is over, N×N Fourier Transform is performed on hv(k),and the second-order GFRF spectrum is obtained.

2.3 The theory of convolutional neural network

Convolutional neural network (CNN) is a kind of feedforward neural network, which can learn and extract information from data layer by layer, and also can reveal the essential characteristics of the system hidden in data. It is widely used in the field of image processing [25]-[27]. The network is usually composed of convolution layer(C), pooling layer(P) and full connection layer(F). In order to dig out the hidden characteristics of data, convolution layer and pooling layer often alternately appear. The structure of the network is shown as Fig 2.

Fig 2. Convolutional neural network structure.

Fig 2

2.3.1 Forward propagation

The forward propagation process of CNN network mainly involves convolution layer, pooling layer and full connection layer. The neurons between different layers are connected in the form of weights, which are important parameters of CNN, because they directly determine the mapping relationship of feature space.

Multiple convolution kernels are adopted in convolution layer to convolute the input images. After adding offset, some new features can be obtained by activation function. The mathematical expression of the convolution process can be shown as Eq (6):

Xjl=f(iMjXil1ωijl+bjl) (6)

where, Xjl is the j-th element in layer l; Mj is the set of input feature graphs needed to output the number j feature graph; ωijl is the weight matrix of the corresponding convolution kernel. bjl is the bias term; f(∙)is the activation function, and the ReLu function is commonly adopted, which is shown as Eq (7):

f(x)=max[0,x] (7)

The pooling layer is also called the lower sampling layer, which mainly reduces the dimension of the output feature map from the convolution layer. The pooling methods including maximum pooling, average pooling and random pooling are commonly used. Because the pooling layer only carries out dimension reduction operations and no parameters are involved in the calculation, it does not need to update the weights; the full connection layer mainly expands all the feature maps to form one dimensional feature vectors, and after weighted summation, it realizes the classification of feature types through the Softmax activation function.

2.3.2 Back propagation

Back propagation is used to adjust the network weight to minimize the error cost function, so as to minimize the error between the predicted results and the actual results in forward propagation to establish the mapping from the feature space to the fault space. There are many common cost functions. This paper chooses variance cost function, which is shown as Eq (8):

E=12k=1n(tknykn)2=12tnyn2 (8)

where, n is the number of failure samples; tn is the test value of the n-th sample; yn is the true value of the n-th sample. In the process of reverse propagation, in order to obtain a series of convolution kernels, the gradient descent method is usually used to iterate continuously to adjust the adaptive parameters of the network. At the same time, the residual error of each layer is reduced to the minimum through learning, and finally the updated convolution kernels are obtained.

ωijl=ωijlδEωijl (9)
bjl=bjlδEbjl (10)

As shown in Eqs (9)–(10), δ is a learning rate parameter, which is used to control the update step of the value. If δ is too large, the network will fall into local optimum. If δ is too small, the training time of the network will increase

3 Fault diagnosis method of permanent magnet synchronous motor based on GFRF spectrum and CNN

In this paper, the fault diagnosis method based on GFRF spectrum + CNN can integrate fault feature extraction and pattern recognition to realize intelligent fault diagnosis of the system. The diagnosis process is shown as Fig 3.

Fig 3. The flow of fault diagnosis based on GFRF spectrum +CNN.

Fig 3

The specific steps are as follows:

  1. Obtaining the second-order GFRF spectrum of normal state and fault state of the system, and normalizing the spectrum value;

  2. The normalized spectrum amplitudes were treated as image gray values, which were converted into two-dimension gray images.

  3. A large number of gray image sets were obtained by repeated experiments N times, which were divided into training samples and test samples.

  4. Initialize the convolution neural network and set parameters such as iteration times and learning rate.

  5. The training samples were fed into CNN network in batches, the predicted values of fault types were obtained by forward propagation, and the errors were calculated with the actual values.

  6. The error calculated in step (5) was back-propagated, and the weight parameters of each layer were updated layer by layer to obtain the ideal CNN network.

  7. The test samples were put into CNN network to realize the output of fault types of test samples.

4 Experimental verification

4.1 Data acquisitions

In this paper, the permanent magnet synchronous motor (PMSM) used in industrial robot is taken as the research object for fault diagnosis. According to the mathematical model of PMSM in Ref. [28], it can be expressed as Eq (11):

diddt=rLid+pωriq+1Luddiqdt=pωridrLiqϕgpLωr+1Luqdωrdt=1.5ϕgpJiqBJωr+TlJ (11)

As shown in Eq (11), id and iq are armature currents of stator winding on d axis and q axis respectively; ud and uq are armature voltage of stator winding on d axis and q axis respectively. For ease of analysis, this paper assumes ud = 0; L is stator winding inductance; ϕg is flux linkage produced by permanent magnet; r is stator winding resistance; J is rotor moment of inertia; Tl is load torque; p is the number of pole pair; B is rotor damping coefficient; ωr is rotor angular velocity. In (10), id is input and ωr is output of the system, respectively, the nonlinear differential model can be expressed as Eq (12).

id+rLid2J3ϕgωrωr2B3ϕgωr22Tl3ϕgωr=0 (12)

During the long-term operation of the equipment, the motor often suffers from fever, poor lubrication and magnetic leakage of the rotor. These fault states are represented by parameter r increasing, parameter B increasing and parameter ϕg decreasing, respectively. Under normal conditions, the parameters of a certain type of permanent magnet synchronous motor (PMSM)are as follows:

r[5.0,5.5]Ω,Tl=3Nm,L=0.005H,ϕg=0.186Wb,J=1.5×106kgm2,B[2.0×105,3.2×105]Nms/rad

when fever failure occurs, the parameters are as follows:

r>5.6Ω,Tl=3Nm,L=0.005H,ϕg=0.186Wb,J=1.5×106kgm2,B[2.0×105,3.2×105]Nms/rad

when poor lubrication of rotor failure occurs, the parameters are as follows:

r[5.0,5.5]Ω,Tl=3Nm,L=0.005H,ϕg=0.186Wb,J=1.5×106kgm2,B>3.2×105Nms/rad

when magnetic leakage of rotor failure occurs, the parameters are as follows:

r[5.0,5.5]Ω,Tl=3Nm,L=0.005H,ϕg[0,0.186]Wb,J=1.5×106kgm2,B[2.0×105,3.2×105]Nms/rad

Adopting the identification algorithm of VSSLMS proposed in this paper, the second-order GFRF spectrums of normal state and three failure states are shown as (Fig 4A–4D) under the conditions as Table 1.

Fig 4. Second-order GFRF spectrum in different states under working condition 1.

Fig 4

Table 1. Corresponding parameters of different states under working condition 1.

State r (Ω) B (N·m·s/rad) ϕg (Wb)
Normal 5 3×10−5 0.186
Fever 6 3×10−5 0.186
Rotor poor lubrication 5 6.4×10−5 0.186
Rotor magnetic leakage 5 3×10−5 0.091

The peak values of the second-order GFRF spectrum under working condition 1 are shown as Table 2.

Table 2. Peak distribution of second-order GFRF spectrum in different states.

State Second-order GFRF spectrum value
Maximum Minimum
Normal 0.6279 0.0865
Fever 0.8769 0.1245
Rotor poor lubrication 0.7465 0.1845
Rotor magnetic leakage 0.1503 0.0207

From Fig 4 and Table 2, it can be seen that the second-order GFRF spectrum values vary greatly in different states, so the fault can be characterized by the second-order GFRF spectrum. In order to satisfy the requirement of CNN input type, the spectrum values are treated as gray values of image, so that the spectrum of different states can be converted into two-dimension gray images, which are shown as Fig 5. Each state is repeated 30,000 times, so 30,000 two-dimension gray image sets are obtained. 80% of images are trained and 20% of images are tested in CNN.

Fig 5. Gray images of GFRF spectrum.

Fig 5

4.2 Experiment results and analysis

The CNN network constructed in this paper consists of three convolution layers (C1, C2, C3), two pooling layers (P1, P2), two full connection layers (F1, F2) and one Softmax output layer (O1). Three convolution layers and two pooling layers appear alternately. The number of convolution kernels in convolution layers are 36, 66 and 240. the size of convolution kernels is 5 *5, and the sliding step is 1; the activation function is adopted ReLU function; the average pooling operation is used in the pooling layer, and the pooling area is 2 *2. The learning rate is uniformly changed, namely, the initial learning rate is set to 0.01, and the value is reduced to 1/2 of the original value every 500 steps; the number of F1 and F2 in the full connection layer are 86 and 4, respectively. The input gray image size is 28 *28 and the batch equals 1000. The laboratory platform is configured as follows: Linux 18.04 operating system, GPU model is GTX1060, and the running environment is Python 3.7 + tensorflow 1.6. In order to prove the superiority of the proposed method, the following comparative experiments are carried out from the following aspects.

4.2.1 Comparison with the output signal+CNN methods

Most of data in current fault diagnosis research are collected from system output, such as current or vibration signal [2]-[6]. In order to verify the reliability of different data sets and their impacts on diagnosis results, the data set of this paper is GFRF spectrum estimated by VSSLMS, while the comparative test data sets are as follows: collecting the time domain signal(TS) of rotor speed ωr,and the frequency domain signal (FS) of ωr generated by Fourier transform. Meanwhile, the two kinds of one-dimension signal are intercepted and stacked to generate gray images, which are shown as Figs 6 and 7.

Fig 6. Gray images of time domain of output.

Fig 6

Fig 7. Gray images of frequency domain of output.

Fig 7

Putting the two types of gray images into the same CNN network of this paper for diagnosis. The diagnosis results are shown as Table 3.

Table 3. Recognition rates of different fault diagnosis methods.

Method State Samples Misjudgments Recognition rate Average rate

GFRF+CNN
Normal 5941 0 100.00%
98.75%
Fever 5946 298 94.99%
Rotor poor lubrication 5996 0 100.00%
Rotor magnetic leakage 6102 0 100.00%

FS+CNN
Normal 5990 113 98.11%
83.65%
Fever 5952 864 85.48%
Rotor poor lubrication 5989 3054 50.99%
Rotor magnetic leakage 6094 0 100.00%

TS+CNN
Normal 5922 651 89.01%
78.08%
Fever 6001 522 91.30%
Rotor poor lubrication 5909 4018 32.00%
Rotor magnetic leakage 6062 0 100.00%

From the Table 3, it can be seen that the average accuracy of GFRF spectrum + CNN method is 98.75%, the average accuracy of FS + CNN method is 83.65% and the average accuracy of TS + CNN method is 78.08%. The reason for this result is that the fault representation based on GFRF spectrum is designed from the whole system, which contains most information of the non-linear characteristics of the system. The distinctions between different faults are obvious, which can be seen from Fig 5, so the diagnosis accuracy is high. The output time-domain or frequency-domain signal only contain the local information of the system, which cannot represent the whole non-linear characteristics of the system. Meanwhile, unlike the second-order GFRF spectrum, the output data need to be converted into two-dimension gray image by interception and superposition. This operation may lead to the loss of non-linear characteristics. Therefore, the distinctions between different faults are not obvious, which can be seen from Fig 6 and Fig 7. It is worthy to be noticed that compared with Fig 6 and Fig 7, it is found that although the distinctions of output signal gray images in different states are not obvious, it is still discernible, gray images formed by output time domain signal cannot be directly distinguished by eyes. It is consistent with the conclusion that the information contained in frequency domain signal is more than that contained in time domain signal. Therefore, the accuracy of fault diagnosis using output time-domain signal to represent fault features is the lowest.

4.2.2. Comparison with other learning networks

In order to compare the fault diagnosis accuracy of different learning networks, three other deep learning networks, such as stack auto-encoder(SAE) network in Ref. [18],[29], deep belief neural (DBN)network in Ref.[30] and recurrent neural network(RNN) in Ref.[31] are adopted to diagnose PMSM faults. The accuracy of four methods are shown as Table 4.

Table 4. The fault diagnosis accuracy of different methods.

Method Samples Average accuracy rate
CNN 4×6000 98.75%
SAE 4×6000 96.33%
DBN 4×6000 97.69%
RNN 4×6000 94.14%

From Table 4, it can be seen that adopting deep learning network to diagnose the fault of PMSM can achieve high diagnosis accuracy, it is because that the deep learning network can extract fault features layer by layer, which can accurately reflect the changes of state. However, for PMSM system, the accuracy of fault diagnosis using CNN method is the highest.

4.2.3 Comparisons with traditional intelligent diagnosis methods

Feature extraction and pattern recognition are two processes of equipment fault diagnosis. At present, PCA in Ref. [32], KPCA in Ref. [33], t-SNE in Refs.[34]-[35] are the main methods of feature extraction, and SVM in Ref. [36] is the main method of fault recognition. Therefore, this paper compares the method based on GFRF spectrum + CNN with PCA + SVM, KPCA + SVM and t-SNE +SVM. The data sources of the latter methods are as follows: 800 points of second-order GFRF spectrum are collected randomly (mainly near the peak value) to form a set of 800-dimension vector, 200*800 data sets are obtained by 200 times experiments, 80% and 20% of which are treated as training sets and testing sets respectively. Gauss kernel function is adopted as KPCA kernel function, and the parameter of kernel function σ2 equals 4000, the cumulative contribution rate of principal components is set to 90%. The diagnosis accuracy of these methods are shown as Table 5. Meanwhile, the principal component analysis method is used to extract first two important components of fault features from three methods, and the visualization of first two principal components are shown as Figs 810.

Table 5. Accuracy rates of different fault diagnosis methods.

Method State Samples Misjudgments Accuracy rate Average rate
GFRF+CNN Normal 5941 0 100.00% 98.75%
Fever 5946 298 94.99%
Rotor poor lubrication 5996 0 100.00%
Rotor magnetic leakage 6102 0 100.00%
KPCA+SVM Normal 40 40 0.00% 63.13%
Fever 40 0 100.00%
Rotor poor lubrication 40 19 52.50%
Rotor magnetic leakage 40 0 100.00%
PCA+SVM Normal 40 40 0.00% 26.88%
Fever 40 0 100.00%
Rotor poor lubrication 40 40 0.00%
Rotor magnetic leakage 40 37 7.50%
t-SNE+SVM Normal 40 19 52.50% 82.50%
Fever 40 0 100.00%
Rotor poor lubrication 40 8 80.00%
Rotor magnetic leakage 40 1 97.50%

Fig 8. Visualization of principal components based on CNN feature extraction.

Fig 8

Fig 10. Visualization of principal components based on KPCA feature Extraction.

Fig 10

From Table 5, it can be seen that the recognition rate of the proposed method is 98.75%, the recognition rate of KPCA + SVM method is 63.13%, the recognition rate of PCA + SVM method is only 26.88%, and the recognition rate of t-SNE + SVM method is 82.50%. The reason for this result is that in this paper, CNN with multi-layer is adopted to extract fault characteristics layer by layer, and the network parameters can be adjusted by back propagation, which has strong self-adaptability. Therefore, the features distinction between different states are obvious, which can be seen from Fig 8; PCA+SVM adopts linear mapping function to compress high-dimension data, which is obviously inappropriate for a non-linear system. After dimension reduction, the state features contained in the data overlap with each other, which is shown as Fig 9; KPCA + SVM adopts non-linear mapping function of Gauss to improve the features discrimination between different faults, the features distinction is shown as Fig 10; Compared with the Gauss distribution in KPCA, the t-distribution adopted in t-SNE method is less affected by the outliers, the fitting results are more reasonable, and the overall characteristics of the data are better captured, the features distinction is shown as Fig 11.

Fig 9. Visualization of principal components based on PCA feature extraction.

Fig 9

Fig 11. Visualization of principal components based on t-SNE feature extraction.

Fig 11

5 Conclusions

A new method of fault diagnosis for permanent magnet synchronous motor(PMSM) is proposed. It combines the nonlinear frequency spectrum based on GFRF and convolution neural network(CNN) to improve the diagnosis accuracy. Nonlinear frequency spectrum based on GFRF is utilized to characterize the fault information, the CNN is utilized to extract fault features from GFRF spectrum, experimental results verified the effectiveness of the proposed method.

The main achievements can be summarized as follows:

  1. Compared with the method of fault diagnosis using output signal as fault feature, nonlinear spectrum based on GFRF can fully represent the state information of the system, which has higher diagnosis accuracy rate.

  2. Compared with other deep learning network, CNN has highest ability of feature extraction and classification for PMSM fault diagnosis.

  3. Compared with traditional diagnosis method PCA, KPCA and t-SNE, convolutional neural network has obvious advantages in fault diagnosis accuracy with its powerful data mining and strong fault feature extraction capabilities.

Although the proposed method has achieved excellent results for fault diagnosis of PMSM, it will be a future work to test the versatility of the proposed method by diagnosing other non-linear systems.

Supporting information

S1 Fig. The process of GFRF Calculation.

(DOCX)

S2 Fig. Convolutional neural network structure.

(DOCX)

S3 Fig. The flow of fault diagnosis based on GFRF Spectrum +CNN.

(DOCX)

S4 Fig. Second-order GFRF spectrum in different states under working condition 1.

(DOCX)

S5 Fig. Gray images of GFRF spectrum.

(DOCX)

S6 Fig. Gray images of time domain of output.

(DOCX)

S7 Fig. Gray images of frequency domain of output.

(DOCX)

S8 Fig. Visualization of principal components based on CNN feature extraction.

(DOCX)

S9 Fig. Visualization of principal components based on PCA feature extraction.

(DOCX)

S10 Fig. Visualization of principal components based on KPCA feature extraction.

(DOCX)

S11 Fig. Visualization of principal components based on t-SNE feature extraction.

(DOCX)

S1 Table. Corresponding parameters of different states under working condition 1.

(DOCX)

S2 Table. Peak distribution of second-order GFRF spectrum in different states.

(DOCX)

S3 Table. Recognition rates of different fault diagnosis methods.

(DOCX)

S4 Table. The fault diagnosis accuracy of different methods.

(DOCX)

S5 Table. Accuracy rates of different fault diagnosis methods.

(DOCX)

Acknowledgments

The authors would like to thank Dr. Hu Heyu for his high performance computer with GPU for this project.

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

This work was funded by the National Natural Science Foundation (NNSF) of China [NO.61573272], Shaanxi Key science and technology project [No.2019ZDLGY01-01-02], Foshan science and technology project [No.2016AG101813].

References

  • 1.Kao IH, Wang WJ, Lai YH, Peng JW. Analysis of permanent magnet synchronous motor fault diagnosis based on learning. IEEE Transactions on Instrumentation and Measurement, 2019;68(2):310–324. 10.1109/TIM.2018.2847800 [DOI] [Google Scholar]
  • 2.Chen XW, Feng ZP. Time-frequency space vector modulus analysis of motor current for planetary gearbox fault diagnosis under variable speed conditions. Mechanical Systems and Signal Processing 2019;121:636–654. 10.1016/j.ymssp.2018.11.049 [DOI] [Google Scholar]
  • 3.Glowacz A. Fault diagnosis of single-phase induction motor based on acoustic signal. Mechanical Systems and Signal Processing, 2019;117:65–80. 10.1016/j.ymssp.2018.07.044 [DOI] [Google Scholar]
  • 4.Monfared OA, Doroudi A, Darvishi A. Diagnosis of rotor broken bars faults in squirrel cage induction motor using continuous wavelet transform. Compel-The International Journal for Computation and Mathematics in Electrical and Electronic Engineering, 2019;38(1):167–182. 10.1108/COMPEL-11-2017-0487 [DOI] [Google Scholar]
  • 5.Jian XZ, Li WL, Guo XG, Wang RZ. Fault diagnosis of motor bearings based on a one-dimensional fusion neural network. Sensors, 2019;19(1):122–138. 10.3390/s19010122 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Taher SA, Malekpour M, Farshadnia M. Diagnosis of broken rotor bars in induction motors based on harmonic analysis of fault components using modified adaptive notch filter and discrete wavelet transform. Simulation Modelling Practice and Theory, 2016;44:26–41. 10.1016/j.simpat.2014.02.006 [DOI] [Google Scholar]
  • 7.Wang LZ, Sun YS, He Y, Zhao XJ, Fan WH, Wang XH. Motor health status prediction method based on information from multi-sensor and multi-feature parameters. Journal of Nondestructive Evaluation, 2019;38(2):46–51. 10.1007/s10921-019-0587-4 [DOI] [Google Scholar]
  • 8.Banerjee TP, Das S. Multi-sensor data fusion using support vector machine for motor fault detection. Information Sciences, 2012; 217:96–107. 10.1016/j.ins.2012.06.016 [DOI] [Google Scholar]
  • 9.Ayaz E, Ozturk A, Seker S, Upadhyaya BR. Fault detection based on continuous wavelet transform and sensor fusion in electric motors. Compel-The International Journal for Computation and Mathematics in Electrical and Electronic Engineering, 2009; 28(2):454–470. 10.1108/03321640910929326 [DOI] [Google Scholar]
  • 10.Zhang JL, Cao JF, Gao F. Fault diagnosis for multivariable non-linear systems based on non-linear spectrum feature. Transactions of the Institute of Measurement and Control,2017;39(7):1017–1026. 10.1108/03321640910929326 [DOI] [Google Scholar]
  • 11.Peng ZK, Lang ZQ. The Nonlinear Output Frequency Response Functions of One-Dimensional Chain Type Structures. Journal of Applied Mechanics-Transactions of the Asme,2010;77(1),011007 10.1115/1.3173604 [DOI] [Google Scholar]
  • 12.Peng ZK, Lang ZQ, Chu FL. Numerical analysis of cracked beams using nonlinear output frequency response functions. Computers & Structures,2018;86(17):1809–1818. 10.1016/j.compstruc.2008.01.011 [DOI] [Google Scholar]
  • 13.Peng ZK, Lang ZQ, Billings SA. Crack detection using nonlinear output frequency response functions. Journal of Sound and Vibration,2017;301(3):777–788. 10.1016/j.jsv.2006.10.039 [DOI] [Google Scholar]
  • 14.Bayma RS, Lang ZQ. Fault diagnosis methodology based on nonlinear system modelling and frequency analysis. IFAC Proceedings Volumes,2014;47(3):8278–8285. 10.3182/20140824-6-za-1003.02315 [DOI] [Google Scholar]
  • 15.Suzuki BR, Zhu YP, Lang ZQ. The analysis of nonlinear systems in the frequency domain using nonlinear output frequency response functions. Automatica,2018,94:452–457. 10.1016/j.automatica.2018.04.030 [DOI] [Google Scholar]
  • 16.Zhao XP, Wu JX, Zhang YH, Shi YQ, Wang LH. Fault diagnosis of motor in frequency domain signal by stacked de-noising auto-encoder. Cmc-Computers Materials & Continua,2018;57(2):223–242. 10.32604/cmc.2018.02490 [DOI] [Google Scholar]
  • 17.Shao SY, Sun WJ, Yan RQ, Wang P, Gao RX. A deep learning approach for fault diagnosis of induction motors in manufacturing. Chinese Journal of Mechanical Engineering, 2017;30(6):1347–1356. 10.1007/s10033-017-0189-y [DOI] [Google Scholar]
  • 18.Chen LR, Zhang ZR, Cao JF, Wang XQ. A novel method of combining nonlinear frequency spectrum and deep learning for complex system fault diagnosis.Measurement,2020;151:107190 10.1016/j.measurement.2019.107190. [DOI] [Google Scholar]
  • 19.Wang LH, Zhao XP, Wu JX, Xie YY, Zhang YH. Motor fault diagnosis based on short-time fourier transform and convolutional neural network. Chinese Journal of Mechanical Engineering, 2017;30(6):1357–1368. 10.1007/s10033-017-0190-5 [DOI] [Google Scholar]
  • 20.Xu GW, Liu M, Jiang ZF, Soffker D, Shen WM. Bearing fault diagnosis method based on deep convolutional neural network and random forest ensemble learning. Sensors, 2019;19(5):1088–1097. 10.3390/s19051088 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wen L, Li XY, Gao L, Zhang YY. A new convolutional neural network-based data-driven fault diagnosis method. IEEE Transactions on Industrial Electronics, 2018;65(7):5990–5998. 10.1109/TIE.2017.2774777 [DOI] [Google Scholar]
  • 22.Jing XJ, Lang ZQ. On the generalized frequency response functions of volterra systems. Journal of Dynamic Systems Measurement and Control-Transactions of the Asme, 2009;131(6): 061002 10.1115/1.3211088 [DOI] [Google Scholar]
  • 23.Jing XJ, Lang ZQ, Billings SA. Parametric characteristic analysis for generalized frequency response functions of nonlinear systems. Circuits Systems and Signal Processing, 2009;28(5):699–733. 10.1007/s00034-009-9106-7 [DOI] [Google Scholar]
  • 24.Jing XJ, Lang ZQ, Billings SA. Mapping from parametric characteristics to generalized frequency response functions of non-linear systems. International Journal of Control, 2008;81(7):1071–1088. 10.1080/00207170701636542 [DOI] [Google Scholar]
  • 25.Amirkolaee H.A, Arefi H. CNN-based estimation of pre- and post-earthquake height models from single optical images for identification of collapsed buildings. Remote Sensing Letters, 2019;10(7):679–688. 10.1080/2150704X.2019.1601277 [DOI] [Google Scholar]
  • 26.Guo YD, Zhang JY, Cai JF, Jiang BY. CNN-based real-time dense face reconstruction with inverse-rendered photo-realistic face images. IEEE Transactions on Pattern Analysis and Machine Intelligence 2019;41(6):1294–1307. 10.1109/TPAMI.2018.2837742 [DOI] [PubMed] [Google Scholar]
  • 27.Zhang AJ, Yang XZ, Jia L, Xia JF. SRAD-CNN for adaptive synthetic aperture radar image classification. International Journal of Remote Sensing 2019;40(9):3461–3485. 10.1080/01431161.2018.1547450 [DOI] [Google Scholar]
  • 28.Garcia AM, Rivas Cambero IR, Anieva JAM. Modeling and analysis of speed tuning of a PMSM with presence of crack using genetic algorithms. REVISTA IBEROAMERICANA DE AUTOMATICA E INFORMATICA INDUSTRIAL, 2019;16(2):190–199. 10.4995/riai.2018.9767 [DOI] [Google Scholar]
  • 29.He YG, Li CC, Wang T, Shi TC, Tao L, Yuan WB. Incipient fault diagnosis method for IGBT drive circuit based on improved SAE.IEEE access,2019;7:92410–92418. 10.1109/ACCESS.2019.2923017 [DOI] [Google Scholar]
  • 30.Zhang TF, Li Z, Deng ZH, Hu B. Hybrid data fusion DBN for intelligent fault diagnosis of vehicle reducers. Sensors,2019;19(11):2504 10.3390/s19112504 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.He JY, Lee J, Song TT, Li H.T, Kandeepan S, Wang K. Recurrent neural network (RNN) for delay-tolerant repetition-coded (RC) indoor optical wireless communication systems. Optics letters,2019;44(15):3745–3748. 10.1364/OL.44.003745 [DOI] [PubMed] [Google Scholar]
  • 32.Chen HT, Jiang B, Lu N, Mao ZH. Deep PCA based real-time incipient fault detection and diagnosis methodology for electrical drive in high-speed trains. IEEE Transactions on Vehicular Technology, 2018;67(6):4819–4830. 10.1109/TVT.2018.2818538 [DOI] [Google Scholar]
  • 33.Yao YC, Bao J, Skyllas-Kazacos M, Welch BJ, Akhmetov S. Fault detection and diagnosis in hall-H, roult cells based on individual anode current measurements using dynamic kernel PCA. Metallurgical and Materials Transactions B-Process Metallurgy and Materials Processing Science, 2018;49(4):2077–2088. 10.1007/s11663-018-1254-3 [DOI] [Google Scholar]
  • 34.Van der Maaten L, Hinton G. Visualizing data using t-SNE. Journal of Machine Learning Research, 2008, 9(85):2579–2605. [Google Scholar]
  • 35.Van Der Maaten L. Accelerating t-SNE using tree-based algorithms. Journal of machine learning research, 2014, 15(1):3221–3245. [Google Scholar]
  • 36.Thai V, Cheng JS, Nguyen V, Daothi P. Optimizing SVM's parameters based on backtracking search optimization algorithm for gear fault diagnosis. Journal of Vibroengineering,2019,21(1):66–81. 10.21595/jve.2018.19859 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. The process of GFRF Calculation.

(DOCX)

S2 Fig. Convolutional neural network structure.

(DOCX)

S3 Fig. The flow of fault diagnosis based on GFRF Spectrum +CNN.

(DOCX)

S4 Fig. Second-order GFRF spectrum in different states under working condition 1.

(DOCX)

S5 Fig. Gray images of GFRF spectrum.

(DOCX)

S6 Fig. Gray images of time domain of output.

(DOCX)

S7 Fig. Gray images of frequency domain of output.

(DOCX)

S8 Fig. Visualization of principal components based on CNN feature extraction.

(DOCX)

S9 Fig. Visualization of principal components based on PCA feature extraction.

(DOCX)

S10 Fig. Visualization of principal components based on KPCA feature extraction.

(DOCX)

S11 Fig. Visualization of principal components based on t-SNE feature extraction.

(DOCX)

S1 Table. Corresponding parameters of different states under working condition 1.

(DOCX)

S2 Table. Peak distribution of second-order GFRF spectrum in different states.

(DOCX)

S3 Table. Recognition rates of different fault diagnosis methods.

(DOCX)

S4 Table. The fault diagnosis accuracy of different methods.

(DOCX)

S5 Table. Accuracy rates of different fault diagnosis methods.

(DOCX)

Data Availability Statement

All relevant data are within the manuscript and its Supporting Information files.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES