Novel Probabilistic Neural Network Models Combined with Dissolved Gas Analysis for Fault Diagnosis of Oil-Immersed Power Transformers

Yichen Zhou; Lingyu Tao; Xiaohui Yang; Li Yang

doi:10.1021/acsomega.1c01878

. 2021 Jul 8;6(28):18084–18098. doi: 10.1021/acsomega.1c01878

Novel Probabilistic Neural Network Models Combined with Dissolved Gas Analysis for Fault Diagnosis of Oil-Immersed Power Transformers

Yichen Zhou ^†, Lingyu Tao ^‡, Xiaohui Yang ^‡,^*, Li Yang ^‡

PMCID: PMC8296607 PMID: 34308042

Abstract

Fault diagnosis technology of power transformers is essential for the stable operation of power systems. Fault diagnosis technology based on dissolved gas analysis (DGA) is one of the most commonly used methods. However, due to the lack of fault information, traditional DGA fault diagnosis techniques are difficult to meet increasing power demand in terms of accuracy and efficiency. To address this problem, this paper proposes a novel fault diagnosis model for oil-immersed transformers based on International Electrotechnical Commission (IEC) ratio methods and probabilistic neural network (PNN) optimized with the modified moth flame optimization algorithm (MMFO). PNN as a radial neural network has good utility and is often used in classification models, but its classification performance is easily affected by the smoothing factor (σ) of the hidden layer and is not stable. This paper addresses this issue using the MMFO to optimize the smoothing factor, which effectively improves the classification accuracy and robustness of PNN. The proposed method was validated by conducting the experiments with the real data collected from transformers. Experimental results show that the MMFO-PNN model improves the fault diagnosis accuracy rate from 70.65 to 99.04%, which is higher than other power transformer fault diagnosis models.

Introduction

An oil-immersed power transformer is one of the most important high-voltage devices in a transmission and transformer system, and its operational status determines whether the power grid can be reliably supplied.¹ Failure to detect power transformer faults in a timely and accurate manner will have a serious negative impact on grid paralysis and damage the normal development of social economy.² Therefore, the study of power transformer fault diagnosis is of great importance for the security and reliability of the power network.³

Over the years, dissolved gas analysis (DGA) has become the most popular method to identify the initial fault of a transformer.⁴ Normal operation of oil-immersed transformers due to insulation aging cracking and other reasons will produce a very small amount of gas: hydrogen (H₂), methane (CH₄), ethane (C₂H₆), ethylene (C₂H₄), acetylene (C₂H₂), carbon monoxide (CO), and carbon dioxide (CO₂). When a power transformer fails or has a potential fault, the content of various gases dissolved in the oil can change significantly. Therefore, the composition of dissolved gases in transformer oil can reflect the operating condition of power transformers to a large extent.⁵ At present, traditional power transformer fault diagnosis methods such as the three-ratio method, Rogers method, and Dornerburg method recommended by International Electrotechnical Commission (IEC) have been developed based on DGA data.⁶ However, the composition of gas components generated by oil-immersed power transformer faults is complex and the distribution characteristics are difficult to speculate, so it is difficult to map a precise relationship between the gas content or content ratio in oil and transformer fault type.⁷

At the same time, with the continuous development of artificial intelligence technology, the advantages of artificial intelligence technology in the calculation, classification, and prediction have been found by many researchers in other fields and widely used in their own research work. For example, multilayer perceptron artificial neural network (MLP-ANN) is used to predict the solubility of hydrogen sulfide at different temperatures, pressures, and concentrations, which shows good prediction performance;⁸ the least-squares support vector machine (LS-SVM) model was established by Ahmadi and Ahmadi⁹ to predict the solubility of CO₂ brine. The average relative absolute deviation between the model prediction and the experimental data is 0.1%; in the field of liquid production of condensate gas reservoir, a hybrid model of particle swarm optimization and artificial neural network¹⁰ is established. Compared with the traditional scheme, the intelligent model has superior performance in determining the dew point pressure of the condensate gas reservoir.

These research studies show that the artificial intelligence method has achieved good results in different fields. To solve the problem of low accuracy and efficiency of traditional DGA fault diagnosis technology and meet the needs of the increasingly complex power grid system, it is particularly important to adopt higher performance intelligent fault diagnosis technology in today’s growing power demand. Combined with artificial intelligence techniques, establishing a transformer fault classification model based on DGA data set is the basis of intelligent fault diagnosis in transformers. Su et al.¹¹ established a fault diagnosis model based on the fuzzy logic technique, which can diagnose multiple faults in transformers and can quantitatively represent the likelihood of each fault. Dhini et al.¹² established a support vector machine (SVM)-based oil immersion transformer fault diagnosis model with a higher recognition rate compared to traditional methods. Jiang et al.¹³ and Yu et al.¹⁴ established transformer fault diagnosis models based on the hidden Markov model (HMM) and k-nearest neighbor (KNN) algorithms, respectively, with faster diagnostic efficiency and showed superiority in dynamic fault prediction as well as real-time monitoring of operating conditions.

Although these methods can improve the accuracy of fault diagnosis to a certain extent, there are still some shortcomings. For example, Su¹¹ applied the fuzzy logic technique to the field of fault diagnosis, which has the advantages of convenience and intuitive effect. However, the disadvantage is that in the face of large data sets, the classification effect will decline sharply; the SVM-based fault diagnosis model¹² is limited by the limitations of SVM itself in the multiclassification problem, and it does not perform well in the face of complex high-dimensional data; although the KNN fault diagnosis model established by Yu¹⁴ improves the work efficiency of KNN algorithm, it does not solve the shortcomings of low fault tolerance rate of KNN to training data and is easy to fall into the dimension disaster, which leads to the weak generalization of the model; the fault diagnosis method based on the hidden Markov model (HMM) proposed by Jiang¹³ is difficult to guarantee the accuracy and stability in the medium- and long-term prediction of fault data. Compared with other artificial intelligence methods, artificial neural network (ANN) can significantly improve the accuracy of fault diagnosis.¹⁵ ANN-based fault diagnosis models for power transformers are trained by sampling data of power transformers under different operating conditions, and the connection weights and biases (significant parameters) of the network model are continuously adjusted during the training process to finally establish the corresponding mapping relationship between specific fault features and fault types.¹⁶ The researchers are integrating neural network-based, deep learning methods with transformer fault diagnosis techniques. Huang et al.¹⁷ proposed an evolutionary neural network approach for power transformer fault diagnosis. Based on the proposed evolutionary algorithm, the neural network automatically adjusts the network parameters (connection weights and deviation terms) to obtain the best model. Meng and Dong et al.¹⁸ proposed a radial basis function neural network (RBFNN) based on a hybrid adaptive training method for fault diagnosis of power transformers. The method is able to generate RBFNN models based on fuzzy c-means (FCM) and quantum-inspired particle swarm optimization (QPSO), which allows automatic configuration of the network structure and acquisition of model parameters. Compared to conventional neural networks, using these methods, the number of neurons, the center and radius of the hidden layer activation function, and the output connection weights can be automatically calculated. The classification accuracy of RBFNN is significantly improved. Dai et al.¹⁹ proposed a deep belief network (DBN)-based transformer fault diagnosis method. By analyzing the relationship between dissolved gas in transformer oil and fault type, the noncoding ratio of gas is determined as the feature parameter of the DBN model. DBN adopts a multilayer multidimensional mapping method to extract more detailed fault-type differences and proves through experiments that this method can effectively improve the accuracy of fault diagnosis. Huang et al.²⁰ proposed a transformer fault diagnosis method based on the gray wolf optimization (GWO) algorithm to optimize the hybrid kernel extreme learning machine (KELM); using the GWO algorithm can optimize the parameters of the hybrid kernel function, while using logistic chaos mapping can generate the initial population parameters of the GWO algorithm to avoid the adverse effects of too fast convergence on the optimization results and effectively improve the classification accuracy.

Although the evolutionary neural network model proposed by Huang¹⁷ can automatically adjust the network parameters, the convergence ability of the evolutionary algorithm is insufficient, and it is easy to fall into the local optimum, which limits the accuracy of the classification model; quantum-inspired particle swarm optimization (QPSO) proposed by Meng¹⁸ can solve the problem of slow convergence of PSO. However, when the data sample is large, the RBFNN has the disadvantage of complex structure and huge computation; the classification accuracy of fault diagnosis model based on DBN is very high,¹⁹ but it needs a lot of fault data for network training, and the classification performance is not stable in the case of a small amount of data; the method proposed in the literature²⁰ is very effective for KELM optimization, but the efficiency and accuracy need to be improved.

This paper chooses the probabilistic neural network (PNN) with certain advantages in fault classification as the basic classifier of the fault diagnosis model. Since the classification performance of PNN is susceptible to network parameters (such as smoothing factor and connection weight), the classification performance of PNN can be improved by optimizing network parameters.^21,22 We optimized the smoothing factor of PNN by improved moth flame optimization (MMFO) and established an MMFO-PNN fault diagnosis model. Compared with other fault diagnosis methods, MMFO-PNN has higher classification accuracy and efficiency in DGA data classification and has strong engineering practicability.

This paper is organized as follows: after the Introduction, Proposed Method describes the proposed machine learning theoretical approach, Implementation and Experimental Setup describes the establishment of a simulation model for transformer fault diagnosis, Experimental Results and Discussion presents and discusses the experimental results, and the final section draws conclusions.

Proposed Method

In this section, we present the proposed method for transformer fault diagnosis. We first discuss IEC ratio methods and the improved moth flame optimization algorithm and then detail the MMFO-based PNN model for fault diagnostics.

IEC Ratio Methods

This work focuses on the IEC three-ratio method. The IEC three-ratio method uses five gases: H₂, CH₄, C₂H₂, C₂H₄, and C₂H₆. These gases are produced in three gas ratios: C₂H₂/C₂H₄, CH₄/H₂, and C₂H₄/C₂H₆. It provides the values of the three key gas ratios corresponding to the suggested fault diagnosis. When the critical gas ratio exceeds a given threshold, the power transformer may fail early. Therefore, these given values can be used to detect early failures of power transformers.

Modified Moth Flame Optimization Algorithm

Moth flame optimization algorithm is a natural heuristic optimization algorithm, which was proposed by Mirjalili²³ in 2015. There are two important parts in MFO: moth and flame. The position of one moth corresponds to a solution to the problem. Flame stores all of the optimal solutions of moth population so far. Compared with other metaheuristics, MFO has the advantages of simple structure, good robustness, and easy implementation.²⁴

The MFO optimization process can be summarized into three phases:

(a) Randomly generated moth positions in the search space: The initialization of the position of each moth in the moth population is implemented in MFO as follows

where the two matrices, ub and lb, define the upper and lower bounds of the variables, respectively.

(b) Adaptive reduction of the number of flames: The equation for the adaptive reduction of the number of flames is as follows

where f_no represents the number of flames, N represents the number of moth populations, l is the number of current iterations, and T is the maximum number of iterations.

(c) Position update: The standard MFO chooses a logarithmic spiral update mechanism, which updates the flame list based on the best position at each iteration, enhancing the ability of spatial search. The logarithmic spiral update mechanism is formulated as follows

where b is the constant for constructing the logarithmic helix trajectory; after arranging the moth positions in ascending order according to their fitness values in the l-1th generation, the first f_no moth positions are taken as the list of lth generation flame positions, that is, F(l), and F_i(l) is the ith flame in the list; then, F_{f_no}(l) denotes the flame position with the worst fitness. D_i is calculated as follows

where t denotes the proximity of the moth to the location of the flame, which is determined by the following equation

Through the description of the above three stages, we summarize the whole optimization process of MFO as follows:

Step 1:
Setting the algorithm parameters: population size N, dimension d, maximum number of iterations T.
Step 2:
Initializing randomly the moth’s position in the search space by eq 1 and recording.
Step 3:
Calculating the fitness of each moth position, ranking the moth positions according to the fitness, and recording the better solution among them as the next generation of flame positions (the first generation of flame positions is generated from the first generation of moth positions).
Step 4:
Performing adaptive reduction of the number of flames by eq 2.
Step 5:
Updating D_i and t by eqs 4 and 5 and finally the individual moth positions by eq 3.
Step 6:
Checking if l is greater than T. If not, return to Step 3; if yes, stop iteration and output the result.

In the traditional moth flame algorithm, the moth approximates along a logarithmic helix trajectory for any flame, which can enhance the local search capability of the algorithm, but it is also easy to fall into the local optimum. To speed up the convergence speed and avoid the model falling into the local optimum, we introduce the chaotic operator after initialization and use the parallel light straight-line trajectory instead of the traditional logarithmic spiral trajectory to update the position. The pseudocode of MMFO is shown in Algorithm 1.

Because moth individuals search for flame individuals relatively independently without too much information, we select the top three relatively good individuals after adaptation ranking to perform a chaotic linear combination with each individual in the moth population and average them out to obtain the updated positions

where c_n is the chaotic mapping optimization operator, which is ergodic and stochastic and often achieves better results using the chaos mapping optimization operator instead of random numbers in the algorithm.²⁵ The introduction of the chaos operator in the position update process of MFO can effectively reduce the probability of falling into the local optimum and thus improve the global search capability, which is calculated as follows

where M_x represents the best individual and M_i represents each individual in the moth population.

As the number of flames adaptively decreases, when the number of flames is smaller than the number of moths, the traditional logarithmic helix is used for position updating

the whole update process can be expressed as (the meaning of the parameters in eq 9 is the same as described in the previous section)

graphic file with name ao1c01878_m009.jpg

PNN Optimized by MMFO

Probabilistic neural network (PNN) is a kind of neural network with a simple structure and wide application.²⁶ It is also a radial basis function feedforward neural network based on the Bayesian decision theory.

As shown in Figure 1, the structure of PNN is a parallel four-layer structure: input layer, mode layer, summation layer, and output layer. The input layer receives values from the training samples, converts them into feature vectors, and then assigns them to the network. The number of neurons in the input layer is equal to the number of sample vector dimensions. The pattern layer is calculated using the Euclidean distance between the training sample feature vectors and the center of the free base to match the input feature vectors to various types of relationships in the training set. The output of each pattern unit is

where X = [x₁, x₂, ...., x_n]^T, n = 1, 2, ...., l. l denotes the total number of training patterns. d denotes the feature vector dimension. X_ij is the jth center of the ith class of training samples, and σ is the smoothing factor. The summation layer performs a weighted average of the outputs of the neurons in the pattern layers belonging to the same type. The formula is as follows

where v_i is the output type of class i and L is the number of neurons of class i.

Topology of a probabilistic neural network.

The type corresponding to the maximum output in the output layer is the output type. The result is as follows

Due to the limitation of PNN itself, the smoothing factor σ has a great influence on the computational process of the input layer output to the hidden layer. If the value of σ is not chosen appropriately, too large or too small values can make the network converge too fast or easily fall into local optimum, allowing the classification accuracy to drop dramatically.²⁷ The MMFO described above is advantageous in global search capability and robustness compared with other traditional optimization algorithms. The classification performance of PNN can be greatly improved by MMFO optimization σ.

In the proposed MMFO-PNN model, the input feature vector can be expressed as

graphic file with name ao1c01878_m013.jpg

The steps for classifying the input feature vectors by the MMFO-PNN model are represented in Figure 2, which are described as follows:

Step 1:
Set the initial training sample X.
Step 2:
Initialize the probabilistic neural network by randomly defining the set of smoothing factors as follows
14
Step 3:
Initialize the MMFO algorithm parameters: population size N, dimensionality d, maximum number of iterations T, and initialize the fitness function f(x). It is worth noting that in our model, the mean square error (MSE) is set as the fitness value, and the corresponding fitness function can be expressed as
15

Diagram of the proposed MMFO-based PNN for fault diagnostics.

where Y_i is the actual output after network training and O_i is the theoretically expected output.

Step 4:
Initialize the position of the smoothing factor randomly and record it by eq 1.
Step 5:
Calculate the fitness of each smoothing factor position by eq 15 and record the current optimal solution position.
Step 6:
Sort the positions in ascending order according to the size of the fitness and select the better solution as the next-generation flame position (if l = 1, then as the contemporary flame position).
Step 7:
Update f_no by eq 2.
Step 8:
Update the smoothing factor positions by eq 9.
Step 9:
Continue to the next step if the maximum iterations condition (l < T) is satisfied; otherwise, return to Step 5.
Step 10:
Fed the optimized smoothing factor σ into the PNN network for training to obtain the best PNN fault diagnosis model.
Step 11:
Fed the test samples into the network instead of the training samples to get the corresponding data.

Implementation and Experimental Setup

Model Implementation

In this section, we discuss the implementation of the proposed MMFO-based PNN method. Figure 3 shows the implemented framework for fault diagnosis. First, a series of DGA data are sampled during the actual operation of the oil-immersed power transformer, and a portion of the DGA data is randomly selected as training samples and input into the MMFO-PNN model to optimize the training of the neural network and output its fault-type classification results. Then, the remaining DGA data, i.e., the test samples, are used to test the established neural network and verify its effectiveness.

Implemented framework of the power transformer fault diagnosis.

According to the dissolved gas content in power transformer oil, oil-immersed power transformer faults can be classified into four categories, namely, low-temperature (LT) overheating (<150 °C), low-temperature overheating (LT) (150–300 °C), partial discharge (PD), and arc discharge (AD).

Table 1 shows some of the real data of the DGA method used by the China Electric Power Research Institute to determine the fault types of oil-immersed power transformers. In addition, to facilitate the training of the probabilistic neural network, the four fault types of power transformers will be coded in the form shown in Table 2.

Table 1. Some Real Data from China Electric Power Research Institute Diagnosing Power Transformer Fault Types by the DGA Method^a.

	dissolved gas (μL/L)
fault type	CH₄	C₂H₂	C₂H₄	C₂H₆	TH	sources
LT (<150 °C)	83	53	13	1.2	150.2	Jiujiang PSC
LT (150–300 °C)	6.5	98	16	1.5	122	Fuzhou PSC
LT (150–300 °C)	193	191	28	16	428	Yingtan PSC
LT (150–300 °C)	12	46	11	1.8	70.8	Nanchang PSC
LT (150–300 °C)	3.5	31	8.2	1	43.7	Yichun PSC
AD	61	307	105	6	479	Yingtan PSC

Open in a new tab

PSC is the power supply company, TH is the total hydrocarbon, the temperature is 25 °C, the humidity is 50.

Table 2. Coding Format for Different Fault Types.

fault type	LT (<150 °C)	LT (150–300 °C)	PD	AD
coding format	1	0	0	0
	0	1	0	0
	0	0	1	0
	0	0	0	1

Open in a new tab

Data Collection and Preprocessing

Considering the influence of temperature, humidity, and other parameters on power transformer fault diagnosis, we collected several groups of real data of various gas contents in oil of oil-immersed power transformers from various power supply companies and substations in Jiangxi Province. For each group of gas data, we selected some characteristic gas contents (C₂H₂, C₂H₄, CH₄, H₂, C₂H₄, C₂H₆) dissolved in the oil as the main basis for fault-type determination. After screening all of the gas data and processing with the IEC three-ratio method, 525 sets of valid DGA data were obtained, including 333 sets of low-temperature overheating (LT) (<150 °C), 39 sets of low-temperature overheating (LT) (150–300 °C), 65 sets of partial discharge (PD), and 88 sets of arc discharge (AD), where 470 sets of data were used as training samples and the remaining 55 sets of data were used as test samples. Some of the DGA data are shown in Table 3.

Table 3. Partial Sample Data.

dissolved gas (μL/L)
C₂H₂/C₂H₄	CH₄/H₂	C₂H₄/C₂H₆	fault type
0.07143	0.54667	0.03571	LT (<150 °C)
0.03333	0.5	0.03333	LT (<150 °C)
0.07831	0.14945	0.44828	LT (<150 °C)
0.01583	1.68618	0.00179	LT (150–300 °C)
0.04878	1.16761	0.025	LT (150–300 °C)
0.0117	1.13993	0.00514	LT (150–300 °C)
0.05556	0.07524	0.05882	PD
0	0.08595	0	PD
0.06667	0.06754	0.2191	PD
0.16667	0.26891	0.06818	AD
0.27372	0.12839	0.3012	AD
0.26642	0.1946	0.23175	AD

Open in a new tab

Experimental Setting

To effectively evaluate the performance of our proposed method in fault diagnosis of oil-immersed power transformer, we compare MMFO-PNN with 11 other methods. It includes five types of PNN-based fault diagnosis models, namely, PNN, particle swarm optimization (PSO)-PNN, bat algorithm (BA)-PNN,²⁷ genetic algorithm (GA)-PNN, and adaptive (Sa)-PNN;²⁸ it also includes six fault diagnosis methods that have been proposed in previous work and verified by experiments, which are self-adaptive evolutionary extreme learning machine (SaE-ELM),²⁹ modified bat algorithm (MBA)-BP,³⁰ gray wolf optimizer optimized hybrid kernel extreme learning machine (GWO-hybrid ELM),²⁰ genetic algorithm (GA)-SVM,³¹ modified cuckoo search (MCS)-BP,³² and IEC three-ratio method (hereinafter referred to as IEC).³³ Through the MATLAB 2018a simulation platform based on the same DGA data set training and testing to compare their performance. The parameter settings of these 11 methods are shown in Table 4.

Table 4. Parameters Settings of Different Methods.

methods	parameters settings
MMFO-PNN	N = 10, T = 20
Sa-PNN	s_min = 0.1, s_max = 0.8, s_inter = 4.9, T = 20
GA-PNN	P_m = 0.01, P_c = 0.7, N = 10, T = 20
BA-PNN	A = 0.5, r = 0.5, N = 10, T = 20
PSO-PNN	c₁ = 1.49445, c₂ = 1.49445, T = 20
GA-SVM	P_m = 0.01, P_c = 0.9, N = 10, T = 20, k_max = 10³, k_min = 10^–3, p_max = 10³, p_min = 10^–3
GWO-hybrid KELM	N = 10, T = 20
SaE-ELM	N = 10, T = 20, strategy = 1, numst = 4, hidden number = 10
MBA-BP	A = 0.5, r = 0.5, N = 10, T = 20, hidden number = 10
MCS-BP	Pa = 0.25, N = 10, T = 20, hidden number = 10
PNN	σ = 0.07
IEC	non

Open in a new tab

Experimental Results and Discussion

The results of accuracy comparison of different methods based on PNN are shown in Table 5 with 86.09% for PNN and 92.34, 93.68, 95.33, and 95.91% for the remaining four improved algorithms of PNN (PSO-PNN, BA-PNN, GA-PNN, and Sa-PNN), respectively. The MMFO-PNN algorithm achieves 96.15% (25/26) accuracy for LT (<150 °C) fault type, 100% accuracy for the remaining three fault types, and 99.04% average accuracy. MMFO-PNN has the highest accuracy and average accuracy for each fault type, which is significantly better than the remaining five algorithms. Even in the AD fault type where the accuracy is generally low, the accuracy of MMFO-PNN reaches 100%. It fully shows that MMFO is better than PSO, BA, GA, Sa, and other optimization methods for PNN optimization, demonstrating the superiority of MMFO-PNN in power transformer fault diagnosis.

Table 5. Comparison of Different Methods Based on PNN.

	accuracy (%)
fault type	MMFO-PNN	PNN	PSO-PNN	BA-PNN	GA-PNN	Sa-PNN
LT (<150 °C)	96.15	96.15	96.15	96.15	88.46	96.15
LT (150–300 °C)	100.00	62.50	87.50	100.00	100.00	87.50
PD	100.00	100.00	100.00	85.71	100.00	100.00
AD	100.00	85.71	85.71	92.86	92.86	100.00
average	99.04	86.09	92.34	93.68	95.33	95.91

Open in a new tab

To show the prediction results of various PNN models more intuitively, we plotted the classification results of different algorithms, as shown in Figure 4, where a, c, e, g, i, and k are the classification results of training samples and b, d, f, h, j, and l are the classification results of test samples. In Figure 4 (each sample on the x-axis in the subgraph represents a faulty transformer; 1, 2, 3, and 4 on the y-axis are the labels of each type of fault, respectively), we can visually see that the classification output of the remaining five algorithms is not satisfactory; especially in the first, second, and fourth categories of faults, there are several classification errors, but from the k and l subfigures, we can see that MMFO-PNN has a high accuracy for these classification faults; only in the first category of faults, there is a classification error; the other three faults are all correctly classified. The excellent classification performance of MMFO-PNN in the test and classification samples shows that the stability and classification performance of the MMFO-PNN algorithm itself are better than those of the other algorithms, and it is more applicable and reliable in the field of power transformer fault diagnosis.

Classification results of different models. (a), (c), (e), (g), (i), and (k) represent the results of train sample classification for different methods. (b), (d), (f), (h), (j), and (l) are the results of test sample classification for different methods.

The above comparison experiments show that MMFO optimizes PNN better and has better convergence capability than other intelligent algorithms. Next, to demonstrate that the MMFO-PNN model has strong practicality in the field of transformer fault diagnosis, we let the MMFO-PNN model be compared with a variety of transformer fault diagnosis methods proposed by previous authors (including empirical methods and methods in the field of artificial intelligence) to prove the superiority of the proposed model from several perspectives. Among them, MCS-BP, MBA-BP, SaE-ELM, GWO-hybrid KELM, and GA-SVM are the fault diagnosis methods proposed by the researchers combined with artificial intelligence techniques. Through previous experiments, they were proved to be effective methods in the field of transformer fault diagnosis. The IEC three-ratio method is a traditional and commonly used fault diagnosis method for oil-immersed transformers. It is essentially an empirical discrimination method in which the researcher makes an empirical judgment of the fault type based on the range of dissolved gas contents sampled. For the existing fault types in this paper, the IEC three-ratio method is identified, as shown in Table 6.

Table 6. Fault Diagnosis Based on the IEC Three-Ratio Method.

fault type	C₂H₂/C₂H₄	CH₄/H₂	C₂H₄/C₂H₆
LT (<150 °C)	<0.1	0.1–1	1–3
LT (150–300 °C)	<0.1	≥1	<1
PD	<0.1	<0.1	<1
AD	0.1–3	<1	NS

Open in a new tab

The comparison of the diagnostic accuracy of MMFO with the above methods is shown in Table 7, where the diagnostic simulation results of the training set and test set of the artificial intelligence class methods are shown in Figure 5.

Table 7. Comparison Results of Different Methods.

	accuracy (%)
fault type	MMFO-PNN (%)	SaE-ELM (%)	GWO-hybrid KELM (%)	MCS-BP (%)	MBA-BP (%)	GA-SVM (%)	IEC (%)
LT (<150 °C)	96.15	88.00	96.00	92.00	96.00	100.00	77.17
LT (150–300 °C)	100.00	100.00	100.00	100.00	87.50	100.00	97.44
PD	100.00	100.00	100.00	100.00	100.00	71.43	18.47
AD	100.00	100.00	93.33	93.33	93.33	93.33	100.00
average	99.04	97.00	97.33	96.33	94.21	91.19	70.65

Open in a new tab

The comparison results shown in Table 7 indicate that MMFO-PNN still has a significant advantage in terms of accuracy compared with the other six methods. At the same time, combining Tables 5 and 7, we can also see that accuracy of the IEC three-ratio method is much lower than that of AI class methods, which shows that the empirical discriminative approach is not effective in the face of complex concentrated data. Among the artificial intelligence methods, the accuracy of GA-SVM is relatively low. Li and Zhang et al. used the genetic algorithm to filter the parameters of SVM for optimization, which significantly improved the classification accuracy of SVM, but there is still a gap between the SVM classifier and the neural network-based methods for multiclassification problems.

For engineering-type problems like transformer fault diagnosis, efficiency is also an important indicator of fault diagnosis model performance. We use the running time of various fault diagnosis algorithms as a measure of efficiency (simulation platform, MATLAB 2018a; computational platform, i7-9750CPU@2.60GHz).

Also, we use the error rate to measure the ability of fault diagnosis models to identify fault types. As can be seen from Table 8, the time consumption of fault diagnosis models based on artificial intelligence methods is generally higher than that of IEC three-ratio methods (except PNN). This is in accordance with the law of objective facts because machine learning requires sufficient training in identifying the same number of test sets, while the IEC three-ratio method based on empirical discrimination does not require training and can directly discriminate fault types based on the range of the three gas ratios. However, in contrast, the IEC three-ratio method has the highest error rate of all methods. Here, we can also analyze the characteristics of the traditional empirical fault diagnosis method: although it is easy and fast to use, the error rate is too high to meet the needs of today’s industry. In contrast, MMFO-PNN has the lowest error rate and takes only a little more time than PNN and IEC three-ratio methods. Considered together, MMFO-PNN is the fault diagnosis method with the best engineering practical performance among all of the methods involved in the comparison.

Table 8. Efficiency and Error Rate of Different Methods.

method	time (s)	error rate (%)
MCS-BP	18.0127	5.4545
GWO-hybrid KELM	16.3495	3.636
GA-SVM	15.8398	5.4545
MBA-BP	24.2396	5.4545
BA-PNN	14.9398	5.4545
SaE-ELM	13.3365	5.4545
GA-PNN	11.2378	5.4545
PSO-PNN	9.3433	7.273
Sa-PNN	8.3158	3.636
MMFO-PNN	7.2763	1.818
IEC	3.7653	24.76
PNN	2.9606	10.91

Open in a new tab

The error rate is one of the simplest indicators to discriminate the classifier performance. However, the error rate can only calculate the percentage of cases with wrong judgments among all cases, and it does not reflect how the cases with wrong judgments are classified wrong. Therefore, we use the mean square error (MSE) to reflect the dispersion of the classification results. The MSEs of the training and test sets for different methods are shown in Table 9. It is worth noting that since the IEC three-ratio method is not a machine learning class method, there is no need to divide the training and test sets, but to ensure the fairness of the experiments, we use the same training set data and test set data as the machine learning class method for the classification experiments when testing the performance of the IEC three-ratio method.

Table 9. Mean Square Error of Different Methods.

algorithms	MSE of train sample	MSE of test sample
MMFO-PNN	0.0085	0.1636
Sa-PNN	0.0191	0.1818
GWO-hybrid KELM	0.0213	0.1818
MBA-BP	0.0043	0.2000
MCS-BP	0.0085	0.2545
GA-SVM	0.0447	0.3091
PSO-PNN	0.0170	0.3273
BA-PNN	0.0085	0.3455
GA-PNN	0.0021	0.4000
SaE-ELM	0.0468	0.4000
PNN	0.0255	0.5455
IEC	0.4787	0.6545

Open in a new tab

It can be seen in Table 9 that the MSE of MMFO-PNN is the smallest among all methods when identifying the test set data. Therefore, it can be seen that, compared with other methods, MMFO-PNN has strong robustness and generalization ability.

Considering the unbalanced nature of the experimental data (far more fault data of the low-temperature overheating (LT) (<150 °C) type than other types of faults), we use the metric F1-score to evaluate the performance of different fault diagnosis methods. The Marco F1-score for different methods is shown in Table 10.

Table 10. Comparison of Different Machine Learning Methods with the F1-Score.

	Marco F1-score
method	LT (%) (<150 °C)	LT (%) (150–300 °C)	PD (%)	AD (%)	average (%)
MMFO-PNN	98.04	100.00	100.00	96.55	98.65
Sa-PNN	96.15	93.33	100.00	96.55	96.51
GWO-hybrid KELM	97.96	100.00	93.33	93.75	96.26
SaE-ELM	93.62	100.00	94.12	94.44	95.54
GA-PNN	95.83	94.12	100.00	89.66	94.90
MCS-BP	95.83	100.00	90.00	93.75	94.90
BA-PNN	96.15	100.00	92.31	89.66	94.53
MBA-BP	96.15	93.33	93.33	93.75	94.14
GA-SVM	94.34	93.33	90.91	96.55	93.78
PSO-PNN	96.15	82.35	100.00	88.89	91.85
PNN	89.29	76.92	100.00	88.89	88.77

Open in a new tab

When the data samples are unbalanced, the F1-score is more indicative of the practical performance of the classifier compared to the accuracy. Table 10 shows that the Marco F1-score of MMFO-PNN is the highest among all of the fault diagnosis methods, showing the excellent performance of MMFO-PNN in the field of transformer fault diagnosis.

To evaluate the predictive performance of the proposed MMFO-PNN fault diagnosis model, we performed fivefold cross validation on the proposed model and other machine learning models. Nonrepetitive sampling divides the gas ratio data into five randomly. Each time, one of them is selected as the test set, and the remaining four are used as the training set for model training, and the average prediction accuracy of each test set is calculated. After repeating the above process five times, the average of the five sets of test results is calculated as an estimate of the model accuracy. The results are shown in Table 11.

Table 11. Results of Fivefold Cross Validation of Different Machine Learning Models (%).

model	fold 1	fold 2	fold 3	fold 4	fold 5	average accuracy
MMFO-PNN	99.68	99.64	100.00	95.80	99.88	98.40
PNN	78.03	73.89	77.42	82.41	75.00	77.35
PSO-PNN	94.73	95.69	91.25	93.10	93.75	93.70
BA-PNN	93.73	93.98	90.97	93.94	97.17	93.96
GA-PNN	95.38	91.71	98.08	90.36	97.62	94.63
Sa-PNN	93.44	92.39	85.88	93.03	91.16	91.18
SaE-ELM	96.46	97.68	95.59	94.92	91.32	95.19
GWO-hybrid KELM	98.33	98.36	97.67	96.19	97.84	97.68
MCS-BP	97.05	95.43	96.15	93.12	95.86	95.52
MBA-BP	93.12	99.71	92.84	94.58	95.55	95.16
GA-SVM	89.03	87.34	92.05	86.14	90.06	88.92

Open in a new tab

The results show that the performance of the proposed MMFO-PNN model in fivefold cross validation is still better than other machine learning models. The average accuracy of some models after cross validation is lower than the diagnostic accuracy shown in Table 5, but the results of cross validation can better reflect the true performance of the model.

Figure 6 shows the fitness curves of MMFO-PNN when the maximum number of iterations is set to 4, 10, 80, and 100, respectively, and Figure 7 shows its corresponding average classification accuracy. It can be seen that MMFO-PNN converges quickly, and when the maximum number of iterations is 10, it converges to the optimum in the seventh generation, with rapid classification and accurate results, which fully reflects the engineering practicality of MMFO-PNN. When the maximum number of iterations is 80, the sixth generation converges to the optimum. It can also be seen that the initial error of MMFO-PNN is very small and converges to the optimum in just one time, which fully demonstrates its excellent global search and convergence ability. In addition, when the maximum number of iterations is 80, even if it is optimal in the sixth generation, it does not fall into overfitting afterward, which affects the accuracy. Only when the maximum number of iterations is 100, the 85th generation fitness is 0 and the accuracy decreases from 99.038 to 58.036%.

Fitness curves of different iteration times.

Average accuracy of different iterations.

The performance of machine learning algorithm models is often very sensitive to the input parameters of the model. The adjustable input parameters of the MMFO-PNN model we built are the number of populations (N) and the maximum number of iterations (T). In the previous comparison experiments, we uniformly set T to 20 and N to 10 for all models requiring input T and N. The purpose is to control the variables and ensure the fairness of the experiments. The results of the previous experiments (Table 8) show that MMFO-PNN has superior diagnostic performance compared to other models under fair conditions. In this section, we will focus on the effect of input parameters on the performance of the MMFO-PNN model by adjusting the input parameters to make the MMFO-PNN model optimal.

Figures 6 and 7 only show the effect of T on accuracy and do not discuss the effect of N on the overall model performance. To observe the model performance as a whole under different input parameters, we plot the three-dimensional (3D) plot shown in Figure 8 to investigate the effect on MMFO-PNN when the only two input parameters, N and T, are varied simultaneously. We use the fitness defined by eq 15 to judge the model performance; the lower the value, the better the model performance. To give the reader a clearer understanding of the 3D plot shown in Figure 8a, we also plot its corresponding top view, which is Figure 8b. In Figure 8, the bluer the color, the smaller its fitness, and the better the performance of the model. As can be seen, T that is too large leads to a surge in fitness, while T that is too small and N that is small may also lead to a larger fitness. In addition, if N is relatively large, then fitness will increase more quickly with the number of iterations. Also, if N is too small, then fitness will appear to be larger due to a small number of iterations, or even always larger no matter how the number of iterations changes. Combining the above phenomena, we can learn that for both N and T, neither can be too large or too small; otherwise, the model cannot be put in an optimal state. Moreover, if N is chosen appropriately, the fault tolerance for the choice of T will also increase.

Three-dimensional diagram of the change in fitness: (a) main view and (b) top view.

To show more clearly the effect of T and N each on the fitness of the MMFO-PNN model, as shown in Figures 9 and 10, we investigate the effect of another input parameter on the model at T = 10 and N = 3, respectively. It can be seen that neither too large nor too small is good for either N or T, which will lead to the degradation of the model performance. Therefore, the choice of model input parameters is very important to the final learning effect of the model, which is also a common problem in the field of machine learning. Through the above findings, considering that the fault diagnosis model needs to balance the error rate and efficiency, N = 3 and T = 10 can be chosen as the final input parameters for the established MMFO-PNN model.

Trends in fitness values with the number of population (max iteration = 10).

Trends of fitness values with max iteration (number of population = 3).

Conclusions

In this paper, we proposed an MMFO-based PNN as a fault diagnosis method for power transformers using MMFO to optimize the smoothing factor (σ), which is crucial to the performance of PNN, to improve the performance of PNN. For the optimization of PNN, the MMFO algorithm can search the solution space to a greater extent than other optimization algorithms, enhancing the global search and convergence to find a better global optimal solution. We validated these algorithms using real data collected from transformers by evaluating the performance of the algorithmic models. The experimental results show that the proposed MMFO outperforms other algorithms, can effectively enhance the global optimal solution search performance, and has good stability to overcome the perturbation of noisy data, thus improving the fault determination accuracy.

In addition, the developed technique is also applicable to other engineering fields, such as sensor and diesel machine fault diagnosis.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (51765042, 61963026).

Glossary

Abbreviations

AD: arc discharge
ANN: artificial neural network
BA: bat algorithm
DBN: deep belief network
DGA: dissolved gas analysis
ELM: extreme learning machine
FCM: fuzzy c-means
GA: genetic algorithm
GWO: gray wolf optimization
HMM: hidden Markov model
IEC: International Electrotechnical Commission
KELM: kernel extreme learning machine
KNN: k-nearest neighbor
MLP: multilayer perceptron
MBA: modified bat algorithm
MCS: modified cuckoo search
MMFO: modified moth flame optimization
PD: partial discharge
PNN: probabilistic neural network
PSC: power supply company
PSO: particle swarm optimization
QPSO: quantum-inspired particle swarm optimization
RBFNN: radial basis function neural network
Sa: self-adaptive
SaE: self-adaptive evolutionary
SVM: support vector machine
σ: smoothing factor

Author Contributions

Conceptualization, X.Y. and Y.Z.; methodology, Y.Z.; software, L.T.; validation, Y.Z., L.T., and X.Y.; formal analysis, L.Y.; investigation, Y.Z.; resources, X.Y.; writing—original draft preparation, Y.Z. and L.T.; writing—review and editing, Y.Z., L.T., and X.Y.; visualization, Y.Z.; supervision, X.Y. All authors have read and agreed to the published version of the manuscript.

The authors declare no competing financial interest.

References

Yu S.; Zhao D.; Chen W.; Hou H.. Oil-immersed Power Transformer Internal Fault Diagnosis Research Based on Probabilistic Neural Network. ANT/SEIT, 2016.
Chen W.; Pan C.; Yun Y.; Liu Y. Wavelet networks in power transformers diagnosis using dissolved gas analysis. IEEE Trans. Power Delivery 2009, 24, 187–194. 10.1109/TPWRD.2008.2002974. [DOI] [Google Scholar]
Kim J.-W.; Park B.; Jeong S. C.; Kim S. W.; Park P. Fault diagnosis of a power transformer using an improved frequency-response analysis. IEEE Trans. Power Delivery 2005, 20, 169–178. 10.1109/TPWRD.2004.835428. [DOI] [Google Scholar]
Wang X.; Wang Z.; Liu Q.; Dyer P. Dissolved gas analysis of thermal faults in transformer liquids simulated using immersed heating method. IEEE Trans. Dielectr. Electr. Insul. 2018, 25, 1749–1757. 10.1109/TDEI.2018.007158. [DOI] [Google Scholar]
Kari T.; Gao W.; Zhao D.; Zhang Z.; Mo W.; Wang Y.; Luan L. An integrated method of ANFIS and Dempster-Shafer theory for fault diagnosis of power transformer. IEEE Trans. Dielectr. Electr. Insul. 2018, 25, 360–371. 10.1109/TDEI.2018.006746. [DOI] [Google Scholar]
Cong H.; Du S.; Li Q.; Liu J. Electro-thermal fault diagnosis method of rapo vegetable oil transformer based on characteristic gas and ratio criterion. IEEE Access 2019, 7, 101147–101159. 10.1109/ACCESS.2019.2928817. [DOI] [Google Scholar]
Kirkbas A.; Demircali A.; Koroglu S.; Kizilkaya A. Fault diagnosis of oil-immersed power transformers using common vector approach. Electr. Power Syst. Res. 2020, 184, 106346 10.1016/j.epsr.2020.106346. [DOI] [Google Scholar]
Amedi H.; Baghban A.; Ahmadi M. Evolving machine learning models to predict hydrogen sulfide solubility in the presence of various ionic liquids. J. Mol. Liq. 2016, 216, 411–422. 10.1016/j.molliq.2016.01.060. [DOI] [Google Scholar]
Ahmadi M. A.; Ahmadi A. Applying a sophisticated approach to predict CO2 solubility in brines: application to CO2 sequestration. Int. J. Low-Carbon Technol. 2016, 11, 325–332. 10.1093/ijlct/ctu034. [DOI] [Google Scholar]
Ahmadi M.; Ebadi M.; Yazdanpanah A. Robust intelligent tool for estimating dew point pressure in retrograded condensate gas reservoirs: Application of particle swarm optimization. J. Pet. Sci. Eng. 2014, 123, 7–19. 10.1016/j.petrol.2014.05.023. [DOI] [Google Scholar]
Su Q.; Mi C.; Lai L.; Austin P. A fuzzy dissolved gas analysis method for the diagnosis of multiple incipient faults in a transformer. IEEE Trans. Power Syst. 2000, 15, 593–598. 10.1109/59.867146. [DOI] [Google Scholar]
Dhini A.; Surjandari I.; Faqih A.; Kusumoputro B. In Intelligent Fault Diagnosis for Power Transformer Based on DGA Data Using Support Vector Machine (SVM), 2018 3rd International Conference on System Reliability and Safety (ICSRS), 2018; pp 294–298.
Jiang J.; Chen R.; Chen M.; Wang W.; Zhang C. Dynamic fault prediction of power transformers based on hidden Markov model of dissolved gases analysis. IEEE Trans. Power Delivery 2019, 34, 1393–1400. 10.1109/TPWRD.2019.2900543. [DOI] [Google Scholar]
Yu H.; Wu Q.; Lu Y.; Hu C.; Wang Y.; Liu G. In Research on Fault Diagnosis of Power Transformer Equipment Based on KNN Algorithm, International Conference on Mechatronics and Intelligent Robotics, 2017; pp 172–176.
Zhang Y.; Ding X.; Liu Y.; Griffin P. An artificial neural network approach to transformer fault diagnosis. IEEE Trans. Power Delivery 1996, 11, 1836–1841. 10.1109/61.544265. [DOI] [Google Scholar]
Zhang W.; Yang X.; Deng Y.; Li A. An Inspired Machine-Learning Algorithm with a Hybrid Whale Optimization for Power Transformer PHM. Energies 2020, 13, 3143 10.3390/en13123143. [DOI] [Google Scholar]
Huang Y. Evolving neural nets for fault diagnosis of power transformers. IEEE Trans. Power Delivery 2003, 18, 843–848. 10.1109/TPWRD.2003.813605. [DOI] [Google Scholar]
Meng K.; Dong Z. Y.; Wang D. H.; Wong K. P. A self-adaptive RBF neural network classifier for transformer fault analysis. IEEE Trans. Power Syst. 2010, 25, 1350–1360. 10.1109/TPWRS.2010.2040491. [DOI] [Google Scholar]
Dai J.; Song H.; Sheng G.; Jiang X. Dissolved gas analysis of insulating oil for power transformer fault diagnosis with deep belief network. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 2828–2835. 10.1109/TDEI.2017.006727. [DOI] [Google Scholar]
Huang X.; Wang X.; Tian Y. In Research on Transformer Fault Diagnosis Method based on GWO Optimized Hybrid Kernel Extreme Learning Machine, 2018 Condition Monitoring and Diagnosis (CMD), 2018; pp 1–5.
Song T.; Jamshidi M. M.; Lee R. R.; Huang M. A modified probabilistic neural network for partial volume segmentation in brain MR image. IEEE Trans. Neural Netw. 2007, 18, 1424–1432. 10.1109/TNN.2007.891635. [DOI] [PubMed] [Google Scholar]
Kusy M.; Zajdel R. Probabilistic neural network training procedure based on Q (0)-learning algorithm in medical data classification. Appl. Intell. 2014, 41, 837–854. 10.1007/s10489-014-0562-9. [DOI] [Google Scholar]
Mirjalili S. Moth-flame optimization algorithm: A novel nature-inspired heuristic paradigm. Knowledge-Based Syst. 2015, 89, 228–249. 10.1016/j.knosys.2015.07.006. [DOI] [Google Scholar]
Ma L.; Wang C.; gang Xie N.; Shi M.; Ye Y.; Wang L. Moth-flame optimization algorithm based on diversity and mutation strategy. Appl. Intell. 2021, 1–37. [Google Scholar]
Yu Y.; Gao S.; Cheng S.; Wang Y.; Song S.; Yuan F. CBSO: a memetic brain storm optimization with chaotic local search. Memet. Comput. 2018, 10, 353–367. 10.1007/s12293-017-0247-0. [DOI] [Google Scholar]
Specht D. F. Probabilistic neural networks. Neural Networks 1990, 3, 109–118. 10.1016/0893-6080(90)90049-Q. [DOI] [PubMed] [Google Scholar]
Yang X.; Chen W.; Li A.; Yang C.; Xie Z.; Dong H. BA-PNN-based methods for power transformer fault diagnosis. Adv. Eng. Inform. 2019, 39, 178–185. 10.1016/j.aei.2019.01.001. [DOI] [Google Scholar]
Yi J.-H.; Wang J.; Wang G.-G. Improved probabilistic neural networks with self-adaptive strategies for transformer fault diagnosis problem. Adv. Mech. Eng. 2016, 8, 1–13. 10.1177/1687814015624832. [DOI] [Google Scholar]
Li S.; Wu G.; Gao B.; Hao C.; Xin D.; Yin X. Interpretation of DGA for transformer fault diagnosis with complementary SaE-ELM and arctangent transform. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 586–595. 10.1109/TDEI.2015.005410. [DOI] [Google Scholar]
Dong H.; Yang X.; Li A.; Xie Z.; Zuo Y. Bio-Inspired PHM Model for Diagnostics of Faults in Power Transformers Using Dissolved Gas-in-Oil Data. Sensors 2019, 19, 845 10.3390/s19040845. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li J.; Zhang Q.; Wang K.; Wang J.; Zhou T.; Zhang Y. Optimal dissolved gas ratios selected by genetic algorithm for power transformer fault diagnosis based on support vector machine. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 1198–1206. 10.1109/TDEI.2015.005277. [DOI] [Google Scholar]
Li A.; Yang X.; Dong H.; Xie Z.; Yang C. Machine Learning-Based Sensor Data Modeling Methods for Power Transformer PHM. Sensors 2018, 18, 4430 10.3390/s18124430. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu Z.; Song B.; Li E.; Mao Y.; Wang G. Study of ”code absence” in the IEC three-ratio method of dissolved gas analysis. IEEE Electr. Insul. Mag. 2015, 31, 6–12. 10.1109/MEI.2015.7303257. [DOI] [Google Scholar]

[ref1] Yu S.; Zhao D.; Chen W.; Hou H.. Oil-immersed Power Transformer Internal Fault Diagnosis Research Based on Probabilistic Neural Network. ANT/SEIT, 2016.

[ref2] Chen W.; Pan C.; Yun Y.; Liu Y. Wavelet networks in power transformers diagnosis using dissolved gas analysis. IEEE Trans. Power Delivery 2009, 24, 187–194. 10.1109/TPWRD.2008.2002974. [DOI] [Google Scholar]

[ref3] Kim J.-W.; Park B.; Jeong S. C.; Kim S. W.; Park P. Fault diagnosis of a power transformer using an improved frequency-response analysis. IEEE Trans. Power Delivery 2005, 20, 169–178. 10.1109/TPWRD.2004.835428. [DOI] [Google Scholar]

[ref4] Wang X.; Wang Z.; Liu Q.; Dyer P. Dissolved gas analysis of thermal faults in transformer liquids simulated using immersed heating method. IEEE Trans. Dielectr. Electr. Insul. 2018, 25, 1749–1757. 10.1109/TDEI.2018.007158. [DOI] [Google Scholar]

[ref5] Kari T.; Gao W.; Zhao D.; Zhang Z.; Mo W.; Wang Y.; Luan L. An integrated method of ANFIS and Dempster-Shafer theory for fault diagnosis of power transformer. IEEE Trans. Dielectr. Electr. Insul. 2018, 25, 360–371. 10.1109/TDEI.2018.006746. [DOI] [Google Scholar]

[ref6] Cong H.; Du S.; Li Q.; Liu J. Electro-thermal fault diagnosis method of rapo vegetable oil transformer based on characteristic gas and ratio criterion. IEEE Access 2019, 7, 101147–101159. 10.1109/ACCESS.2019.2928817. [DOI] [Google Scholar]

[ref7] Kirkbas A.; Demircali A.; Koroglu S.; Kizilkaya A. Fault diagnosis of oil-immersed power transformers using common vector approach. Electr. Power Syst. Res. 2020, 184, 106346 10.1016/j.epsr.2020.106346. [DOI] [Google Scholar]

[ref8] Amedi H.; Baghban A.; Ahmadi M. Evolving machine learning models to predict hydrogen sulfide solubility in the presence of various ionic liquids. J. Mol. Liq. 2016, 216, 411–422. 10.1016/j.molliq.2016.01.060. [DOI] [Google Scholar]

[ref9] Ahmadi M. A.; Ahmadi A. Applying a sophisticated approach to predict CO2 solubility in brines: application to CO2 sequestration. Int. J. Low-Carbon Technol. 2016, 11, 325–332. 10.1093/ijlct/ctu034. [DOI] [Google Scholar]

[ref10] Ahmadi M.; Ebadi M.; Yazdanpanah A. Robust intelligent tool for estimating dew point pressure in retrograded condensate gas reservoirs: Application of particle swarm optimization. J. Pet. Sci. Eng. 2014, 123, 7–19. 10.1016/j.petrol.2014.05.023. [DOI] [Google Scholar]

[ref11] Su Q.; Mi C.; Lai L.; Austin P. A fuzzy dissolved gas analysis method for the diagnosis of multiple incipient faults in a transformer. IEEE Trans. Power Syst. 2000, 15, 593–598. 10.1109/59.867146. [DOI] [Google Scholar]

[ref12] Dhini A.; Surjandari I.; Faqih A.; Kusumoputro B. In Intelligent Fault Diagnosis for Power Transformer Based on DGA Data Using Support Vector Machine (SVM), 2018 3rd International Conference on System Reliability and Safety (ICSRS), 2018; pp 294–298.

[ref13] Jiang J.; Chen R.; Chen M.; Wang W.; Zhang C. Dynamic fault prediction of power transformers based on hidden Markov model of dissolved gases analysis. IEEE Trans. Power Delivery 2019, 34, 1393–1400. 10.1109/TPWRD.2019.2900543. [DOI] [Google Scholar]

[ref14] Yu H.; Wu Q.; Lu Y.; Hu C.; Wang Y.; Liu G. In Research on Fault Diagnosis of Power Transformer Equipment Based on KNN Algorithm, International Conference on Mechatronics and Intelligent Robotics, 2017; pp 172–176.

[ref15] Zhang Y.; Ding X.; Liu Y.; Griffin P. An artificial neural network approach to transformer fault diagnosis. IEEE Trans. Power Delivery 1996, 11, 1836–1841. 10.1109/61.544265. [DOI] [Google Scholar]

[ref16] Zhang W.; Yang X.; Deng Y.; Li A. An Inspired Machine-Learning Algorithm with a Hybrid Whale Optimization for Power Transformer PHM. Energies 2020, 13, 3143 10.3390/en13123143. [DOI] [Google Scholar]

[ref17] Huang Y. Evolving neural nets for fault diagnosis of power transformers. IEEE Trans. Power Delivery 2003, 18, 843–848. 10.1109/TPWRD.2003.813605. [DOI] [Google Scholar]

[ref18] Meng K.; Dong Z. Y.; Wang D. H.; Wong K. P. A self-adaptive RBF neural network classifier for transformer fault analysis. IEEE Trans. Power Syst. 2010, 25, 1350–1360. 10.1109/TPWRS.2010.2040491. [DOI] [Google Scholar]

[ref19] Dai J.; Song H.; Sheng G.; Jiang X. Dissolved gas analysis of insulating oil for power transformer fault diagnosis with deep belief network. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 2828–2835. 10.1109/TDEI.2017.006727. [DOI] [Google Scholar]

[ref20] Huang X.; Wang X.; Tian Y. In Research on Transformer Fault Diagnosis Method based on GWO Optimized Hybrid Kernel Extreme Learning Machine, 2018 Condition Monitoring and Diagnosis (CMD), 2018; pp 1–5.

[ref21] Song T.; Jamshidi M. M.; Lee R. R.; Huang M. A modified probabilistic neural network for partial volume segmentation in brain MR image. IEEE Trans. Neural Netw. 2007, 18, 1424–1432. 10.1109/TNN.2007.891635. [DOI] [PubMed] [Google Scholar]

[ref22] Kusy M.; Zajdel R. Probabilistic neural network training procedure based on Q (0)-learning algorithm in medical data classification. Appl. Intell. 2014, 41, 837–854. 10.1007/s10489-014-0562-9. [DOI] [Google Scholar]

[ref23] Mirjalili S. Moth-flame optimization algorithm: A novel nature-inspired heuristic paradigm. Knowledge-Based Syst. 2015, 89, 228–249. 10.1016/j.knosys.2015.07.006. [DOI] [Google Scholar]

[ref24] Ma L.; Wang C.; gang Xie N.; Shi M.; Ye Y.; Wang L. Moth-flame optimization algorithm based on diversity and mutation strategy. Appl. Intell. 2021, 1–37. [Google Scholar]

[ref25] Yu Y.; Gao S.; Cheng S.; Wang Y.; Song S.; Yuan F. CBSO: a memetic brain storm optimization with chaotic local search. Memet. Comput. 2018, 10, 353–367. 10.1007/s12293-017-0247-0. [DOI] [Google Scholar]

[ref26] Specht D. F. Probabilistic neural networks. Neural Networks 1990, 3, 109–118. 10.1016/0893-6080(90)90049-Q. [DOI] [PubMed] [Google Scholar]

[ref27] Yang X.; Chen W.; Li A.; Yang C.; Xie Z.; Dong H. BA-PNN-based methods for power transformer fault diagnosis. Adv. Eng. Inform. 2019, 39, 178–185. 10.1016/j.aei.2019.01.001. [DOI] [Google Scholar]

[ref28] Yi J.-H.; Wang J.; Wang G.-G. Improved probabilistic neural networks with self-adaptive strategies for transformer fault diagnosis problem. Adv. Mech. Eng. 2016, 8, 1–13. 10.1177/1687814015624832. [DOI] [Google Scholar]

[ref29] Li S.; Wu G.; Gao B.; Hao C.; Xin D.; Yin X. Interpretation of DGA for transformer fault diagnosis with complementary SaE-ELM and arctangent transform. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 586–595. 10.1109/TDEI.2015.005410. [DOI] [Google Scholar]

[ref30] Dong H.; Yang X.; Li A.; Xie Z.; Zuo Y. Bio-Inspired PHM Model for Diagnostics of Faults in Power Transformers Using Dissolved Gas-in-Oil Data. Sensors 2019, 19, 845 10.3390/s19040845. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref31] Li J.; Zhang Q.; Wang K.; Wang J.; Zhou T.; Zhang Y. Optimal dissolved gas ratios selected by genetic algorithm for power transformer fault diagnosis based on support vector machine. IEEE Trans. Dielectr. Electr. Insul. 2016, 23, 1198–1206. 10.1109/TDEI.2015.005277. [DOI] [Google Scholar]

[ref32] Li A.; Yang X.; Dong H.; Xie Z.; Yang C. Machine Learning-Based Sensor Data Modeling Methods for Power Transformer PHM. Sensors 2018, 18, 4430 10.3390/s18124430. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref33] Liu Z.; Song B.; Li E.; Mao Y.; Wang G. Study of ”code absence” in the IEC three-ratio method of dissolved gas analysis. IEEE Electr. Insul. Mag. 2015, 31, 6–12. 10.1109/MEI.2015.7303257. [DOI] [Google Scholar]

PERMALINK

Novel Probabilistic Neural Network Models Combined with Dissolved Gas Analysis for Fault Diagnosis of Oil-Immersed Power Transformers

Yichen Zhou

Lingyu Tao

Xiaohui Yang

Li Yang

Abstract

Introduction

Proposed Method

IEC Ratio Methods

Modified Moth Flame Optimization Algorithm

PNN Optimized by MMFO

Figure 1.

Figure 2.

Implementation and Experimental Setup

Model Implementation

Figure 3.

Table 1. Some Real Data from China Electric Power Research Institute Diagnosing Power Transformer Fault Types by the DGA Methoda.

Table 2. Coding Format for Different Fault Types.

Data Collection and Preprocessing

Table 3. Partial Sample Data.

Experimental Setting

Table 4. Parameters Settings of Different Methods.

Experimental Results and Discussion

Table 5. Comparison of Different Methods Based on PNN.

Figure 4.

Table 6. Fault Diagnosis Based on the IEC Three-Ratio Method.

Table 7. Comparison Results of Different Methods.

Figure 5.

Table 8. Efficiency and Error Rate of Different Methods.

Table 9. Mean Square Error of Different Methods.

Table 10. Comparison of Different Machine Learning Methods with the F1-Score.

Table 11. Results of Fivefold Cross Validation of Different Machine Learning Models (%).

Figure 6.

Figure 7.

Figure 8.

Figure 9.

Figure 10.

Conclusions

Acknowledgments

Glossary

Abbreviations

Author Contributions

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Table 1. Some Real Data from China Electric Power Research Institute Diagnosing Power Transformer Fault Types by the DGA Method^a.