Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2023 Jun 6:1–13. Online ahead of print. doi: 10.1007/s00170-023-11602-y

Manufacturing industry based on dynamic soft sensors in integrated with feature representation and classification using fuzzy logic and deep learning architecture

Shakir Khan 1,2,, Tamanna Siddiqui 3, Azrour Mourade 4, Bayan Ibrahimm Alabduallah 5,, Saad Abdullah Alajlan 1, Abrar almjally 1, Bader M Albahlal 1, Amani Alfaifi 1
PMCID: PMC10243703  PMID: 37360660

Abstract

Soft sensors are data-driven devices that allow for estimates of quantities that are either impossible to measure or prohibitively expensive to do so. DL (deep learning) is a relatively new feature representation method for data with complex structures that has a lot of promise for soft sensing of industrial processes. One of the most important aspects of building accurate soft sensors is feature representation. This research proposed novel technique in automation of manufacturing industry where dynamic soft sensors are used in feature representation and classification of the data. Here the input will be data collected from virtual sensors and their automation-based historical data. This data has been pre-processed to recognize the missing value and usual problems like hardware failures, communication errors, incorrect readings, and process working conditions. After this process, feature representation has been done using fuzzy logic-based stacked data-driven auto-encoder (FL_SDDAE). Using the fuzzy rules, the features of input data have been identified with general automation problems. Then, for this represented features, classification process has been carried out using least square error backpropagation neural network (LSEBPNN) in which the mean square error while classification will be minimized with loss function of the data. The experimental results have been carried out for various datasets in automation of manufacturing industry in terms of computational time of 34%, QoS of 64%, RMSE of 41%, MAE of 35%, prediction performance of 94%, and measurement accuracy of 85% by proposed technique.

Keywords: Soft sensors, Deep learning, Automation, FL_SDDAE, Classification, LSEBPNN

Introduction

Soft sensor is a virtual inferential prediction method that uses easily measured variables to forecast process variables that are difficult to measure directly due to technological, economic constraints as well as a complex environment. Soft sensor attempts to construct a regression prediction method between easily measured variables as well as difficultly measured variables, which is used to address issue that hinders measurements from being used as feedback signals in quality control methods [1]. For at least 10 years, there has been a growing trend in use of data-driven AI (artificial intelligence) approaches to enhance machines, processes, and products across several industrial domains [2]. In recent years, reducing emissions as a result of stronger environmental restrictions has also been a major motivator [3]. However, gathering the data required for such approaches is fraught with difficulties, one of which is the long life of industrial gear. Official depreciation estimates range from (rarely) 6 to more than 30 years, depending on the country, type of machinery, and industrial sector [4]. Experience suggests that, particularly in small and medium-sized businesses, resilient equipment can last even longer in regular usage. Soft sensor approaches are used more widely in industrial processes, and they have become a key emerging trend in both academics as well as industry [5]. Early academics proposed model predictive control like generalized predictive control, dynamic matrix predictive control and model control method, in light of model prediction in industrial production process [6]. However, these soft sensor prediction approaches have several flaws. ANN (Artificial neural networks), rough set, SVM (support vector machine) and hybrid techniques are some AI and ML methods based on data-driven technologies that have been proposed to solve issues where it is difficult to measure key processes as well as quality variables for soft sensor methods as a result of DL in soft sensor control method as well as continuous progress in engineering technology [7].

The contribution of this research is as follows:

  • To design novel techniques in automation of manufacturing industry where the dynamic soft sensors are used in feature representation and classification of the data

  • To collect the data cloud storage and create the virtual sensors dataset based on gear fault detection, spindle fault detection, and bearing fault detection in automation industry

  • To represent the feature using fuzzy logic-based stacked data driven auto-encoder (FL_SDDAE) where the features of input data have been identified with general automation problems.

  • Then, the features have been classified using least square error backpropagation neural network (LSEBPNN) in which the mean square error while classification will be minimized with loss function of the data

  • Here the experimental results have been carried out in terms of QoS, measurement accuracy, RMSE, MAE, prediction performance, and computational time.

Research organization is as follows. In Section 2, related works are described. Section 3 gives details of proposed method. proposed method performance, and the results are present in Section 4. Finally, Section 5 concludes the work.

Related works

DL-based techniques are recently exhibited solid representation competency and success in a variety of computer science domains, including image processing, computer vision, NLP, and more [8]. Stack autoencoder (SAE) [9], DBN (deep belief network) [10], CNN [11], and LSTM [12] are some of widely utilized deep network architectures. Greedy layer-wise unsupervised pre-training, as well as supervised fine-tuning, are highly important for DL architectures like SAE. The SAE weights evaluated during unsupervised pre-training step are used in supervised fine-tuning stage, which is a more significant method than random weight initialization [13]. As a result, various industrial applications of soft sensors based on SAE [14] are presented. Same authors improved this result significantly by utilizing a TDNN in [15].Mean error dropped to just 1.14 to 1.32% and 1.65° to 3.08°, in the same conditions utilized in [16, 17]. As a result, the type of network used in these two papers had a significant impact on the algorithms’ performance. In [18], an RNN is presented that collects information regarding air–fuel ratio λ, ignition angle, and turbocharger boost pressure in addition to rotational speed signal. Focus was on neural network design, which had a significant impact on the algorithm’s performance. To estimate cylinder pressure curves, [19] uses a NN with RBF (radial basis functions) as well as consequently no recurrence. Authors of [20] presents a novel convolutional, BiGRU, and Capsule network-based deep learning model, HCovBi-Caps, to classify the hate speech and authors of [21] introduce BiCHAT: a novel BiLSTM with deep CNN and hierarchical attention-based deep learning model for tweet representation learning toward hate speech detection. Authors of [15] do not use raw rotational speed signal, but instead translate it into frequency domain as well as process only first 20 harmonics, to earlier research are used an RBF network. They also employ structure-borne sound signal’s 21st–50th harmonics. As a result, the preparation of the given data is the most important aspect of this project. The typical errors for pMax as well as its position in crank angle range are 3.4% and 1.5°, respectively. Using a multi-layer perceptron, [22] predicts combustion parameters directly from crankshaft’s rotational speed as well as acceleration data, in contrast to the previous studies (MLP). The mean error lies between 1.38° and 9.1°, with a range of 4.1 to 8.0%. A deep learning-based R2DCNNMC model is proposed for detection and classification of COVID-19 employed chest X-ray images data [23]. Privacy of data driven uses on the k-anonymity and l-diversity supervised models classifies the healthcare data [24]. Virtualization for dynamics on cloud for network operation and management is discussed in [25] and proposed hybrid model on cloud ensures the maximum benefits from virtualization.

The effective implementations of SAE-based DL listed above reveal a significant capacity to extract features. Deep structures exceed typical soft-sensing prediction performance thanks to unsupervised layer-wise pre-training as well as supervised fine-tuning processes. Proposed industrial soft sensors are static methods based on notion of a static process as well as steady-state. However, the inherently dynamic nature of industrial processes cannot be neglected. Chemical processes, for example, are highly dynamic, with current state being linked to earlier ones. As a result, time-related characteristics of time-series recorded data are important.

System model

This section discusses the proposed design in automation of manufacturing industry based on dynamic soft sensors. Here the data has been processed to recognize the missing value and usual problems like hardware failures, incorrect readings, communication errors, process working conditions. Then their features have been represented in module 1 and the represented feature has been classified in module 2 using deep learning techniques. The overall research architecture is given in Fig. 1.

Fig. 1.

Fig. 1

Overall Proposed diagram for virtual sensor-based fault detection in automation industry

Feature representation using fuzzy logic based stacked data driven auto-encoder (FL_SDDAE)

ENotice is replaced Eq. (2) represents overall input–output transfer function about general autoencoder (AE) structure. The input xαRd is supplied to hidden layer, whose output is utilized to reconstruct xα input through output layer (y) as shown in Eq. (1).

xα=yW,bhW,bxαxα 1

An encoder or recognition model is another name for this approach. Optimize variational parameters φ such that as shown in Eq. (2):

qϕzxpθzx 2

As stated in Eq. (3), inference models are any directed graphical model.

qϕzxz1,,zMx=j=1MqϕzjPazj,x 3

In the directed graph, Pazj is set of parent variables of variable zj.

logpθx=Eqϕzxlogpθx=Eqϕzxlogpθx,zpθzx=Eqϕzxlogpθx,zqϕzxqϕzxpθzx=Eqϕzxlogpθx,zqϕzx=Lθ,ϕx+Eqϕzxlogqϕzxpθzx=DKLqϕzxpθzx 4

The non-negative Kullback–Leibler (KL) divergence between qϕzx and pθzx is the second term in Eq. (5):

DKLqϕzxpθzx0 5

The variational lower bound, commonly known as ELBO, is the first term in Eq. (6):

Lθ,ϕ(x)=Eqϕzxlogpθ(x,z)-logqϕzx 6

Because the KL divergence is non-negative, ELBO shows a lower bound on data’s log-likelihood, as demonstrated in Eq. (7).

Lθ,ϕx=logpθx-DKLqϕzxpθzxlogpθx 7
θLθ,ϕx=θEqϕzxlogpθx,z-logqϕzx=Eqϕzxθlogpθx,z-logqϕzxθlogpθx,z-logqϕzx=θlogpθx,z 8

Because the ELBO’s expectation is taken qϕzx, which is a function of φby Eq. (9):

ϕLθ,ϕx=ϕEqϕzxlogpθx,z-logqϕzxEqϕzxϕlogpθx,z-logqϕzx 9

Apply a reparameterization approach to compute unbiased estimates of ϕLθ,ϕx, in case of continuous latent variables.

Replace an expectation w.r.t. qϕzx with one w.r.t. pθ via reparameterization given by Eq. (10).

Lθ,ϕlogx=Eqϕzxlogpθx,z-logqϕzx=Epϵlogpθx,z-logqϕzx 10
ϵpϵz=gϕ,x,ϵLθ,ϕx=logpθx,z-logqϕzxEpϵθ,ϕLθ,ϕx;ϵ=Epϵθ,ϕlogpθx,z-logqϕzx=θ,ϕEpϵlogpθx,z-logqϕzx=θ,ϕLθ,ϕx 11

A simple factorized vGaussian encoder by Eq. (12)

qϕzx=Nz;μ,diagσ2:μ,logσ=EncoderNeuralNetϕxqϕzx=iqϕzx=iNzi;μi,σi2
z=μ+σϵ 12

The log determinant of the Jacobian is given by Eq. (13):

logdϕx,ϵ=logdetzϵ=ilogσi 13

and the posterior density is given by Eq. (14):

logqϕzx=logpϵ-logdϕx,ϵ=ilogNϵi;0,1-logσiwhenz=gϵ,ϕ,x 14

From Eq. (15)

Σ=Ez-Ezz-EzT=ELϵLϵT=LEϵϵTLT=LLT 15

Let Gx be defined: X ⊂ Rn → R, that is, a function on compact set X = α1,1 × … × [αnn] and analytic formula of Gx be unknown.

Define Njj=1,2,,n fuzzy sets Aj1,Aj2,,AjNjαj,βj, which are normal, consistent, and complete with triangular MFs μAj1xj;aj1,bj1,cj1,,μAjNjxj;ajNj,bjNj,cjNj, and Aj1<Aj2<<AjNj with aj1=bj1=αj and bjNj=cjNj=βj, which,

  • e11=α1,e1N1=β1, and e1j=b1j for j=2,3,,N1-1, -e21=α2,e2N2=β2, and e1j=b2j for j=2,3,,N2-1,:en1=αn,enNn=βn, and e1j=b1 for j=2,3,,Nn-1.

  • Construct I=N1×N2××Nn fuzzy if–then rules in following form:

  • RXj1-jn:IFx1 is A1j1 and x2 is A2j2 and … and xn is Anjn Then y is Bj1-jn, where j1=1,2,,N1,j2=1,2,,N2,,jn=1,2,,Nn, and center of the fuzzy set Bj1/n, denoted by y1/n, is chosen as Eq. (16):

  • yj1/n=Ge1j1,,enjn
    ϑl=τμA11-j-in,ix1,μA21-jn,ix2,,μAnj1-jn,ixn 16

Therefore, from μB4¯y=tϑi,μBiy,yR, fuzzy inference produces fuzzy set of output by: μB/1-jn,A-y=tϑi,μBj1-jn,iyyR.tϑi,μBj1-jn,iyyR. μB/1-/n(y)=sμB/1-1n,1(y),μB/1-jn,2y,,μB/1-jn,(y). Inline graphic , where aji are parameters, and are evaluated by LSM.

μQIMx,y=minμA1x,μA2y,QIMX×YμBy=maxisupxXminμAx,μA1x1,,μAntxn,μBiyμAx=1ifx=x0otherwicey=i=1lyiwli=1Iwl 17

Since the fuzzy sets Aj1,,AjNj are complete at every xX, then there exist j1,j2,,jn such that: minμA1x1,μA22x2,,μAnnxn0. Let fx be fuzzy system in (13) and Gx be unknown function in (18). If Gx is continuously differentiable on X=α1,β1×α2,β2××αn,βn, then:

G-fGx1h1+Gx2h2++Gxnhn. 18

where infinite norm . is given as: dx=supxXdx and hj=max1kNjejk+1-ejk,(j=1,2,,nLetXj1/n_=e1j1,e1j1+1×e2j2,e2j2+1××enjn,enjn+1, where j1=1,2,,N1-1,j2=1,2,,N2-1,, jn=1,2,,Nn-1. Since αj,βj=ej1,ej2ej2,ej3ejNj-1,ejNj,j=1,2,,n. From Eq. (19):

fx=k1=j1j1+1kn=jnjn+1y¯k1.knmμA1k1x1,μA2k2x2,,μAnknxnk1=j1j1+1kn=jnjn+1mμA1k1x1,μA2k2x2,,μAnknxn 19

From (20), (21), (22), we obtain:

fx=k1=j1j1+1kn=jnjn+1mμA1k1x1,,μAnknxnk1=j1j1+1kn=jnjn+1mμA1k1x1,,μAnknxnGe1k1,,enkn 20
k1=j1j1+1kn=jnjn+1mμA1k1x1,,μAnknxnk1=j1j1+1kn=jnjn+1mμA1k1x1,,μAnknxn=1 21
G(x)-fxk1=j1j1+1kn=jnjn+1mμA1k1x1,,μAnknxnk1=j1j1+1kn=jnjn+1mμA1k1x1,,μAnknxnmaxk1=j1+1Gx-Ge1k1,,enknGx-Ge1k1,,enkn 22

From the Mean Value kn=fn:/n+1

From the Mean Value model is given (23) as:

Gx-fxmaxk1=11j1+1Gx1x1-e1k1+Gx2x2-e2k2++Gxnxn-enkn 23

Since xXj1-jn, means that x1e1j1,e1j1+1,x2e2j2,e2j2+1xnenjn,enjn+1, have by Eq. (24),

x1-e1k1e1j1+1-e1j1,x2-e2k2e2j2+1-e2j2,andxn-enknenjn+1-enjnfork1=j1,j1+1,k2=j2,j2+1,,andkn=jn,jn+1 24

Then, (25) becomes:

Gx-fxGx1e1j1+1-e1j1+Gx2e2j2+1-e2j2++Gxnenjn+1-enjnSincedx=supxXdxthenG-f=supxXG-f,weget:G-fGx1max11N1-1e1j1+1-e1j1++Gxn1maxnIn-1enjn+1-enjnG-fGx1h1+Gx2h2++Gxnhn 25

From (26), conclude that fuzzy systems in form.

Gx1,Gx2,,Gxn are finite numbers for any given ε>0, select h1,h2,,hn small enough such that Gx1h1+Gx2h2++Gxnhn<ε. Hence from (27):

supxXG-f=G-f<ε 27

We can see from (28) that we need to know the boundaries of the derivatives of G(x) about x1,x2,,xn to represent a fuzzy system with a pre-specified accuracy.

Gx1,Gx2,,Gxn 28

Select a fuzzy method with a MIS, an SF, a CADand a Triangular MF, which then derive using Eq. (29).

fx=i=1lyiminjμAjxji=1lminjμAjxj=i=1lyiminjmaxminvjxj-ajibjl-aj,cji-xjcjl-bjl,0i=1lminvjmaxminjxj-ajlbjl-aji,cji-xjcjl-bjl,0 29

The more rules you have, the more parameters you will have and the more computation you will have to do, but you will get better accuracy. When initial parameters yi (0), aji (0), bji (0), cji (0) are specified, the fuzzy system becomes by Eq. (30).

fx=j1=1N1jnNnyj1-jn0mmminkxk0p-akj1-jn0bkj1j2j30-akj1j2j30ckj1-jn0-xk0pckj1-jn0-bkj1-jn0,0j1=1N1jnNnmmminkxk0p-akj1-jn0bkj1,jn0-akj1-jn0ckj1-jn0-xk0pckj1-jn0-bkj1,jn0,0 30

for a sigmoid activation function, it gives by Eq. (31):

hlγt=11+exp-u=1dwl,uxφ(u)t+blγtwl,u=f=1Nfp=1Pq=1QKp,qfwl,u[γ]YutYi,jt=f=1Nfp=1Pq=1QKp,qfxφ(u)thlγt=σu=1dwl,uγYut+blγt,l1,,sykT=ΨkThγ1,,hγT,k{1,,r}hlρ=σk=1rwl,kρykT+blρ,l1,,r 31

Thus, if consider X=0,,0,blγt=0t1,,T, the Taylor series expansion of hlγt is given by Eq. (32):

hlγthlγtX+hlγtXt=12+u=1dhlγtXxφutxφut=12+u=1dwl,uxφut 32

with w′ l u, given by Eq. (33), and Xt=xφ1t,,xφd,t being a column vector of the input at time t. Let H=hγ(1),,hγT,forX=X then:

H=HX=12s,,12sT 33

where s is number of hidden neurons. Taylor series expansion of Ψk T isgiven by Eq. (34):

ykT=ΨkTHΨkTHH-H=ΨkTH+t=1Tu=1sΨKTHhuγkhuγt-12 34

By replacing huγt of Eq. (35) with hlγt of Eq. (36):

ΨkTHΨkTH+14t=1Tu=1sν=1dΨkTHhuγ(t)wl,νxφvt 35

Finally, by substituting

hlρ=σk=1rwl,keΨkTH+14t=1Tu=1sν=1dΨkTHhuγtwl,νxφvt+blρhlϱ=σk=1rwl,kρΨkTH+14k=1r(ν=1du=1sΨKTHhuY1wl,νwl,k[ρ]xφν1wl,ν1+,··+ν=1du=1sΨΨHhuTwl,νwl,kρxφvTwl,νT+blρ 36
wl,νt=u=1sf=1Nfp=1Pq=1QΨkTXhuγtKp,qfwl,νγwl,kϱ 37

derived features wl,νt through summations on indexes f, p, and q combine features wl,νγ and wl,kρ extracted from both Fuzzy based SAEs and gives compact representation of input over time.

Least square error back propagation neural network (LSEBPNN)

Let, training set in a C-class issue contains vector pairs x1,y1,x2,y2,,xP,yP where xpRN refers to pth input pattern and yptc,c=1,2,,C;tcRc refers to target output of c network corresponding to this input.

All weights and bias terms are included in LSEBPNN’s adaptive parameters. The training phase’s main aim is to establish the best weights and bias terms for minimizing difference between network output as well as target output. The difference is referred regarded as the network’s training error. MSE for pth input pattern in the traditional BP technique is Ep=12k=1Ctpk-opk2. It shows that an input pattern’s target value could be several. To put it another way, any input pattern can have any target value with any membership value. To put it another way, the training problem can be thought of as a fuzzy constraint fulfillment problem.Suggested network modifies parameters throughout training phase to ensure that these limitations are overcome as efficiently as possible. The constraints for pth input pattern are stated mathematically as fuzzy MSE term, which is given by Eq. (38)

.f=12k=1Cc=1Cμcqxptck-opko2 38

The learning laws for networks are derived using same approach as traditional BP technique. Suppose that the weight update, Dw, happens after each input pattern has been presented. Assuming that all weight changes in network are made with same learning-rate parameter h, weight changes applied to weights w and w are k j ji determined according to the gradient-descent rules by Eq. (39), (40):

Δwkjo=-ηEpfwkjoandΔwjih=-ηEpfwjih 39
Δwkjo=ημkqxp-c=1Cμcqxpopko×opko1-opkoopjh=ηδpkoopjh 40

where by Eq. (41)

δpko=μkqxp-c=1Cμcqxpopkoopko1-opko 41

Again, from Eq. (42),

Δwjih=ηfjhnetpjhxpik=1Cμkq(xp-c=1Cμcq(xpopkoopko1-opkowkjo=ηfjhnetpjhxpik=1Cδpkowkjo=ηδpjhxpi, 42

where by Eq. (43)

δpjh=fjhnetpjhk=1Cδpkowkjo. 43

In many circumstances, the traditional BP technique may not converge quickly, when classes overlap. Because ambiguous vectors are assigned full weightage in one class, this is case. In suggested version, error to be back propagated is given more weight in the case of nodes with higher membership values.

The learning algorithm’s purpose is to reduce the squared error cost function, which is given by Eq. (44)

jis=12q=1mdi,qs-vi(s)2 44

Equation (45), where m is total number of vectors in training data set given by

jis=12q=1mdi,qs-wist.xout.q(s-1)2 45

Partial derivative about w i (s) and equate it to zero to determine weight vector that minimizes cost function given by Eq. (46).

jiswis=q=1m-di,qsxout.qs-1+xout.qs-1xout.qs-1tis=0 46
cis=q=1mxout.q(s-1)xout.q(s-1)tpi(s)=di,q(s)xout.q(s-1) 47

In vector matrix form, Eq. (48) are rearranged as

ciswis=pis 48

wis is weight vector to ith linear combiner in sth layer, Eq. (49) is given as deterministic normal equation

wis=cs-1pis 49

By equating partial derivative of performance index wik(n) and setting it equal to zero, the performance index is minimised (50)

Jnwikn=&2t=1nλn-t×j=1NLεj,RLtεj,RLtwi(k)n+εj,ILtεj,ILtwi(k)n=&-2t=1nλn-t×j=1NLζj,RLtεj,RLtyj,RLtwikn+ζj,ILtεj,IL(t)yj,ILtwikn=0 50

Equation (51), (52) is set to the following

t=1nλn-tψi,Rkt-yi,Rktζi,Rktfsi,Rkt+&ψi,Ikt-yi,Iktζi,Iktfsi,Ikt×xk(t)=0. 51
rik(n)=Rik(n)wik(n) 52

where by Eq. (53)

rikn=t=1nλn-t×ζi,Rktψi,Rktfsi,Rk(n)+ȷζi,Iktψi,Iktfsi,I(k)(n)×xktRikn=t=1nλn-txkt×ζi,Rk(t)yi,Rktfsi,Rkt+ȷζi,Ik(t)yi,Ik(t)fsi,Ikt×sik-1(t)xkTt. 53

Now, define a matrix operation for simplicity ABARBR+ȷAIBI. The flow chart for LSEBPNN is represented in Fig. 2.

Fig. 2.

Fig. 2

The flow chart for LSEBPNN

Performance analysis

Proposed method is implemented into a prototype software system utilizing Python 3.7 to evaluate and assess potential contribution of proposed strategy for future real-world applications. Resources utilized to combine proposed method were an Intel i7 processor (Intel(R) Core(TM) i7-3770 CPU @3.40 GHz 3.80 Ghz) and an eight (8) gigabyte RAM (Intel, Santa Clara, CA, USA) (Samsung, Seoul, Korea). Microsoft Windows 10 was the operating system on which the suggested system was hosted and tested.

Table 1 shows comparative analysis for various fault situations for proposed and existing techniques. Here the fault situation has been detected by virtual sensor-based datasets of automation industry. The parametric analysis has been carried out in terms of QoS, measurement accuracy, RMSE, MAE, prediction performance, and computational time.\

Table 1.

comparative analysis for various fault situation for proposed and existing technique

Virtual sensor-based datasets of automation industry Techniques Computational rate QoS RMSE MAE Prediction performance Measurement accuracy
Spindle-based dataset CNN 41 59 47 43 91 76
RBF 36 61 45 40 93 79
FL_SDDAE-LSEBPNN 34 64 41 35 94 85
Gear-based dataset CNN 50 62 51 51 73 79
RBF 46 63 48 45 76 81
FL_SDDAE-LSEBPNN 43 67 43 39 79 85
Bearing-based dataset CNN 59 63 53 49 79 73
RBF 53 65 49 45 83 77
FL_SDDAE-LSEBPNN 49 68 45 41 86 84

Figures 3, 4, and 5 show comparative analysis for various virtual sensor-based datasets from automation industry. The dataset collected from cloud is based on spindle fault detection-based data, gear fault detection-based data, and bearing fault detection-based data. For spindle fault detection data, the proposed technique obtained computational time of 34%, QoS of 64%, RMSE of 41%, MAE of 35%, prediction performance of 94%, measurement accuracy of 85%.The proposed technique obtained computational time of 43%, QoS of 67%, RMSE of 43%, MAE of 39%, prediction performance of 79%, and measurement accuracy of 85% by gear-based fault detection dataset. For bearing fault detection data, the proposed technique obtained computational time of 49%, QoS of 68%, RMSE of 45%, MAE of 41%, prediction performance of 86%, and measurement accuracy of 84%. From the above analysis, proposed technique obtained optimal results for all the fault detection based on automation industry data.

Fig. 3.

Fig. 3

Comparative analysis of spindle-based dataset in terms of a computational time, b QoS, c RMSE, d MAE, e prediction performance, f measurement accuracy

Fig. 4.

Fig. 4

Comparative analysis of gear-based dataset in terms of a computational time, b QoS, c RMSE, d MAE, e prediction performance, f measurement accuracy

Fig. 5.

Fig. 5

Comparative analysis of bearing-based dataset in terms of a computational time, b QoS, c RMSE, d MAE, e prediction performance, f measurement accuracy

The fundamental challenge in dealing with soft sensor principles is a lack of understanding due to their novelty and, as a result, a lack of typical mathematical descriptions or structure. On the other hand, it allows for more creative expression. In general, vast arrays of statistics for calculations are required when working with soft sensors. It is vital to have a thorough understanding of the controlled process’s principles, physical characteristics, and the parameters’ relationships.

Conclusion

This research propose novel technique in virtual soft sensor-based fault detection in automation industry using deep learning technique integrated with cloud module. Here the aim is to design novel techniques in automation of manufacturing industry where the dynamic soft sensors are used in feature representation and classification of the data. The data has been collected from cloud storage and created the virtual sensors dataset based on gear fault detection, spindle fault detection, and bearing fault detection in automation industry. Then to represent the feature using fuzzy logic-based stacked data-driven auto- encoder (FL_SDDAE) where the features of input data have been identified with general automation problems. Then the features have been classified using least square error back propagation neural network (LSEBPNN) in which the mean square error while classification will be minimized with loss function of the data. Here the experimental results have been carried out in terms of computational time of 34%, QoS of 64%, RMSE of 41%, MAE of 35%, prediction performance of 94%, and measurement accuracy of 85% has been obtained by proposed technique. One is that nonlinear systems’ predictive control cannot be solved successfully. Another issue is that stability as well as resilience of multivariable predictive control algorithms must be addressed, and accurate principle models for complex systems are extremely difficult to construct. Despite the contributions made so far, there are still areas where future work might be improved. On the loss function, targeted-output regularizes would extract even better features, improving the suggested work. Another future intervention would be to use approaches on the unsupervised pre-training to identify dynamic-related aspects. In addition, industrial research scenarios were used to apply the proposed method, however developing a soft sensor proposal for a real-world industrial scenario could be challenging. Non-linearities, abnormalities, and highly complex ecosystems must all be taken into account. The industrial study cases have shown to be suitable and widely used in the implementation and evaluation of models, and they serve as the foundation for many contributions in this field of research.

Funding

The authors extend their appreciation to the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University (IMSIU) for funding and supporting this work through Research Partnership Program no. RP-21-07-06. The authors acknowledge the support from Princess Nourah Bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R440), Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia.

Declarations

Ethical approval

This article does not contain any studies with animals performed by any of the authors.

Conflict of interest

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Shakir Khan, Email: sgkhan@imamu.edu.sa.

Tamanna Siddiqui, Email: tsiddiqui.cs@amu.ac.in.

Azrour Mourade, Email: mo.azrour@umi.ac.ma.

Bayan Ibrahimm Alabduallah, Email: Bialabdullah@pnu.edu.sa.

Saad Abdullah Alajlan, Email: saalajlan@imamu.edu.sa.

Bader M. Albahlal, Email: bmalbahlal@imamu.edu.sa

Amani Alfaifi, Email: Amani0004@hotmail.com.

References 

  • 1.Savytskyi O, Tymoshenko M, Hramm O, Romanov S (2020) Application of soft sensors in the automated process control of different industries. In E3S Web of Conferences (Vol. 166, p 05003). EDP Sciences. 10.1051/e3sconf/202016605003
  • 2.Sun Q, Ge Z (2021) A survey on deep learning for data-driven soft sensors. IEEE Transactions on Industrial Informatics 17(9):5853–5866
  • 3.Andronie M, Lăzăroiu G, Iatagan M, Uță C, Ștefănescu R, CocoȘatu M. Artificial intelligence-based decision-making algorithms, internet of things sensing networks, and deep learning-assisted smart process management in cyber-physical production systems. Electronics. 2021;10(20):2497. doi: 10.3390/electronics10202497. [DOI] [Google Scholar]
  • 4.Kovacova M, Lăzăroiu G. Sustainable organizational performance, cyber-physical production networks, and deep learning-assisted smart process planning in Industry 4.0-based manufacturing systems. Econ Manag Fin Markets. 2021;16(3):41–54. [Google Scholar]
  • 5.Bibi R, Saeed Y, Zeb A, Ghazal TM, Rahman T, Said RA, ... Khan MA (2021) Edge AI-based automated detection and classification of road anomalies in VANET using deep learning. Comput Intell Neurosci 2021:1–16 [DOI] [PMC free article] [PubMed]
  • 6.Valaskova K, Ward P, Svabova L. Deep learning-assisted smart process planning, cognitive automation, and industrial big data analytics in sustainable cyber-physical production systems. J Self-Governance Manag Econ. 2021;9(2):9–20. doi: 10.22381/jsme9220211. [DOI] [Google Scholar]
  • 7.Hndoosh RW, Kumar S, Saroa MS. Fuzzy mathematical models for the analysis of fuzzy systems with application to liver disorders. IOSR J Comput Eng. 2014;16(5):71–85. doi: 10.9790/0661-16577185. [DOI] [Google Scholar]
  • 8.Kingma DP, Welling M (2019) An introduction to variational autoencoders. Foundations and Trends® in Machine Learning 12(4):307–392
  • 9.D'Angelo G, Palmieri F. A stacked autoencoder-based convolutional and recurrent deep neural network for detecting cyberattacks in interconnected power control systems. Int J Intell Syst. 2021;36(12):7080–7102. doi: 10.1002/int.22581. [DOI] [Google Scholar]
  • 10.Abrar S, Zerguine A, Bettayeb M. Recursive least-squares backpropagation algorithm for stop-and-go decision-directed blind equalization. IEEE Trans Neural Netw. 2002;13(6):1472–1481. doi: 10.1109/TNN.2002.804282. [DOI] [PubMed] [Google Scholar]
  • 11.Bargellesi N, Beghi A, Rampazzo M, Susto GA (2021) AutoSS: a deep learning-based soft sensor for handling time-series input data. IEEE Robot Autom Lett 6(3):6100–6107 
  • 12.Thomopoulos SC (2021) Risk assessment and automated anomaly detection using a deep learning architecture. IntechOpen. 10.5772/intechopen.96209
  • 13.Senthilkumar P, Rajesh K (2021) Design of a model based engineering deep learning scheduler in cloud computing environment using Industrial Internet of Things (IIOT). J Ambient Intell Humanized Comput 1–9
  • 14.Yao L, Ge Z. Dynamic features incorporated locally weighted deep learning model for soft sensor development. IEEE Trans Instrum Meas. 2021;70:1–11. [Google Scholar]
  • 15.Moreira de Lima JM, Ugulino de Araújo FM. Industrial semi-supervised dynamic soft-sensor modeling approach based on deep relevant representation learning. Sensors. 2021;21(10):3430. doi: 10.3390/s21103430. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Sahoo SR, Gupta BB. Multiple features based approach for automatic fake news detection on social networks using deep learning. Appl Soft Comput. 2021;100:106983. doi: 10.1016/j.asoc.2020.106983. [DOI] [Google Scholar]
  • 17.Fazil M, Khan S, Albahlal BM, Alotaibi RM, Siddiqui T, Shah MA. Attentional multi-channel convolution with bidirectional LSTM cell toward hate speech prediction. IEEE Access. 2023;11:16801–16811. doi: 10.1109/ACCESS.2023.3246388. [DOI] [Google Scholar]
  • 18.Zhang J, Gao RX. Deep learning-driven data curation and model interpretation for smart manufacturing. Chin J Mech Eng. 2021;34(1):1–21. doi: 10.1186/s10033-021-00587-y. [DOI] [Google Scholar]
  • 19.Xia K, Sacco C, Kirkpatrick M, Saidy C, Nguyen L, Kircaliali A, Harik R. A digital twin to train deep reinforcement learning agent for smart manufacturing plants: environment, interfaces and intelligence. J Manuf Syst. 2021;58:210–230. doi: 10.1016/j.jmsy.2020.06.012. [DOI] [Google Scholar]
  • 20.Khan S, et al. HCovBi-caps: hate speech detection using convolutional and Bi-directional gated recurrent unit with Capsule network. IEEE Access. 2022;10:7881–7894. doi: 10.1109/ACCESS.2022.3143799. [DOI] [Google Scholar]
  • 21.Khan S, et al. BiCHAT: BiLSTM with deep CNN and hierarchical attention for hate speech detection. J King Saud Univ-Comput Inform Sci. 2022;34(7):4335–4344. [Google Scholar]
  • 22.Maschler B, Ganssloser S, Hablizel A, Weyrich M. Deep learning based soft sensors for industrial machinery. Procedia CIRP. 2021;99:662–667. doi: 10.1016/j.procir.2021.03.115. [DOI] [Google Scholar]
  • 23.Khan S, Saravanan V, Lakshmi TJ, Deb N, Othman NA. Privacy protection of healthcare data over social networks using machine learning algorithms. Comput Intell Neurosci. 2022;2022(9985933):8. doi: 10.1155/2022/9985933. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 24.Haq AU, Li JP, Ahmad S, Khan S, Alshara MA, Alotaibi RM. Diagnostic approach for accurate diagnosis of COVID-19 employing deep learning and transfer learning techniques through chest X-ray images clinical data in E-healthcare. Sensors. 2021;21(24):8219. doi: 10.3390/s21248219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Khan S, Al-Mogren AS, AlAjmi MF (2015) Using cloud computing to improve network operations and management. 2015 5th National Symposium on Information Technology: Towards New Smart World (NSITNSW), Riyadh, Saudi Arabia, 2015, pp. 1–6. 10.1109/NSITNSW.2015.7176418

Articles from The International Journal, Advanced Manufacturing Technology are provided here courtesy of Nature Publishing Group

RESOURCES