Skip to main content
. 2024 Feb 28;10:e1901. doi: 10.7717/peerj-cs.1901
Algorithm 2: Processing DWT output signals through CNN-BLSTM Algorithm
Input: speech signals, Deep learning parameter (batch size, feature dimension, classes, train test ratio).
Output: enhanced speech signal with recognition rate performance.
Step 1: capture speech signals by using DMA microphone array
Step 2: Apply an analogue to digital converter to convert an analogue signal into a digital signal.
Step 3: apply wavelet transform by applying Xa,b=1aψtbbxtdt• Decompose signal into LL, HL, LH, and HH bands by computing the wavelet coefficients as cjk=Wψf2j,k2j
Step 4: Input these coefficients to deep learning
• Process through convolutional layers nout=nin+2pks+1, nin denotes the input attributes, nout denotes the output features, k convolution kernel size, p padding size, s is the stride
• Process the convolved data through pooling layer hxyl=maxi=0,s,j=0,..shx+1y+jl1
• Perform linearization by applying linear layer
• Apply BiLSTM layer
• Process the memory unit data through fully connected layer zl = Wlhl−1
• Soft max layer softmaxzi=ezijezj
Step 5: obtain the final output speech data and measure the performance