Skip to main content
MethodsX logoLink to MethodsX
. 2023 Jan 18;10:102017. doi: 10.1016/j.mex.2023.102017

Method for real-time prediction of cutter wear during shield tunnelling: A new wear rate index and MCNN-GRU

Nan Zhang a, Lin-Shuang Zhao b,
PMCID: PMC9880237  PMID: 36713303

Abstract

Cutter wear is one of the key factors influencing construction efficiency during shield tunnelling. Prediction of cutter wear can improve construction efficiency by reducing the times of cutter inspections in engineering practice. Evaluation of cutter life is vital for cutter wear prediction, however, existing cutter life indices can only estimate the health condition of all cutters on cutterhead on a holistic basis. A new index was proposed to evaluate cutter wear located at a specific installation position on cutterhead. A deep learning model integrating the index was developed for the estimation of accumulated cutter wear during real time shield tunnelling. The new index can be obtained by monitored field parameters and can predict cutter wear with historical wear patterns. The input and output data samples were reshaped for multi-step prediction. A shield tunnelling section in Guangzhou weathered granite was used for validation. The proposed method can help reduce the cost of cutter replacement by reducing the times of machine interventions. The method article is a companion paper to the original article [1].

  • Proposed index for prediction of cutter wear rate.

  • Deep learning model of 1D-CNN and GRU.

  • Multi-step cutter wear prediction.

Keywords: 1D-CNN; Wear rate index, Gated recurrent unit, Wear prediction

Method name: Integrated method based on a new index and CNN-GRU

Graphical abstract

Image, graphical abstract


Specifications table

Subject area: Engineering
More specific subject area: Shield tunnelling
Name of your method: Integrated method based on a new index and CNN-GRU
Name and reference of original method: Jarrett, K., Kavukcuoglu, K., Ranzato, M., Lecun, Y. (2009). What is the best multi-stage architecture for object recognition? in: IEEE 12th International Conference on Computer Vision, 2009, 2146–2153, https://doi.org/10.1109/ICCV.2009.5459469. [2]

Cho, K., Van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint, https://arxiv.org/abs/1406.1078. [3]
Resource availability: DOI: https://doi.org/10.1016/j.tust.2022.104830.

Method details

Index for evaluation of cutter wear

The proposed index

Wear of cutters replaced from a certain position on cutterhead was defined in Eq. (1):

Wr=w/(t×d) (1)

where Wr is the proposed index for estimation of cutter wear rate, w is the cumulative wear of cutters in millimeter (mm), t is the working time between cutter and ground during shield tunnelling in minute (min), d is the cutter ring diameter in (mm).

Cutter wear records were derived from maintenance sheets during shield machine interventions where cutter radial wear was measured manually with customized calibrators. For a specific tunnelling case, cutter ring diameters are usually determined. Cutter ring diameters of 432 mm (17 inches) and 483 mm (19 inches) are used by the overwhelming majority of shield machines [4]. The accumulated cutter wear at a certain installation position equals to the addition of wear of each worn cutter replaced at that position. The raw data included non-working time data and data noise which needs to be eliminated. Non-working time data was filtered using Eq. (2) and (3) [5,6]:

G=g(P)×g(A)×g(C) (2)
g(x)={1,0,x0x=0 (3)

Where x is the collected boring data samples, P is penetration depth per revolution, A is the advancing speed of the shield machine, and C is cutterhead rotation speed. If G equals to zero, the corresponding data sample will be removed. Otherwise, the data was regarded as working time data which will be reserved for follow-up calculations. The data noise in the working time data was then filtered with the assumption that the boring data of each ring followed the Gaussian distribution. Data within 68% confidence interval was qualified for calculation of machine working time. Data noise can be removed by Eq. (4) [7]:

gj(μjiσji,μji+σji),1im;j{P,A,C,Tf,To} (4)

Where i and j are boring parameters and tunnel rings, respectively, μ is the mean value, σ is the standard deviation, gj is data samples, m is the number of rings, Tf and To are thrust and torque, respectively.

Working time was related to the number of remaining data in that ring after the filter of empty data and data noise elimination. Working time in Eq. (1) can be determined by recording the frequency of boring parameters which can be adjusted manually during tunnelling [8]. For example, if boring data was recorded at a one-minute interval, working time equals to the number of remaining data. If boring data was recorded every 30 seconds, working time equals to half of the number of remaining data.

The wear data was recorded during cutter inspections which were conducted at regular intervals. Linear interpolation, which was used to calculate the accumulated cutter wear of a certain cutter between two adjacent wear records, is formulated in Eq. (5):

Wa=TTn×(WnWn1)+Wn1 (5)

Where, Wa is the accumulated cutter wear of current time, T is the working time of the shield machine from the last cutter wear record, Tn equals to the construction time between the last wear record and current wear record, n equals to the number of wear records, Wn and Wn-1 are the accumulated wear until current wear record and last wear record, respectively.

Deep learning model

Data preparation

The input parameters included boring parameters, covered depth, and the proposed index. Detailed parameter settings and statistical distributions of the boring parameters can be found in the companion paper [1]. The boring data samples constituted a data matrix which is presented in Eq. (6):

P=[Pi×j]n×m=[p11p12p21p22p1ip1mp2ip2mpj1pj2pn1pn2pjipjmpnipnm] (6)

Where m is the number of boring parameters, and n is the number of data samples. pji is the ith boring parameter of data sample j. Each row in Eq. (6) is a data sample recorded at a one-minute interval. As cutter inspections can be conducted at any time during tunnelling of one ring, cutter wear should be recorded to obtain the exact time, other than the ring number, when the cutter was replaced. Then, a logarithmic form of the proposed index was fitted into the data matrix according to time stamps. The complete data matrix for the prediction of a certain cutter is shown in Eq. (7):

P=[Pi×j,Wr]n×(m+1)=[p11p12p21p22p1ip1mp2ip2mpj1pj2pn1pn2pjipjmpnipnmlogWr1logWr2logWrjlogWrn] (7)

Where logWrj is wear rate index based on Eq. (1) and Eq. (5). Following the time series in Eq. (7), around 80% of data samples were used as training dataset, while the remaining 20% was used for validation. Since the number of cutter replacements of cutters with different installation radii was different, the ratio of training data to test data was determined to be around 4:1. The data samples in Eq. (7) were then limited within [0, 1] by min-max normalization to accelerate model convergence.

Reshape of input and output data

After data preparation, the input was shaped into a tensor with the shape of the data sample length, time step, and input dimension. The time step determined the number of time intervals used in one data sample. The output was programmed into [sample size, leads]. The leads are the number of time intervals predicted ahead of the current time. After reshaping of input and output data samples, the length of the input array can be calculated by Eq. (8):

L=N×T×D (8)

Where L is the length of the input array, N represents the number of input parameters, T represents the time step, and D represents the input dimensions.

Integration of proposed Wr and deep learning model

The integrated deep learning model consists of a convolution layer, a max pooling layer, a gated recurrent unit (GRU) layer, and a fully connected layer [9], [10], [11]. The proposed index, Wr, was used as the output parameter of the deep learning model. The mean squared error between predicted and measured Wr was used as the loss function of the model which was presented in Eq. (9).

Loss=1nj=1n(Wrj^Wrj)2 (9)

Where n represents the number of data samples in one batch size, Wrj^ is the predicted wear rate index, Wrj is the measured wear rate index. The parameters of the deep learning model to be optimized were trained jointly by the Adam algorithm [12,13] during the model training process. Fig. 1 shows the flowchart of the proposed method.

Fig. 1.

Fig 1

Flowchart of real-time prediction of cutter wear.

Computational tool

Keras, programmed with Python, was used for model establishment [14,15]. TensorFlow was used as the backend. Table A1 shows the pseudocode for multi-step prediction. The code includes reshaping input and output data samples for multi-step cutter wear prediction and interpretation of prediction results. Table A2 shows the pseudocode for the construction of the CNN-GRU model based on Keras. The code comprises of development of model architecture and the determination of model hyperparameters. The model was trained and tested using an Intel Xeon E5-2650 v2 @ 2.6 GHz CPU. The following steps were used for the prediction of cutter wear during shield tunnelling in real time.

  • (1)

    Empty data removal and elimination of data noise. Users should determine the cutter number for wear prediction. Then, training and test datasets which contain boring parameters obtained from the auto-acquisition system of the shield machine should be prepared. Non-working time data can be deleted based on Eq. (2) and Eq. (3). Then, data noise should be filtered using Eq. (4). It should be noted that Eq. (4) is case specific and can be influenced by the characteristics of the distribution of parameters recorded by the shield machine during tunnelling of one ring. Users should eliminate the influence of data which represents the non-working time between cutter and ground.

  • (2)

    Accumulated wear of the selected cutter should be calculated at each cutter inspection position [16]. Then, accumulated wear data can be integrated with boring data according to time stamps. Cutter wear conditions between adjacent cutter inspections can be obtained by Eq. (5). The wear rate index can be determined by Eq. (1).

  • (3)

    Preparation of training and test data. The input of the hybrid model included cutterhead parameters, soil conditioning parameters, power parameters, and spoil discharge parameters. Specifically, covered depth and tunnel face pressure was used to present variations of ground formations [17,18]. The result of the wear rate index should be transformed into a logarithmic form. One wear record, which is the critical timeline for grouping training and test datasets with a ratio of around 4:1, should be determined. Data samples before the selected wear record are used for model training while the rest is for model validation. The number of data samples in one input array can be calculated with Eq. (8) while the length of output equals to the leads. Time steps and leads can be adjusted by users according to the time needed for the construction of one ring in different cases for real time prediction.

  • (4)

    Model construction. The reshaped input data samples were first inputted into a convolutional layer where the convolution kernel scrolled on the input data samples with a determined stride. The output of the convolutional layer was down-sampled in the max pooling layer. The GRU layer was then used to consider the historical wear information of the selected cutter and to learn new wear characteristics when encountering new types of ground. The results of the size of the input array and output array determined the number of neurons in the input and output layers, respectively. Other model architectures and hyperparameters can be determined by trial and error.

  • (5)

    Model training. Eq. (9) was used as a loss function of the deep learning model during model training. Then, the model can yield prediction results of the selected cutter ahead of the current time. To predict cutter wear with multi-step results, a multi-step prediction strategy was developed. Five data samples, i.e., five rows in Eq. (7), were used as input. Three proposed Wr after the input were used as output. The next input consisted of the last two data samples from the previous input and three new data samples after the previous input. The next three proposed Wr were the next output. The training data was used to calibrate the intelligent model and the computational cost was 285s.

  • (6)

    The model with optimized trainable parameters was then used for the prediction of cutter wear conditions. Accumulated cutter wear of the selected cutter can be obtained using Eq. (1) with known working time between cutter and strata. Wear can be calculated by subtracting accumulated wear until the last wear records from the predicted result.

Method validation

Most existing indices cope with the health conditions of all cutters on cutterhead and reflect cutter life in terms of cutter consumptions or cutter radial wear after excavation of a certain distance [19]. The proposed method introduced a new index that can be integrated with deep neural networks for the prediction of cutter wear in real time. The new wear rate index integrated with the hybrid deep neural network was validated by field data from an earth pressure balance tunnelling case in Guangzhou city, China. Single-ring disc cutter with a diameter of 483 mm was used for excavation. Cutterhead specifications and cutter distribution can be found in the companion paper [1]. Fig. 2 shows the predicted and measured Log (Wr) of cutter number 35. Wear conditions of cutter number 35 can be calculated using Eq. (1). Fig. 3 shows the wear increase of cutter number 35. The results showed the method can predict variations of cutter wear during real time excavation successfully. The coefficient of determination (R2), root mean squared error (RMSE), and mean absolute error (MAE) are 85.5%, 2.6e-3, and 2e-3, respectively. Comparison with other deep neural networks including MCNN-LSTM, MCNN-RNN, and MCNN with similar model structure and prediction strategy proved the superiority of the proposed model [1]. The proposed index shows the impact of ground conditions, boring parameters, and cutter ring material and geometry on wear features as the shield machine advances. Wear characteristics obtained from previous tunnelling sections can be experience for follow-up excavation. The proposed method in this article is case specific due to different cutterhead layouts and cutter geometry in different shield tunnelling projects. Supplementary field data from different shield tunnelling cases should be used to calibrate the new index and deep learning model before implementation. For the future application of the method, a database that includes operational parameters and corresponding wear data derived from different geological conditions and shield machine specifications should be established. The determination of hyperparameters of deep learning models can be optimized with machine learning techniques which can help accelerate model convergence and increase model performance.

Fig. 2.

Fig 2

Comparison between predicted and measured log (Wr) of cutter number 35.

Fig. 3.

Fig 3

Prediction result of accumulated wear of cutter number 35.

CRediT authorship contribution statement

Nan Zhang: Data curation, Methodology, Investigation, Software, Writing – original draft. Lin-Shuang Zhao: Visualization, Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This research has been supported by the National Natural Science Foundation of China (Grant No. 42102308) and the Research Funding of Shantou University for New Faculty Member (Grant No. NTF21008-2021). The author would like to thank Shantou University and Shanghai Jiao Tong University (SJTU) for technical support. In addition, the author would like to thank reviewers for their comments to improve this article.

Footnotes

Related research article: Zhang, N., Shen, S.L., Zhou, A.N. (2023). A new index for cutter life evaluation and ensemble model for prediction of cutter wear. Tunnelling and Underground Space Technology, 131, 104830, https://doi.org/10.1016/j.tust.2022.104830.

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.mex.2023.102017.

Contributor Information

Nan Zhang, Email: nan_zhang@sjtu.edu.cn.

Lin-Shuang Zhao, Email: lshzhao@stu.edu.cn.

Appendix. Supplementary materials

mmc1.docx (14.1KB, docx)

Data availability

  • Data will be made available on request.

References

  • 1.Zhang N., Shen S.L., Zhou A.N. A new index for cutter life evaluation and ensemble model for prediction of cutter wear. Tunnell. Underground Space Technol. 2023;131 doi: 10.1016/j.tust.2022.104830. [DOI] [Google Scholar]
  • 2.Jarrett K., Kavukcuoglu K., Ranzato M., Lecun Y. IEEE 12th International Conference on Computer Vision. 2009. What is the best multi-stage architecture for object recognition? pp. 2146–2153. [DOI] [Google Scholar]
  • 3.Cho, K., Van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint, 2014, https://arxiv.org/abs/1406.1078.
  • 4.Rostami J. Hard rock TBM cutterhead modeling for design and performance prediction. Geomechanik Tunnelbau. 2008;1(1):18–28. doi: 10.1002/geot.200800. [DOI] [Google Scholar]
  • 5.Yan T., Shen S.L., Zhou A. Identification of geological characteristics from construction parameters during shield tunnelling. Acta Geotech. 2023;18(1):535–551. doi: 10.1007/s11440-022-01590-w. [DOI] [Google Scholar]
  • 6.Yan T., Shen S.L., Zhou A., Chen X.S. Prediction of geological characteristics from shield operational parameters using integrating grid search and K-fold cross validation into stacking classification algorithm. J. Rock Mech. Geotechn. Eng. 2022;14:1292–1303. doi: 10.1016/j.jrmge.2022.03.002. [DOI] [Google Scholar]
  • 7.Zhang N., Zhou A., Pan Y.T., Shen S.L. Measurement and prediction of tunnelling-induced ground settlement in karst region by using expanding deep learning. Measurement. 2021;183:109700. doi: 10.1016/j.measurement.2021.109700. [DOI] [Google Scholar]
  • 8.Elbaz K., Yan T., Zhou A., Shen S.L. Deep learning analysis for energy consumption of shield tunneling machine drive system. Tunnell. Underground Space Technol. 2022;123:104405. doi: 10.1016/j.tust.2022.104405. [DOI] [Google Scholar]
  • 9.Atangana Njock P.G., Shen S.L., Zhou A., Modoni G. Artificial neural network optimized by differential evolution for predicting diameters of jet grouted columns. J. Rock Mech. Geotechn. Eng. 2021;13:1500–1512. doi: 10.1016/j.jrmge.2021.05.009. [DOI] [Google Scholar]
  • 10.Zhang N., Shen S.L., Zhou A., Jin Y.F. Application of LSTM approach for modelling stress-strain behavior of soil. Appl. Soft Comput. 2021;100:106959. doi: 10.1016/j.asoc.2020.106959. [DOI] [Google Scholar]
  • 11.Shen S.L., Atangana Njock P.G., Zhou A., Lyu H.M. Dynamic prediction of jet grouted column diameter in soft soil using Bi-LSTM deep learning. Acta Geotech. 2021;16(1):303–315. doi: 10.1007/s11440-020-01005-8. [DOI] [Google Scholar]
  • 12.Shen S.L., Elbaz K., Shaban W.M., Zhou A. Real-time prediction of shield moving trajectory during tunnelling. Acta Geotech. 2022;17(4):1533–1549. doi: 10.1007/s11440-022-01461-4. [DOI] [Google Scholar]
  • 13.Kingma, D.P., Ba, J. (2014). Adam: a method for stochastic optimization, arXiv:1412.6980, https://arxiv.org/abs/1412.6980.
  • 14.Gao S., Huang Y.F., Zhang S., Han J.C., Wang G.Q., Zhang M.X., Lin Q.S. Short-term runoff prediction with GRU and LSTM networks without requiring time step optimization during sample generation. J. Hydrol. 2020;589 doi: 10.1016/j.jhydrol.2020.125188. [DOI] [Google Scholar]
  • 15.Shen S.L., Zhang N., Zhou A., Yin Z.Y. Enhancement of neural networks with an alternative activation function tanhLU. Expert Syst. Appl. 2022;199:117181. doi: 10.1016/j.eswa.2022.117181. [DOI] [Google Scholar]
  • 16.Lyu H.M., Shen S.L., Zhou A., Yin Z.Y. Assessment of safety status of shield tunnelling using operational parameters with enhanced SPA. Tunnell. Underground Space Technol. 2022;123 doi: 10.1016/j.tust.2022.104428. [DOI] [Google Scholar]
  • 17.Liu M.B., Liao S.M., Yang Y.F., Men Y.Q., He J.Z., Huang Y.L. Tunnel boring machine vibration-based deep learning for the ground identification of working faces. J. Rock Mech. Geotechn. Eng. 2021;13(6):18. doi: 10.1016/j.jrmge.2021.09.004. [DOI] [Google Scholar]
  • 18.Xu D.M., Wang Y.S., Huang J.Q., Liu S.J., Xu S.J., Zhou K. Prediction of geology condition for slurry pressure balanced shield tunnel with super-large diameter by machine learning algorithms. Tunnell. Underground Space Technol. 2023;131 doi: 10.1016/j.tust.2022.104852. [DOI] [Google Scholar]
  • 19.Kilic K., Toriya H., Kosugi Y., Adachi T., Kawamura Y. One-Dimensional convolutional neural network for pipe jacking EPB TBM cutter wear prediction. Appl. Sci. 2022;12(5):2410. doi: 10.3390/app12052410. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.docx (14.1KB, docx)

Data Availability Statement

  • Data will be made available on request.


Articles from MethodsX are provided here courtesy of Elsevier

RESOURCES