Materials. 2024 Nov 20;17(22):5675. doi: 10.3390/ma17225675
Algorithm A1: Machine learning pipeline for model-type and aging-type classification

   Input: LSR dataset D_LSR = {X_LSR, y_model_LSR, y_aging_LSR}; EPDM dataset D_EPDM = {X_EPDM, y_model_EPDM, y_aging_EPDM}

   Output: Final accuracy A_final, classification report, and confusion matrix C_final

1 Load datasets D_LSR and D_EPDM from CSV files;

2 Concatenate features: X_model ← [X_LSR, X_EPDM];

3 Concatenate model-type labels: y_model ← [y_model_LSR, y_model_EPDM];

4 Encode model-type labels: y_model_encoded ← LE(y_model);

5 Normalize features: X_model_scaled ← SC(X_model);

6 Concatenate features: X_aging ← [X_LSR, X_EPDM];

7 Concatenate aging-type labels: y_aging ← [y_aging_LSR, y_aging_EPDM];

8 Encode aging-type labels: y_aging_encoded ← LE(y_aging);

9 Normalize features: X_aging_scaled ← SC(X_aging);

10 Split (X_model_scaled, y_model_encoded) and (X_aging_scaled, y_aging_encoded) into 80–20 training and testing sets;
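As a concrete illustration, steps 1–10 can be sketched in Python with scikit-learn. The file contents, feature columns, and label-column names below are assumptions (the algorithm does not list them), so synthetic stand-ins replace the actual CSV files:

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder, StandardScaler

rng = np.random.default_rng(0)

def make_dataset(model_name: str, n: int = 40) -> pd.DataFrame:
    """Stand-in for pd.read_csv(...); all column names are illustrative."""
    df = pd.DataFrame(rng.normal(size=(n, 3)), columns=["f1", "f2", "f3"])
    df["model_type"] = model_name                                  # LSR or EPDM
    df["aging_type"] = rng.choice(["thermal", "electrical"], size=n)
    return df

# Step 1: load D_LSR and D_EPDM (here: synthetic stand-ins)
df_lsr, df_epdm = make_dataset("LSR"), make_dataset("EPDM")

# Steps 2-3 / 6-7: concatenate features and labels across both materials
df_all = pd.concat([df_lsr, df_epdm], ignore_index=True)
X = df_all[["f1", "f2", "f3"]].to_numpy()
y_model, y_aging = df_all["model_type"], df_all["aging_type"]

# Steps 4 / 8: label encoding (LE); steps 5 / 9: normalization (SC)
y_model_enc = LabelEncoder().fit_transform(y_model)
y_aging_enc = LabelEncoder().fit_transform(y_aging)
X_scaled = StandardScaler().fit_transform(X)

# Step 10: 80-20 train/test split for each classification task
Xm_tr, Xm_te, ym_tr, ym_te = train_test_split(
    X_scaled, y_model_enc, test_size=0.2, random_state=42)
Xa_tr, Xa_te, ya_tr, ya_te = train_test_split(
    X_scaled, y_aging_enc, test_size=0.2, random_state=42)
```

Note that the sketch mirrors the algorithm's ordering, which normalizes before splitting; a leakage-free variant would fit the scaler on the training split only.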

    Input: Model M; training and testing sets for the model-type or aging-type classification task

    Output: Performance metrics: accuracy A, classification report, and confusion matrix C

11 Train model M on the training data (X_train, y_train) to minimize the loss function L;

12 M ← argmin_M L(M(X_train), y_train);

   Predict test labels ŷ_test on X_test;

13 ŷ_test = M(X_test);

   Calculate accuracy A;

14 A = (1/n) Σ_{i=1}^{n} 1(ŷ_i = y_i);

   Output the classification report and confusion matrix C;
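Steps 11–14 apply identically to every model, so they can be wrapped in one helper. This sketch assumes any scikit-learn-style estimator with fit/predict:

```python
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

def train_and_evaluate(model, X_train, y_train, X_test, y_test):
    """Steps 11-14: fit M, predict y_hat_test, and report A, the
    classification report, and the confusion matrix C."""
    model.fit(X_train, y_train)           # M <- argmin_M L(M(X_train), y_train)
    y_pred = model.predict(X_test)        # y_hat_test = M(X_test)
    acc = accuracy_score(y_test, y_pred)  # A = (1/n) * sum 1(y_hat_i == y_i)
    report = classification_report(y_test, y_pred)
    cm = confusion_matrix(y_test, y_pred)
    return acc, report, cm
```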

15 Define and initialize models for both classification tasks as follows:
  • XGBoost: Define with default hyperparameters.
  • Multilayer Perceptron (MLP): Define with 100 hidden units and ReLU activation.
  • Deep Neural Network (DNN): Define with hidden layers [150,100,50].
  • Stacked Model (RF + SVM): Use Random Forest (RF) and Support Vector Machine (SVM) as base estimators, with an RF as the final estimator.
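In scikit-learn terms, step 15 might look like the following. Hyperparameters other than those stated above are left at their defaults, the "DNN" is modeled as a deeper MLP, and XGBoost is shown only as a comment since it lives in a separate package:

```python
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC

# MLP: a single hidden layer of 100 ReLU units
mlp = MLPClassifier(hidden_layer_sizes=(100,), activation="relu")

# "DNN": a deeper MLP with hidden layers [150, 100, 50]
dnn = MLPClassifier(hidden_layer_sizes=(150, 100, 50), activation="relu")

# Stacked model: RF and SVM as base estimators, RF as the final estimator
stacked = StackingClassifier(
    estimators=[("rf", RandomForestClassifier()), ("svm", SVC())],
    final_estimator=RandomForestClassifier(),
)

# XGBoost with default hyperparameters (requires the xgboost package):
# from xgboost import XGBClassifier
# xgb = XGBClassifier()
```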

    Input: Stacked model M_stacked; parameter grid G

    Output: Optimal parameters G* and best model M_best

16 Perform a randomized search on M_stacked over grid G;

17 for each parameter configuration g ∈ G do

18 └ Evaluate M_stacked(g) on the training set and compute its accuracy A(g);

19 Set G* ← argmax_g A(g) and update the best model M_best ← M_stacked(G*);
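Steps 16–19 correspond to scikit-learn's RandomizedSearchCV, which samples configurations g from G and keeps the best-scoring one. The parameter grid and training data below are hypothetical stand-ins, as the actual search space is not reproduced here:

```python
from scipy.stats import randint
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.model_selection import RandomizedSearchCV
from sklearn.svm import SVC

# Toy training data standing in for (X_train, y_train)
X_train, y_train = make_classification(n_samples=60, n_features=4, random_state=0)

stacked = StackingClassifier(
    estimators=[("rf", RandomForestClassifier()), ("svm", SVC())],
    final_estimator=RandomForestClassifier(),
)

# Hypothetical grid G; nested estimators are addressed by their names
param_grid = {
    "rf__n_estimators": randint(50, 200),
    "svm__C": [0.1, 1.0, 10.0],
    "final_estimator__max_depth": [None, 5, 10],
}

# Steps 17-18: evaluate A(g) for each sampled configuration g
search = RandomizedSearchCV(stacked, param_grid, n_iter=4, cv=3,
                            scoring="accuracy", random_state=0)
search.fit(X_train, y_train)

# Step 19: G* and M_best
G_star, M_best = search.best_params_, search.best_estimator_
```

RandomizedSearchCV scores each configuration by cross-validation on the training data rather than by raw training accuracy, which guards against the search overfitting the grid.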

   Input: Optimized stacked model M_best; testing set (X_test, y_test)

   Output: Final accuracy A_final, classification report, and confusion matrix C_final

20 Use M_best to predict labels ŷ_test on the test data;

21 ŷ_test = M_best(X_test);

   Calculate and output the final accuracy A_final, classification report, and confusion matrix C_final;

22 return A_final, C_final