Abstract
Ionic liquids (ILs) are highly effective for capturing carbon dioxide (CO2). The prediction of CO2 solubility in ILs is crucial for optimizing CO2 capture processes. This study investigates the use of deep learning models for CO2 solubility prediction in ILs using a comprehensive dataset of 10,116 CO2 solubility data points for 124 ILs under different temperature and pressure conditions. Deep neural network models, including an Artificial Neural Network (ANN) and a Long Short-Term Memory (LSTM) network, were developed to predict CO2 solubility in ILs. The ANN and LSTM models demonstrated robust test accuracy in predicting CO2 solubility, with coefficient of determination (R2) values of 0.986 and 0.985, respectively. Both models' computational efficiency and cost were investigated, and the ANN model achieved reliable accuracy with significantly lower computational time (approximately 30 times faster) than the LSTM model. A global sensitivity analysis (GSA) was performed to assess the influence of process parameters and associated functional groups on CO2 solubility. The sensitivity analysis results provided insights into the relative importance of input attributes on the output variable (CO2 solubility) in ILs. The findings highlight the significant potential of deep learning models for streamlining the screening of ILs for CO2 capture applications.
Keywords: Ionic liquids, CO2 capture, Deep learning, ANN, LSTM, Global sensitivity analysis
Subject terms: Environmental sciences, Energy science and technology, Engineering, Materials science, Mathematics and computing
Introduction
Carbon dioxide (CO2) released into the atmosphere through industrial production has resulted in significant environmental issues, including global climate change1. To mitigate the emission and accumulation of CO2, the capture and separation of CO2 from natural gas and flue gas have emerged as effective approaches2. Various technologies have been developed for CO2 separation, including amine scrubbing3, pressure swing adsorption (PSA)4, temperature swing adsorption (TSA)5, and membrane separation technology6. Among these technologies, amine absorption is widely utilized in industry. The commonly employed amine solvents for CO2 absorption include monoethanolamine (MEA), methyldiethanolamine (MDEA), and diethanolamine (DEA)1. However, these absorbents have limitations, such as volatility and high energy consumption during desorption7. Traditional CO2 capture methods like amine scrubbing are thus hindered by high energy demands for regeneration and significant solvent loss, a combination that not only increases operational costs but also contributes to a larger environmental footprint8.
In the past decade, ionic liquids (ILs) have emerged as among the most promising candidates for CO2 capture. The utilization of ILs in carbon capture represents a favourable alternative to conventional amine-based solvents, primarily due to two key advantages: their remarkably low vapour pressure and the ability to tailor their molecular structure to suit specific requirements9. These capabilities stem from their unique molecular structures (anions, cations, and functional groups) and exceptional properties such as thermal stability, nonvolatility, and outstanding CO2 solubility10–14. The general properties of the majority of ILs are presented in Table 115.
Table 1.
General properties of ILs15.
| Property | General characters |
|---|---|
| Salt ions | Large cations and anions |
| Freezing temperature | < 100 °C |
| Liquidus temperature | > 200 °C |
| Thermal stability | High |
| Viscosity | < 100 cP, workable |
| Dielectric constant | < 30 |
| Polarity | Moderate |
| Specific conductivity | < 10 mS/cm, good |
| Vapor pressure | Negligible |
| Solvency | Strong |
| Catalytical character | Excellent (for organic reactions) |
One major challenge in utilizing ILs for CO2 capture is their high viscosity, in addition to the complex synthesis and purification processes required to produce them. Compared to the conventional solvents typically used for CO2 capture, ILs generally exhibit significantly higher viscosity16. As highlighted by Krupiczka et al.17, the viscosity of ILs can be altered by employing appropriate combinations of cations and anions; notably, the anion has a greater influence on viscosity than the cation. Increasing the alkyl chain length within the cation generally leads to a corresponding increase in IL viscosity17. In terms of anion effects on the viscosity of imidazolium-based ILs, the reported order is [bmim][NTf2] < [bmim][CF3SO3] < [bmim][BF4] < [bmim][PF6]. ILs are highly adaptable and can be customized for specific applications by varying the types and ratios of cations and anions; this versatility serves as the basis for the computer-aided design of ILs18.
The development of accurate models to predict the solubility of CO2 in ILs is a critical aspect of designing ILs for carbon capture using computer-aided molecular design (CAMD). Traditional thermodynamic models have been utilized to estimate gas solubilities, including CO2, in ILs. These include the predictive Soave–Redlich–Kwong (PSRK) group-contribution equation of state19, the group-contribution-based Statistical Associating Fluid Theory (SAFT)20, cubic equations of state combined with the UNIFAC (UNIQUAC Functional-group Activity Coefficients) method21, and COSMO-RS (Conductor-like Screening Model for Real Solvents)22. These models are built on robust thermodynamic principles and can accurately assess the effects of temperature and pressure; however, their ability to deliver precise quantitative solubility predictions may sometimes be inadequate.
In addition to rigorous thermodynamic modelling, the quantitative structure–property relationship (QSPR) method provides another practical approach for predicting solubility. This method establishes a quantitative correlation between the property of interest and specific structural descriptors of the molecules. Group contribution (GC) methods, which utilize the occurrences of functional groups in the molecule as molecular descriptors, are commonly employed in CAMD. Linear GC models are suitable for specific properties, while nonlinear GC models are required for accurately predicting other properties. Recently, there has been significant advancement and broad adoption of machine learning (ML) models for developing complex nonlinear QSPR or GC models. These models have demonstrated their effectiveness in estimating various properties, including CO2 solubility23, H2S solubility24, and surface tension25. ML models have emerged as a powerful tool for CO2 capture research. Their ability to learn from data allows them to rapidly predict complex material properties, like CO2 solubility in ILs23. This reduces the time and cost associated with traditional methods and provides valuable insights into the key factors governing CO2 capture efficiency26.
Neural network-based machine learning models have gained significant popularity in predictive analytics, particularly for estimating CO2 solubility. Eslamimanesh et al.27 designed an artificial neural network (ANN) model to predict the solubility of CO2 in 24 commonly used ILs for a dataset consisting of 1128 data points. Venkatraman and Alsberg28 applied various machine learning algorithms, such as Partial Least Squares Regression (PLSR), Conditional Inference Trees (CTREE), and Random Forest (RF), to a dataset comprising 10,848 solubility measurements for 185 ILs. Soleimani et al.29 applied a decision tree-based stochastic gradient boosting (SGB) algorithm to predict H2S solubility in ILs using 465 experimental data points. Song et al.30 developed ANN-GC and support vector machine (SVM-GC) models for CO2 solubility prediction in ILs with a dataset containing 10,116 data points (covering 124 different ILs). Deng et al.31 used three deep-learning models, namely a Convolutional Neural Network (CNN), a Deep Neural Network (DNN), and a Recurrent Neural Network (RNN), to predict CO2 solubility in ILs with a relatively small dataset of 218 data points for 13 types of ILs. Recently, Tian et al.32 utilized ionic fragment contribution (IFC) with ANN and SVM models to predict 13,055 CO2 solubility data points in 164 kinds of ILs. Liu et al.33 estimated CO2 solubility for 1517 data points in 20 different ILs using Particle Swarm Optimization (PSO), Grey Wolf Optimization (GWO), and Sparrow Search Algorithm (SSA) variants of SVM models. The prediction accuracy of these models depends on the algorithm architecture and the associated dataset; smaller datasets, which are less complex to train on, generally yield higher apparent accuracy than larger ones. DNNs excel at handling large datasets due to their ability to learn complex patterns with a higher number of neurons. However, for optimal performance with extensive data, optimization and regularization techniques become crucial to prevent overfitting.
This study aims to develop deep neural network-based models to predict CO2 solubility in ILs on a larger dataset. An ANN model and a long short-term memory (LSTM) recurrent neural network are employed to address CO2 solubility prediction on this extensive dataset, which contains 10,116 CO2 solubility measurements from the work of Song et al.30. In their study, a simple ANN model with one hidden layer (7 neurons) was implemented on these data; however, their work lacks information about validation, hyperparameter tuning, and regularization techniques for such a large CO2 solubility dataset.
This work builds upon the previous study30 by proposing a DNN-based ANN model with three hidden layers, each containing 64 neurons. Model validation and hyperparameter tuning were performed to assess the model's performance, and the effectiveness of both the ANN and LSTM models was assessed in terms of computational cost and memory usage during training. Furthermore, global sensitivity analysis tools, namely the Sobol and Morris methods, were used to investigate the impact of the input variables (including functional groups) on CO2 solubility in different ILs. ILs are promising for capturing CO2 emissions from power plants and industrial processes; by accurately predicting CO2 solubility, researchers can design ILs with optimal CO2 capture capacity, leading to more efficient carbon capture and storage (CCS) technologies.
Methods
Dataset/experimental data
This study utilizes CO2 solubility data originally collected by Venkatraman and Alsberg28 and meticulously preprocessed and compiled by Song et al.30 for machine learning model training. The quality of the preprocessed data rendered further modifications unnecessary for the current analysis. The dataset includes 10,116 data points with 53 features for predicting CO2 solubility in ILs. It covers 124 ILs across a temperature range of 243.2 K to 453.15 K and a pressure range of 0.00798 bar to 499 bar. The cations include imidazolium, pyridinium, piperidinium, pyrrolidinium, phosphonium, sulfonium, and ammonium. The anions include tetrafluoroborate [BF4], dicyanamide [DCA], hexafluorophosphate [PF6], chloride [Cl], nitrate [NO3], tricyanomethanide [C(CN)3], thiocyanate [SCN], bis(trifluoromethylsulfonyl)amide [Tf2N], hydrogen sulfate [HSO4], and methylsulfate [MeSO4], among others.
This study aims to develop deep-learning models for predicting CO2 solubility in ILs. Previous research by Song et al.30 on this dataset did not adequately address the optimization and regularization of neural network modelling. Our study fills this gap by focusing on several critical aspects, including model validation, hyperparameter tuning, computational efficiency, and the impact of neuron configurations on model performance. The modelling uses temperature, pressure (considered the most important features in CO2 capture due to their direct impact on IL performance), and other relevant factors (referred to as input parameters) to predict CO2 solubility (the output). The dataset of 10,116 data points is divided into training (80%) and testing (20%) sets to develop the deep learning models, so the training set contains 8093 data points and the testing set 2023. During model training, 10% of the data was set aside for validation to ensure optimal performance. This dedicated validation set enabled monitoring of the model's validation loss curves throughout training; by analyzing these curves, potential overfitting could be identified, allowing necessary adjustments to the model's architecture or training parameters.
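As an illustration of this split, the following Python sketch (assuming scikit-learn, with random placeholder arrays standing in for the actual 10,116 × 53 feature matrix and solubility vector) reproduces the 80/20 partition; the further 10% validation split is applied at training time:

```python
# Minimal sketch of the 80/20 split described above, assuming scikit-learn.
# X and y are random placeholders for the real feature matrix and targets.
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(42)
X = rng.random((10116, 53))   # placeholder for the 53 input features
y = rng.random(10116)         # placeholder for CO2 solubility values

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, random_state=42)  # 8093 train / 2023 test
# The further 10% validation split is applied during training,
# e.g. via validation_split=0.1 in Keras' model.fit.
```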
Model development
This section delves into the development of two deep learning models—an ANN and an LSTM-RNN network—for predicting CO2 solubility in ILs.
Artificial neural network (ANN)
An ANN is a biologically inspired network of artificial neurons designed to perform various tasks34, including regression35, classification36, verification, and recognition. An ANN can capture complex nonlinear relationships and can be used to predict CO2 solubility37; the literature indicates that various studies have used ANN models to predict CO2 solubility in ILs37–40. An ANN consists of several layers with a certain number of neurons in each layer. As a feed-forward neural network, it comprises three types of layers: input, hidden, and output. The topology is shown in Fig. 1.
Figure 1.

Schematic structure of ANN model30.
The input layer receives 53 features consisting of temperature, pressure, and functional groups, giving an input vector p of size (53 × 1). The function of the hidden layers is to transfer this input information to the output layer, where the solubility is predicted. The output of a hidden layer is given by Eq. (1), and Eq. (2) defines the output of the output layer.
$$a = f_1\left(W_1 p + b_1\right) \tag{1}$$

$$y = f_2\left(W_2 a + b_2\right) \tag{2}$$
The ANN architecture comprises one input layer, one output layer, and three hidden layers, with each hidden layer equipped with 64 neurons to optimize model accuracy (see Supplementary Fig. S1). A detailed discussion of the neuron-count adjustment is presented in "ANN model". Activation functions are applied to the hidden and output layers; their primary role is to transform the summed weighted input of a node into the output value passed to the next layer. In other words, the function decides whether a neuron's input is relevant to the prediction. Different activation functions are used in neural networks, such as the sigmoid function, the tanh (hyperbolic tangent) function, the rectified linear unit (ReLU), and SoftMax. The present study used ReLU for both the hidden and output layers. ReLU is a piecewise linear function that is computationally more efficient than the sigmoid and tanh functions because it does not activate all neurons at the same time41. The mathematical expression for the ReLU function is given below.
$$f(x) = \max(0, x) \tag{3}$$
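For concreteness, the following is a minimal sketch of the ANN described above, assuming a Keras/TensorFlow implementation (the framework used is not stated here); the layer widths, activations, and Adam learning rate follow the architecture reported in this study:

```python
# Minimal sketch of the ANN described above, assuming Keras/TensorFlow.
from tensorflow import keras
from tensorflow.keras import layers

def build_ann(n_features: int = 53) -> keras.Model:
    """Feed-forward network: three hidden layers of 64 ReLU neurons each."""
    model = keras.Sequential([
        keras.Input(shape=(n_features,)),
        layers.Dense(64, activation="relu"),
        layers.Dense(64, activation="relu"),
        layers.Dense(64, activation="relu"),
        layers.Dense(1, activation="relu"),  # ReLU also used on the output layer
    ])
    model.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-3),
                  loss="mse", metrics=["mae"])
    return model
```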
Long short-term memory (LSTM) model
An LSTM is a special type of RNN architecture. Standard RNN models perform poorly on long-term dependencies due to the vanishing gradient problem42. The LSTM is an extension of the RNN that uses memory structures to learn long-term information, which efficiently mitigates gradient problems43,44. The LSTM model gathers important information from the input and retains it over long periods in a memory cell within the LSTM unit. A simple LSTM unit contains a cell, an input gate, a forget gate, and an output gate, as shown in Fig. 2. The cell remembers values over arbitrary time intervals; the input gate decides which information should be added to the memory cell, the forget gate decides whether to remove or retain that information, and the output gate decides whether the existing information should proceed for analysis. Each LSTM cell involves six components at each timestep: a forget gate (a neural network with a sigmoid function), a candidate layer (a neural network with a tanh function), an input gate (a neural network with a sigmoid function), an output gate (a neural network with a sigmoid function), a hidden state (a vector), and a memory state (a vector), as shown in Eqs. (4) to (9). The first component is the forget gate $f_t$, which performs a linear computation on $x_t$ (the current input) and $h_{t-1}$ (the previous hidden state). Its output lies between 0 and 1, where 0 means the previous memory state is completely forgotten, and 1 means the previous memory state is passed on to the cell in full. The second component is the input gate, which involves two layers: a sigmoid layer that decides which values to update, and a tanh layer that creates a vector of new candidate values $\tilde{C}_t$ to add to the LSTM memory. These are obtained from Eqs. (5) and (6), after which the cell state $C_t$ is updated via Eq. (7). Equation (8) gives the output gate $o_t$, and the final output at the hidden state $h_t$ is obtained from Eq. (9).
$$f_t = \sigma\left(W_f \cdot [h_{t-1}, x_t] + b_f\right) \tag{4}$$

$$i_t = \sigma\left(W_i \cdot [h_{t-1}, x_t] + b_i\right) \tag{5}$$

$$\tilde{C}_t = \tanh\left(W_C \cdot [h_{t-1}, x_t] + b_C\right) \tag{6}$$

$$C_t = f_t \odot C_{t-1} + i_t \odot \tilde{C}_t \tag{7}$$

$$o_t = \sigma\left(W_o \cdot [h_{t-1}, x_t] + b_o\right) \tag{8}$$

$$h_t = o_t \odot \tanh\left(C_t\right) \tag{9}$$
Figure 2.
Basic LSTM layer structure45.
The LSTM architecture comprises an input layer, two hidden layers of 64 neurons each, and an output layer (see Supplementary Fig. S2). While simpler RNN architectures exist, this study employs LSTM networks due to their well-established capability to handle sequential data with long-term dependencies. In CO2 solubility prediction, the relationship between past and present data points can be crucial, especially when considering factors like temperature history or pressure fluctuations. Unlike simpler RNNs that struggle with vanishing gradients, LSTMs incorporate memory cells and gates that effectively capture and utilize these long-term dependencies, leading to potentially more accurate CO2 solubility predictions.
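A minimal sketch of this LSTM architecture, again assuming Keras/TensorFlow, is shown below; reshaping each sample's 53 features into a single-timestep sequence is an assumption of this sketch, since the arrangement of the tabular inputs for the LSTM is not stated here:

```python
# Minimal sketch of the LSTM model, assuming Keras/TensorFlow. Treating each
# sample as a one-timestep sequence of 53 features is an assumption.
from tensorflow import keras
from tensorflow.keras import layers

def build_lstm(n_features: int = 53) -> keras.Model:
    model = keras.Sequential([
        keras.Input(shape=(1, n_features)),      # (timesteps, features)
        layers.LSTM(64, return_sequences=True),  # first hidden LSTM layer (tanh default)
        layers.LSTM(64),                         # second hidden LSTM layer
        layers.Dense(1),
    ])
    model.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-3),
                  loss="mse", metrics=["mae"])
    return model

# Training settings reported in this study: batch size 16, 280 epochs, e.g.
# build_lstm().fit(X_train.reshape(-1, 1, 53), y_train,
#                  validation_split=0.1, batch_size=16, epochs=280)
```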
Sobol sensitivity analysis
Sobol sensitivity analysis, introduced by Sobol46, is a variance-based method that offers a global perspective. It aims to determine the contribution of each parameter, and of the interactions among parameters, to the variance observed in the model output. Generally, the allocation of the overall output variance to individual model parameters and their interactions is written as
$$D = \sum_{i} D_i + \sum_{i<j} D_{ij} + \sum_{i<j<k} D_{ijk} + \cdots + D_{12 \ldots k} \tag{10}$$
where $D$ represents the total variance of the output metric, $D_i$ is the first-order variance contribution of parameter $i$, $D_{ij}$ is the second-order contribution of the interaction between parameters $i$ and $j$, and $D_{ijk}$ and subsequent terms contain all interactions of third order and higher, up to the $k$ total parameters.
The first-order and total-order sensitivity indices are defined as follows.
First-order index:
$$S_i = \frac{D_i}{D} \tag{11}$$
Total order index:
$$S_{Ti} = 1 - \frac{D_{\sim i}}{D} \tag{12}$$
The first-order index $S_i$ captures the relative contribution of parameter $i$ to the total output variance, excluding any interactions with other parameters. The total-order index $S_{Ti}$ equals one minus the fraction of the total variance assigned to $D_{\sim i}$, which includes all parameters except $i$. By excluding parameter $i$ from the analysis, the total-order index attributes the resulting decrease in variance to that specific parameter47. The difference between a parameter's first-order and total-order indices corresponds to the impact of its interactions with other parameters.
This study analyzes the total order indices to ascertain the relative importance of model parameters regarding sensitivity. Total order indices, obtained through Sobol sensitivity analysis, capture the combined impact of each input parameter on the model output, accounting for both individual effects and interactions with other parameters. This analysis is crucial for identifying the parameters that significantly influence the variation in predicted CO2 solubility. Alternative sensitivity analysis methods might not provide the same level of detail. For instance, Morris sensitivity analysis, while efficient for initial screening, might not offer the in-depth information about individual and interactive effects that Sobol sensitivity analysis provides through total order indices. To ensure the robustness of our findings, we also employed the Morris method, allowing us to compare and select the most effective approach.
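As an illustration, the Sobol indices can be computed with the SALib package as sketched below; the temperature and pressure bounds follow the dataset ranges, while the functional-group bounds and the toy surrogate standing in for the trained network are placeholders:

```python
# Illustrative Sobol analysis with SALib. The group bounds and the surrogate
# below are placeholders; in practice, the trained network's predictions
# replace surrogate().
import numpy as np
from SALib.sample import saltelli
from SALib.analyze import sobol

problem = {
    "num_vars": 53,
    "names": ["T", "P"] + [f"group_{i}" for i in range(51)],
    "bounds": [[243.2, 453.15], [0.00798, 499.0]] + [[0.0, 8.0]] * 51,
}

def surrogate(X):
    # Toy response: solubility rises with pressure, falls with temperature.
    return X[:, 1] / (X[:, 1] + 0.05 * X[:, 0])

X_s = saltelli.sample(problem, 512)   # Saltelli sampling scheme
Y_s = surrogate(X_s)                  # model evaluations
Si = sobol.analyze(problem, Y_s)
print(dict(zip(problem["names"][:2], Si["ST"][:2])))  # total-order indices of T, P
```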
Morris sensitivity analysis
The method of Morris48 calculates global sensitivity measures from a set of local derivatives, known as elementary effects, sampled on a grid covering the parameter space. The method is based on a one-at-a-time (OAT) approach, in which each parameter $x_i$ is perturbed along a grid with a step size of $\Delta$. This perturbation creates a trajectory through the parameter space, enabling sensitivity analysis across different parameter values. In a model with $p$ parameters, a single trajectory comprises a sequence of $p$ perturbations, and each trajectory provides an estimate of the elementary effect of each parameter: the ratio of the change in the model output to the change in the respective parameter. Equation (13) shows the computation of a single elementary effect for parameter $i$.
$$EE_i = \frac{f\left(x_1, \ldots, x_i + \Delta, \ldots, x_p\right) - f(x)}{\Delta} \tag{13}$$
where $x$ represents the prior point in the trajectory. In alternative formulations, the numerator and denominator are normalized by the values of the function $f$ and parameter $x_i$, respectively, at a reference or prior point49, so that the elementary effect is expressed relative to the function and parameter values at that point. Employing the single trajectory of Eq. (13), the elementary effects for all parameters can be computed with just p + 1 model evaluations. Nevertheless, since this one-at-a-time (OAT) scheme relies on a single trajectory, its results depend heavily on the location of the initial point in the parameter space and do not account for interactions between parameters. To address this limitation, the Morris method48 repeats the procedure across N trajectories throughout the parameter space.
The Morris method relies on the concept of elementary effects: the change in the model output (predicted CO2 solubility) caused by small perturbations of a single input parameter at different points in the parameter space. The method uses a grid-based approach to compute elementary effects, repeatedly sampling the parameter space and slightly increasing or decreasing the value of a single parameter at each sample point while keeping all other parameters fixed. The difference between the model outputs obtained with the original and perturbed parameter values is the corresponding elementary effect48. The mean effect (μ) is the average of a parameter's elementary effects and indicates its overall influence on the model output: a positive value suggests the parameter generally increases CO2 solubility, while a negative value indicates the opposite.
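A companion Morris screening sketch, reusing the `problem` definition and `surrogate` stand-in from the Sobol example above, might look as follows (the trajectory count and grid levels shown are illustrative):

```python
# Morris screening with SALib, reusing `problem` and surrogate() from above.
from SALib.sample import morris as morris_sample
from SALib.analyze import morris as morris_analyze

X_m = morris_sample.sample(problem, N=100, num_levels=4)  # 100 trajectories
Y_m = surrogate(X_m)
res = morris_analyze.analyze(problem, X_m, Y_m, num_levels=4)
print(res["mu"][:2], res["mu_star"][:2])  # signed mean and absolute-mean effects for T, P
```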
Statistical indexes as an error function
In this section, the reliability and accuracy of the predicted models were evaluated through statistical analysis. Five key statistical indexes were determined: coefficient of determination (R2), root mean square error (RMSE), mean squared error (MSE), mean absolute error (MAE), and average absolute relative deviation (AARD). These indexes provide a comprehensive assessment of the model's performance and ability to predict CO2 solubility accurately.
$$R^2 = 1 - \frac{\sum_{i=1}^{N}\left(y_i^{\mathrm{exp}} - y_i^{\mathrm{pred}}\right)^2}{\sum_{i=1}^{N}\left(y_i^{\mathrm{exp}} - \bar{y}^{\mathrm{exp}}\right)^2} \tag{14}$$

$$RMSE = \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(y_i^{\mathrm{exp}} - y_i^{\mathrm{pred}}\right)^2} \tag{15}$$

$$MSE = \frac{1}{N}\sum_{i=1}^{N}\left(y_i^{\mathrm{exp}} - y_i^{\mathrm{pred}}\right)^2 \tag{16}$$

$$MAE = \frac{1}{N}\sum_{i=1}^{N}\left|y_i^{\mathrm{exp}} - y_i^{\mathrm{pred}}\right| \tag{17}$$

$$AARD\,(\%) = \frac{100}{N}\sum_{i=1}^{N}\left|\frac{y_i^{\mathrm{exp}} - y_i^{\mathrm{pred}}}{y_i^{\mathrm{exp}}}\right| \tag{18}$$
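These five metrics can be computed directly with NumPy, as in the following sketch (assuming arrays `y` of experimental values and `y_hat` of predictions):

```python
# Sketch of the five evaluation metrics for arrays y (experimental) and
# y_hat (predicted); assumes no experimental value is exactly zero for AARD.
import numpy as np

def evaluate(y: np.ndarray, y_hat: np.ndarray) -> dict:
    resid = y - y_hat
    mse = np.mean(resid ** 2)
    return {
        "R2": 1.0 - np.sum(resid ** 2) / np.sum((y - y.mean()) ** 2),
        "RMSE": np.sqrt(mse),
        "MSE": mse,
        "MAE": np.mean(np.abs(resid)),
        "AARD%": 100.0 * np.mean(np.abs(resid / y)),
    }
```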
Results and discussion
ANN model
The performance in predicting CO2 solubility differed between models even when the same parameters and optimization methods were used, so a careful choice of optimizer was needed to update the attributes of the neural network models. Ruder50, in a comprehensive review of modern optimization algorithms, recommended 'Adam' as a superior choice among optimizer techniques; hence, the Adam optimizer, coupled with the ReLU activation function, was employed for each model to achieve optimized efficiency.
In neural network modelling, the learning rate is a crucial hyperparameter that influences how the model updates its weights during training. A well-chosen learning rate ensures the model learns effectively, neither too slowly (which prolongs convergence) nor too quickly (which can cause the optimization to overshoot). A learning rate of 0.001 was selected from the tested range because further decreases resulted in a significant decline in model performance.
A critical step in neural network design is determining the ideal number of neurons in the hidden layers. Too few neurons can lead to underfitting, where the model fails to capture crucial patterns in the data; conversely, too many neurons can cause overfitting, where the model memorizes noise instead of learning the underlying relationships. This study began with an architecture containing 8 neurons per hidden layer and systematically increased this number to 64 neurons per layer, searching for the optimal balance between underfitting and overfitting (see Fig. 13; a schematic version of this sweep is sketched below). The ANN model incorporates three hidden layers, each containing 64 neurons. A visual representation of this architecture, generated using the NETRON tool51, is provided in Supplementary Fig. S1 of the Supplementary Material.
Figure 14.
Sobol and Morris sensitivity indices of temperature and pressure with 53 functional groups.
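A schematic version of the neuron-count sweep described above might look as follows, assuming the Keras setup of the earlier ANN sketch, the training split from the data-preparation sketch, and an illustrative epoch budget:

```python
# Schematic width search over hidden-layer sizes; the 100-epoch budget is
# illustrative, and X_train/y_train come from the earlier split sketch.
from tensorflow import keras
from tensorflow.keras import layers

def build_ann_width(width: int, n_features: int = 53) -> keras.Model:
    model = keras.Sequential(
        [keras.Input(shape=(n_features,))]
        + [layers.Dense(width, activation="relu") for _ in range(3)]
        + [layers.Dense(1, activation="relu")])
    model.compile(optimizer=keras.optimizers.Adam(1e-3),
                  loss="mse", metrics=["mae"])
    return model

for width in (8, 16, 32, 64):
    hist = build_ann_width(width).fit(X_train, y_train, validation_split=0.1,
                                      epochs=100, batch_size=16, verbose=0)
    print(width, min(hist.history["val_mae"]))  # best validation MAE per width
```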
Model validation was performed to verify the accuracy and fit of the model. The training and validation loss functions served as metrics for evaluating the efficiency of the ANN model: the training loss gauges how effectively the model learns the patterns within the training data, while the validation loss indicates how well it generalizes to new data. Ideally, the training loss should decrease as the model learns, while the validation loss should remain stable or increase only slightly, indicating that the model avoids overfitting the training data. Figure 3 illustrates the training and validation loss curves using the MAE metric. The curves show a significant decrease in training loss (blue line), indicating successful learning by the ANN model, while the validation loss (red line) remains stable, suggesting that the model avoids overfitting and generalizes well to new data.
Figure 3.

Mean absolute error (MAE) loss curves for the ANN model showing training and validation performance over epochs.
This study aims to optimize the performance of the ANN model by utilizing different activation functions and a higher number of neurons than the previous study30 on this dataset. The ANN model exhibited improved performance with an increased number of neurons: more neurons enable the network to learn more complex decision boundaries and express a broader spectrum of functions, ultimately improving model capacity52,53. Table 2 summarizes the performance of the ANN model on the training (8093 data points) and testing (2023 data points) datasets using the R2, MAE, RMSE, MSE, and AARD metrics. The testing R2 of 0.986 and MAE of 0.0171 indicate a good fit between the predictions and the experimental values. The ANN model showed a decrease in MSE as the number of neurons in the hidden layers increased, indicating that a more complex architecture enhanced its learning capability. Figure 4 compares the actual and predicted CO2 solubility values for both the training and testing sets; both datasets cluster tightly around the diagonal line, indicating a good fit with the experimental CO2 solubility data. However, a few outliers are observed, which may be attributed to measurement variations.
Table 2.
Comparison of performance evaluation metrics for training and testing datasets in the ANN model.
| ANN model | R2 | MAE | RMSE | MSE | AARD (%) |
|---|---|---|---|---|---|
| Training set | 0.991 | 0.0141 | 0.0220 | 0.0005 | 26.238 |
| Testing set | 0.986 | 0.0171 | 0.0273 | 0.0007 | 28.054 |
Figure 4.

Comparison of actual and ANN-predicted CO2 solubility.
The discrepancy between predicted and experimentally measured values was analyzed to assess model performance. Figure 5 presents the distribution of errors between predicted and experimental solubilities for the ANN model. Most data points fall within a narrow range of −0.05 to 0.05, indicating a distribution concentrated close to zero; however, a few outliers exhibit higher error values. Figure 6 presents a histogram of the error distribution for the ANN model to provide further insight into the range of prediction errors. The error distribution is concentrated around zero with minimal deviation, suggesting the ANN model accurately predicts CO2 solubility across various temperatures and pressures in ILs.
Figure 5.

ANN model errors for predicting CO2 solubility.
Figure 6.

Distribution of prediction error of ANN model.
LSTM model
This study implements a Long Short-Term Memory (LSTM) model for predicting CO2 solubility in ILs; LSTM models have seen limited application in this field. Deng et al.31 used a classic Recurrent Neural Network (RNN) model for predicting CO2 solubility in ILs, employing a dataset of 218 data points. While RNNs are typically used for time-series problems to analyze long-term dependencies, their applicability to regression problems has been demonstrated31. This study improves significantly on the previous work of Song et al.30 by replacing the SVM model with an LSTM neural network model, leading to more accurate prediction of CO2 solubility in ILs.
The LSTM model was structured with a dual-layer configuration, each layer containing 64 neurons (Supplementary Fig. S2). The widely used "tanh" activation function, the default choice for LSTM hidden layers, was retained34. Dropout layers are typically utilized to address overfitting54; however, since the LSTM model showed no signs of overfitting and performed adequately, dropout was not incorporated. The Adam optimizer with a learning rate of 0.001 was used to train the LSTM model. For hyperparameter tuning, various batch sizes were tested, and a batch size of 16 with 280 epochs yielded the best results; the number of epochs was determined by monitoring the training and validation loss curves. Figure 7 shows the MAE loss curves for the training and validation data. The training curve reveals a significant drop in MAE (blue line), highlighting the robust learning capability of the LSTM model, while the MAE loss for the validation data (red dashed line) shows the stability of the model and the avoidance of overfitting. Table 3 provides evaluation metrics comparing the training and testing performance of the LSTM model. The LSTM model achieved an R2 of 0.985 and an MAE of 0.0175 on the testing data, differing from the training values by 0.41% and 11.9%, respectively. The predicted CO2 solubilities are compared with the experimental values in Fig. 8: the data points for the training (black circles) and testing (blue triangles) datasets are evenly distributed around the diagonal line, indicating good agreement between the predicted and experimental CO2 solubility values.
Figure 7.

Mean absolute error (MAE) loss curves for the LSTM model showing training and validation performance over epochs.
Table 3.
Comparison of performance evaluation metrics for training and testing datasets in the LSTM model.
| LSTM model | R2 | MAE | RMSE | MSE | AARD% |
|---|---|---|---|---|---|
| Training set | 0.989 | 0.0155 | 0.0258 | 0.0007 | 8.154 |
| Testing set | 0.985 | 0.0174 | 0.0293 | 0.0009 | 10.793 |
Figure 8.

Comparison of experimental and LSTM predicted CO2 solubility.
Figure 9 depicts the distribution of errors between the predicted and experimental CO2 solubility values for the LSTM model on both the training and testing datasets. The LSTM model also demonstrates a favorable error distribution for the training and testing data, with errors falling within −0.1 to 0.1 and a consistent distribution centered around zero. This suggests good accuracy in predicting CO2 solubility, although the ANN model achieves a slightly lower error margin. Figure 10 uses histograms to provide a more granular visualization of the error distribution for the LSTM model; the histograms reveal minimal deviations from zero, indicating that the model predicts CO2 solubility accurately.
Figure 9.

LSTM model errors for predicting CO2 solubility.
Figure 10.

Distribution of prediction errors of the LSTM model.
Models comparison
A comprehensive evaluation compares the performance and computational efficiency of ANN and LSTM models for predicting CO2 solubility in ILs. This evaluation considers accuracy, training time (CPU usage), and memory expenditure during training. The computational cost of neural network models during training is analyzed by comparing their CPU time (seconds) and memory consumption (Mebibytes, MiB).
Figure 11 presents the CPU time and memory usage over the training epochs for the ANN and LSTM models. In terms of CPU time, the ANN model proves to be much more efficient: each training epoch for the ANN model takes approximately 1 s (Fig. 11a), whereas the LSTM model requires significantly longer, averaging between 20 and 30 s per epoch (Fig. 11b). Training the ANN model required a total CPU time of 4.03 min, roughly 31 times less than the LSTM model (126.85 min). Comparing peak memory usage, the LSTM model consumed the most memory, reaching a peak of 733.93 MiB at the end of training, while the ANN model peaked at 535.98 MiB. LSTMs incorporate memory cells that store past information, resulting in a larger memory footprint than the more straightforward layer-based structure of ANNs. LSTMs are also inherently more complex architectures, with memory cells and gates (input, output, and forget) controlling information flow, which contributes to a higher computational load during training.
Figure 11.
CPU time and memory usage during model training: (a) ANN model (b) LSTM model.
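One way to record such measurements, assuming the third-party `memory_profiler` package (which reports memory in MiB, the unit used here) and the model and data from the earlier sketches, is:

```python
# Illustrative logging of training CPU time and peak memory; assumes the
# memory_profiler package, build_ann() from the ANN sketch, and X_train/
# y_train from the split sketch. The epoch count is illustrative.
import time
from memory_profiler import memory_usage

model = build_ann()
start = time.process_time()
peak_mib = max(memory_usage((model.fit, (X_train, y_train),
                             {"validation_split": 0.1, "epochs": 50,
                              "batch_size": 16, "verbose": 0})))
cpu_min = (time.process_time() - start) / 60.0
print(f"CPU time: {cpu_min:.2f} min, peak memory: {peak_mib:.2f} MiB")
```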
Table 4 summarizes the statistical comparison of ANN and LSTM models regarding model performance and error ranges. The ANN model performed slightly better than the LSTM model in terms of prediction accuracy. The R2 values of testing data in the ANN and LSTM models are 0.986 and 0.985, respectively. The MAE of the ANN model is 2.3% lower than the LSTM. Although both models have demonstrated excellent performance, the ANN model outperforms the LSTM model regarding computational cost and efficiency.
Table 4.
Statistical comparison of ANN and LSTM models.
| Models | R2 | MAE | RMSE | MSE | AARD (%) |
|---|---|---|---|---|---|
| ANN | 0.986 | 0.0172 | 0.0274 | 0.0007488 | 28.05 |
| LSTM | 0.985 | 0.0175 | 0.0294 | 0.0008616 | 10.80 |
Regarding the AARD values, it is worth noting that the LSTM model (10%) exhibits less deviation than the ANN model (28%). Initially, the ANN model recorded an AARD value of 57.5%, which was later reduced to 28.05% by increasing the number of hidden layers from 1 to 3 and adjusting the neuron count. The higher AARD percentage can be attributed to using a large dataset with diverse input parameters.
Song et al.30 developed an ANN-GC model using the current dataset to predict CO2 solubility in ILs. Figure 12 compares evaluation metrics between the current ANN and LSTM models and the ANN-GC model from the previous study30. The LSTM and ANN models slightly outperformed the ANN-GC model in prediction accuracy; specifically, the prediction accuracy of the current ANN model increased by 0.2%, accompanied by a 13% reduction in MAE compared to the ANN-GC model30. Table 5 compares the ANN modelling methodology of this study with that of the previous study30. This study adopts the ReLU activation function due to its computational efficiency and effectiveness with large datasets; it captures non-linear patterns and gradients well, making it suited to a wide range of problems. It is worth noting that Song et al.30 achieved high accuracy with only 7 neurons in a single hidden layer, compared to the 64 neurons per hidden layer used here. Selecting the number of neurons is a crucial step in designing neural networks, especially for large datasets, where a higher number of neurons and more hidden layers are generally preferred: this allows the network to learn the complex patterns and relationships in big data more effectively. Their study, however, lacks information about the hyperparameter tuning and optimization processes, whereas this study investigated the training and testing accuracy by adjusting the learning rate (using the Adam optimizer) and the batch size for model training. Figure 13 visualizes the effect of varying the number of neurons and hidden layers on ANN model accuracy; optimal results were achieved with 3 hidden layers of 64 neurons each. Optimization aims to minimize the discrepancies between the predicted and actual outputs, and as observed in Fig. 13, adjusting the number of hidden layers and neurons significantly reduced prediction errors.
Figure 12.

Performance comparison of the ANN and LSTM models with the ANN-GC model developed by Song et al.30.
Table 5.
Statistical comparison of ANN-GC model30 with this study for the ANN model.
| Song et al. 30 | This study | |
|---|---|---|
| Activation function | Tansig and Purelin | ReLU |
| Hidden layers | 1 | 3 |
| Neurons per hidden layer | 7 | 64 |
| R2 | 0.984 | 0.986 |
| MAE | 0.0202 | 0.0172 |
Figure 13.
ANN model accuracy with different numbers of neurons and hidden layers.
A study by Deng et al.31 employed an ANN model and achieved a high R2 of 0.999. However, their model was trained on a relatively small dataset of 218 data points for only 13 types of ILs. This limited data size might contribute to the high accuracy, as smaller datasets can sometimes lead to overfitting. Additionally, their ANN architecture utilized a 7-layer network with neuron counts decreasing from 500 in the first layer to 1 in the final layer. While this complex architecture may have performed well on their specific dataset, their study did not explicitly evaluate the impact of the number of neurons on model performance.
In addition to the DL models, traditional ML regression techniques, namely Random Forest Regression (RFR) and Gradient Boosting Regression (GBR), were applied to this comprehensive dataset; they achieved R2 values of 0.974 and 0.966, respectively (a schematic implementation is sketched after Table 6). A detailed visualization of the predicted values and their associated error ranges for both models is presented in Supplementary Fig. S3 of the supplementary materials. A review of multiple literature sources was conducted to obtain a comprehensive overview of model prediction accuracy with respect to statistical parameters, the number of data points, and the variety of ILs used for predicting CO2 solubility. Table 6 compares the performance of various machine learning and thermodynamics-based models for CO2 solubility prediction in ILs. Interestingly, models with higher R2 values and lower AARD values are often associated with fewer data points and fewer ILs in their respective studies. Despite the challenges associated with larger datasets, the study by Venkatraman and Alsberg28 demonstrates promising results with a higher number of ILs and data points: their RF and CTREE models achieved R2 values of 0.92 and 0.82, respectively. Song et al.30 reported the most extensive dataset for ILs, developing ANN-GC and SVM-GC models with reliable R2 values of 0.9836 and 0.9783, respectively. Among the surveyed studies, Mesbah et al.55 introduced an MLP-ANN model that achieved the highest R2 value of 0.9987 and the lowest AARD value of 1.8416; this model was evaluated on a dataset comprising 20 ILs and 1386 data points.
Table 6.
Various model comparisons for CO2 solubility in ILs.
| Model name | Data points | No. of ILs | R2 | MAE | MSE | RMSE | %AARD | References |
|---|---|---|---|---|---|---|---|---|
| MLPNN | 548 | 1 | 0.98631 | – | 0.00094 | 0.03066 | 7.21 | 55 |
| CFNN | 548 | 1 | 0.98808 | – | 0.00080 | 0.02829 | 6.88 | |
| GRNN | 548 | 1 | 0.99079 | – | 0.00062 | 0.02490 | 13.6 | |
| RBF | 548 | 1 | 0.98516 | – | 0.00100 | 0.03159 | 13.5 | |
| ANFIS | 548 | 1 | 0.98732 | – | 0.00085 | 0.02919 | 7.95 | |
| LS-SVM | 548 | 1 | 0.98428 | – | 0.00106 | 0.03253 | 9.71 | 40 |
| DT | 1668 | 40 | 0.94 | – | – | – | 21.24 | 56 |
| RF | 1668 | 40 | 0.96 | – | – | – | 12.05 | |
| LSSVM | 1668 | 40 | 0.75 | – | – | – | 31.01 | |
| MLR | 1668 | 40 | 0.55 | – | – | – | 40.48 | |
| COSMO-RS | 10,848 | 185 | 0.71 | 0.12 | – | 0.19 | – | 28 |
| RF | 10,848 | 185 | 0.92 | 0.04 | – | 0.07 | – | |
| CTREE | 10,848 | 185 | 0.82 | 0.10 | – | 0.07 | – | |
| DNN | 218 | 13 | 0.984 | 0.291 | – | 0.757 | – | 31 |
| CNN | 218 | 13 | 0.999 | 0.145 | – | 0.206 | – | |
| RNN | 218 | 13 | 0.988 | 0.25 | – | 0.651 | – | |
| XG Boost | 218 | 13 | 0.981 | 0.175 | – | 0.586 | – | |
| MLP-ANN | 1386 | 20 | 0.9987 | – | 0.6293 | – | 1.8416 | 55 |
| ANFIS | 728 | 14 | 0.9972 | – | 0.00294 | 0.05423 | – | 37 |
| MLP-ANN | 728 | 14 | 0.6989 | – | 0.00013 | 1.00145 | – | |
| PR-EOS | 728 | 14 | 0.9376 | – | 0.00270 | 0.05198 | – | |
| SRK-EOS | 728 | 14 | 0.9336 | – | 0.00558 | 0.07468 | – | |
| MLR | 32 | 32 | 0.892 | – | – | 0.372 | 6.33 (ARD) | 57 |
| LS-SVM | 32 | 32 | 0.962 | – | – | 0.221 | 4.42 (ARD) | |
| PSO-ANFIS | 1119 | 11 | 0.9397 | – | 0.3910 | – | 14.1286 | 58 |
| CSA-LSSVM | 1119 | 11 | 0.9846 | – | 0.0999 | – | 3.0410 | |
| BP | 544 | 9 | 0.9982 | 0.0068 | – | 0.0090 | – | 59 |
| SVM | 544 | 9 | 0.9933 | 0.0105 | – | 0.0174 | – | |
| ELM | 544 | 9 | 0.9961 | 0.0093 | – | 0.0136 | – | |
| Linear fusion model I | 544 | 9 | 0.9983 | 0.0062 | – | 0.0090 | – | |
| Linear fusion model II | 544 | 9 | 0.9985 | 0.0060 | – | 0.0084 | – | |
| GMDH | 4726 | 60 | 0.9043 | – | – | 0.0765 | – | 60 |
| ANN-GC | 10,116 | 124 | 0.9836 | 0.0202 | – | – | – | 30 |
| SVM-GC | 10,116 | 124 | 0.9783 | 0.0240 | – | – | – | |
| ANN | 10,116 | 124 | 0.986 | 0.0172 | 0.00074 | 0.02736 | 28.054 | This study |
| LSTM | 10,116 | 124 | 0.985 | 0.0175 | 0.00086 | 0.02935 | 10.793 | |
| RF | 10,116 | 124 | 0.974 | 0.0232 | 0.00138 | 0.03719 | 11.875 | |
| GBR | 10,116 | 124 | 0.966 | 0.0305 | 0.00182 | 0.04277 | 67.82 |
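A sketch of the RFR and GBR baselines reported above, assuming scikit-learn and the train/test split from the earlier data-preparation sketch (the hyperparameters shown are illustrative defaults, not the tuned values used in this study):

```python
# Baseline ensemble regressors; hyperparameters are illustrative defaults.
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.metrics import r2_score

for est in (RandomForestRegressor(n_estimators=200, random_state=42),
            GradientBoostingRegressor(random_state=42)):
    est.fit(X_train, y_train)
    print(type(est).__name__, round(r2_score(y_test, est.predict(X_test)), 3))
```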
Global sensitivity analysis (GSA)
CO2 solubility in ILs is strongly influenced by input parameters such as temperature, pressure, and the presence of functional groups. Blanchard et al.61 demonstrated efficient CO2 dissolution in ILs at 25 °C and pressures up to 40 MPa. Extensive research has explored CO2 absorption with ILs, encompassing both conventional ILs relying on physisorption and functionalized ILs utilizing chemisorption mechanisms14. Generally, for conventional ILs, the anions are more effective for CO2 absorption, while cations have relatively low effects.
The solubility of CO2 in ILs has been investigated through Global Sensitivity Analysis (GSA) to assess the relative impacts of process parameters, including temperature, pressure, and various functional groups. This analysis aims to ascertain the significance of these factors on the solubility behaviour of CO2 in ILs. GSA is a robust approach that evaluates the influence of input parameters on outputs by allowing all inputs to fluctuate within predefined ranges47, providing valuable insights into the consequences of input variations on the overall system behaviour.
For GSA, two widely used techniques, Sobol sensitivity analysis46 and Morris sensitivity analysis48, were applied to analyze the effect of input variables on CO2 solubility in ILs. In the Sobol method, the total sensitivity index (ST) is utilized to assess the overall impact of an input variable on CO2 solubility. The ST quantifies an input variable's total effect on the model output. On the other hand, the Morris method employs the μ index, which represents the average effect of each input variable over the sampled parameter space. It quantifies the average change in the model output when a variable is perturbed while holding other variables constant. Higher μ values indicate a more significant influence of the variable on the model output.
Figure 14 presents the results of both the Sobol and Morris global sensitivity analyses for temperature (T), pressure (P), and the functional groups. While both methods provide valuable insights, they may present slightly different perspectives. Pressure emerges as the dominant factor affecting CO2 solubility: as evident in Fig. 14a, both methods indicate a significant sensitivity index for pressure, meaning that changes in pressure have a strong impact on predicted CO2 solubility values. The temperature (T) index is positive in the Sobol analysis and negative in the Morris analysis. The positive Sobol value seemingly contradicts the established knowledge that temperature has a negative impact on CO2 solubility (i.e., higher temperature leads to lower CO2 solubility). However, the Sobol method is sensitive to non-linear relationships between input parameters and the output, and the true relationship between temperature and CO2 solubility may be non-linear within the range of the data: the positive Sobol index might capture an initial increase in CO2 solubility followed by a decrease at higher temperatures, which a simple negative index would not reflect. Jerng et al.62 have indicated that CO2 solubility decreases with increasing temperature, and the Morris method suggests a negative correlation between temperature and CO2 solubility, aligning with the observation that CO2 solubility increases as temperature decreases. Figure 14b,c display the sensitivity indices for the various functional groups; the graphs indicate that some functional groups have a minimal influence on CO2 solubility, whereas others demonstrate a negative impact. Supplementary Table S1 (Supplementary Material) presents the sensitivity index values for each parameter across the dataset, as determined by the Sobol and Morris sensitivity analysis methods.
When dealing with extensive datasets that include numerous input variables, the Morris method could be a preferable initial option over the Sobol sensitivity analysis. Due to its faster execution and lower computational demand, the Morris method is particularly beneficial for large-scale data processing, enabling rapid analysis with modest resource consumption. The Morris method serves as a valuable tool for initial screening. It can efficiently identify the most influential parameters (such as pressure and temperature) while filtering out those with a lower impact (certain functional groups).
This study offers a valuable combination of high accuracy, efficiency, and insights into model interpretability using deep learning models. Still, these models' ability to generalize to other solutes or liquid types remains unverified. Another limitation to consider is the computational cost of the LSTM model. Although both models achieve high accuracy, the LSTM model requires significantly more training time and memory resources than the ANN model. This could limit its applicability in real-world scenarios where computational power or hardware resources might be restricted.
Conclusions
This study investigated the potential of deep learning models for predicting CO2 solubility in ionic liquids (ILs). A comprehensive dataset containing 10,116 CO2 solubility measurements covering 124 different ILs under varying temperatures and pressures was used to train two deep neural network models: an Artificial Neural Network (ANN) and a Long Short-Term Memory (LSTM) network. Hyperparameter tuning, optimization, and a validation strategy were employed to evaluate model performance comprehensively. The efficiency of the ANN and LSTM models was compared by analyzing their computational demands and memory consumption throughout the training process. Both models demonstrated remarkable accuracy in predicting CO2 solubility. The ANN model achieved a high R2 of 0.986 in roughly 4 min of training, consuming 536 MiB of memory, whereas the LSTM model required significantly more training time (approximately 127 min) and memory (734 MiB) to achieve a comparable R2 of 0.985. This difference can be attributed to the inherent complexity of the LSTM architecture in handling sequential data. The ANN model achieved a 13% lower error rate than a previous study that used an ANN-GC model on the same dataset30; in this study, the number of neurons in the ANN model was optimized to achieve this higher accuracy and lower error rate. A review of existing literature on prediction models developed for CO2 capture in ILs was conducted to gain insight into the relationship between model performance and the characteristics of the ILs.
The Sobol and Morris sensitivity analysis methods were employed to investigate the relative importance of input parameters on CO2 solubility in ILs. The Morris sensitivity analysis identified pressure and temperature as having the most significant influence on CO2 solubility in ILs, aligning well with experimental observations. The Morris method is a computationally efficient and easy-to-interpret technique for initial sensitivity analysis, particularly suitable for large datasets. The sensitivity analysis results provided valuable insights into the model's sensitivity to different parameters and helped identify the key factors driving CO2 solubility.
This study offers significant advancements in predicting CO2 solubility in ILs using deep learning models. The high accuracy and efficiency of the ANN model make it a promising tool for streamlining the screening process of ILs for CO2 capture applications. This paves the way for further exploration of deep learning approaches for similar prediction tasks in CO2 capture research and potentially extends its application to other areas of material science.
Supplementary Information
List of symbols
- a
Activations (or the total inputs) for the neurons in the layers
- b
Bias vectors for the layers
- C̃t
Candidate memory state (a vector in an LSTM cell)
- Ct
Cell state in an LSTM cell
- D
Total variance in sensitivity analysis
- Di
First-order variance contribution of parameter i
- Dij
Second-order variance contribution from the interaction between parameters i and j
- Dijk
Third-order variance contribution from the interaction between parameters i, j, and k
- EEi
Elementary effect for the parameter i
- f1
Activation functions for the first layer
- f2
Activation functions for the second layer
- ft
Forget gate in an LSTM cell
- hi
Hidden state of the LSTM cell
- it
Input gate in an LSTM cell
- ot
Output gate in an LSTM cell
- Si
First-order sensitivity index for parameter i
- STi
Total-order sensitivity index for parameter i
- P
Pressure (Pa)
- p
Input vector to the neural network
- R2
Coefficient of determination [–]
- T
Temperature (K)
- W
Weight matrices
- tanh
Hyperbolic tangent activation function
- σ
Sigmoid activation function
- μ
Average of elementary effects for a parameter in Morris sensitivity analysis
Abbreviations
- ANN
Artificial neural network
- ANFIS
Adaptive neuro-fuzzy inference system
- AARD
Average absolute relative deviation
- BP
Backpropagation
- CO2
Carbon dioxide
- CNN
Convolutional neural network
- CTREE
Conditional inference trees
- COSMO-RS
Conductor-like screening model for real solvents
- CAMD
Computer-aided molecular design
- CFNN
Cascade forward neural network
- CSA-LSSVM
Cuckoo search algorithm least squares support vector machine
- CPU
Central processing unit
- DT
Decision tree
- DNN
Deep neural network
- DEA
Diethanolamine
- ELM
Extreme learning machine
- GC
Group contribution
- GMDH
Group method of data handling
- GWO
Grey Wolf Optimization
- GRNN
General regression neural network
- GBR
Gradient boost regressor
- H2S
Hydrogen sulfide
- IL
Ionic liquids
- IFC
Ionic fragments contribution
- LS-SVM
Least squares support vector machine
- MAE
Mean absolute error
- MSE
Mean squared error
- MiB
Mebibytes
- MLPNN
Multilayer perceptron neural network
- MLP-ANN
Multilayer perceptron artificial neural network
- MLR
Multiple linear regression
- MEA
Monoethanolamine
- MDEA
Methyldiethanolamine
- PSA
Pressure swing adsorption
- PSRK
Predictive Soave–Redlich–Kwong (group contribution equation of state)
- PR-EOS
Peng–Robinson equation of state
- PLSR
Partial-least-squares regression
- PSO
Particle swarm optimization
- QSPR
Quantitative structure–property relationship
- ReLU
Rectified linear activation function
- RNN
Recurrent neural network
- RBF
Radial basis function (network or kernel)
- RF
Random forest
- RMSE
Root mean square error
- SRK-EOS
Soave–Redlich–Kwong equation of state
- SVM
Support vector machine
- SAFT
Statistical associating fluid theory
- SSA
Sparrow search algorithm
- TSA
Temperature swing adsorption
- XG Boost
eXtreme gradient boosting
- [BF4]
Tetrafluoroborate
- [DCA]
Dicyanamide
- [PF6]
Hexafluorophosphate
- [Cl]
Chloride
- [NO3]
Nitrate
- [C(CN)3]
Tricyanomethanide
- [Tf2N]
Bis(trifluoromethylsulflonyl)amide
- [HSO4]
Hydrogen sulfate
- [MeSO4]
Methylsulfate
Author contributions
M.A.: Conceptualization, methodology, software, validation, writing—original draft preparation, writing—review and editing. T.S.: Methodology, software, data curation, writing—original draft preparation, visualization, writing—review and editing. N.M.M.: Conceptualization, validation, and writing—review and editing. R.R.K.: Conceptualization, validation, and writing—review and editing. S.A.M.: Conceptualization, methodology, validation, formal analysis, writing-original draft preparation, and writing—review and editing. L.G.: Writing—review and editing. A.B.: Writing—review and editing. All authors have read and agreed to the published version of the manuscript.
Data availability
The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Nabisab Mujawar Mubarak, Email: mubarak.yaseen@gmail.com.
Rama Rao Karri, Email: kramarao.iitd@gmail.com.
Shaukat Ali Mazari, Email: shaukat.mazari@duet.edu.pk.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-024-65499-y.
References
- 1. Zheng S, et al. State of the art of ionic liquid-modified adsorbents for CO2 capture and separation. AIChE J. 2022;68:e17500. doi: 10.1002/aic.17500.
- 2. Arellano IH, Madani SH, Huang J, Pendleton P. Carbon dioxide adsorption by zinc-functionalized ionic liquid impregnated into bio-templated mesoporous silica beads. Chem. Eng. J. 2016;283:692–702. doi: 10.1016/j.cej.2015.08.006.
- 3. Krótki A, et al. Experimental results of advanced technological modifications for a CO2 capture process using amine scrubbing. Int. J. Greenh. Gas Control. 2020;96:103014. doi: 10.1016/j.ijggc.2020.103014.
- 4. Zhou Y, et al. Tetra-n-heptyl ammonium tetrafluoroborate: Synthesis, phase equilibrium with CO2 and pressure swing absorption for carbon capture. J. Supercrit. Fluids. 2017;120:304–309. doi: 10.1016/j.supflu.2016.05.030.
- 5. Jiang L, et al. Comparative analysis on temperature swing adsorption cycle for carbon capture by using internal heat/mass recovery. Appl. Therm. Eng. 2020;169:114973. doi: 10.1016/j.applthermaleng.2020.114973.
- 6. Guo M, et al. Amino-decorated organosilica membranes for highly permeable CO2 capture. J. Membr. Sci. 2020;611:118328. doi: 10.1016/j.memsci.2020.118328.
- 7. Polesso BB, et al. Supported ionic liquids as highly efficient and low-cost material for CO2/CH4 separation process. Heliyon. 2019;5:e02183. doi: 10.1016/j.heliyon.2019.e02183.
- 8. Lian S, et al. Recent advances in ionic liquids-based hybrid processes for CO2 capture and utilization. J. Environ. Sci. 2021;99:281–295. doi: 10.1016/j.jes.2020.06.034.
- 9. Davarpanah E, Hernández S, Latini G, Pirri CF, Bocchini S. Enhanced CO2 absorption in organic solutions of biobased ionic liquids. Adv. Sustain. Syst. 2020;4:1900067. doi: 10.1002/adsu.201900067.
- 10. Zhang X, et al. Carbon capture with ionic liquids: Overview and progress. Energy Environ. Sci. 2012;5:6668–6681. doi: 10.1039/c2ee21152a.
- 11. Babamohammadi S, Shamiri A, Aroua MK. A review of CO2 capture by absorption in ionic liquid-based solvents. Rev. Chem. Eng. 2015;31:383–412. doi: 10.1515/revce-2014-0032.
- 12. Kenarsari SD, et al. Review of recent advances in carbon dioxide separation and capture. RSC Adv. 2013;3:22739–22773. doi: 10.1039/c3ra43965h.
- 13. Theo WL, Lim JS, Hashim H, Mustaffa AA, Ho WS. Review of pre-combustion capture and ionic liquid in carbon capture and storage. Appl. Energy. 2016;183:1633–1663. doi: 10.1016/j.apenergy.2016.09.103.
- 14. Zeng S, et al. Ionic-liquid-based CO2 capture systems: Structure, interaction and process. Chem. Rev. 2017;117:9625–9673. doi: 10.1021/acs.chemrev.7b00072.
- 15. Johnson KE. What’s an ionic liquid? Electrochem. Soc. Interface. 2007;16:38. doi: 10.1149/2.F04071IF.
- 16. Ramdin M, de Loos TW, Vlugt TJ. State-of-the-art of CO2 capture with ionic liquids. Ind. Eng. Chem. Res. 2012;51:8149–8177. doi: 10.1021/ie3003705.
- 17. Krupiczka R, Rotkegel A, Ziobrowski Z. Comparative study of CO2 absorption in packed column using imidazolium based ionic liquids and MEA solution. Sep. Purif. Technol. 2015;149:228–236. doi: 10.1016/j.seppur.2015.05.026.
- 18. Weis DC, MacFarlane DR. Computer-aided molecular design of ionic liquids: An overview. Aust. J. Chem. 2012;65:1478–1486. doi: 10.1071/CH12344.
- 19. Holderbaum T, Gmehling J. PSRK: A group contribution equation of state based on UNIFAC. Fluid Phase Equilib. 1991;70:251–265. doi: 10.1016/0378-3812(91)85038-V.
- 20. Mourah M, NguyenHuynh D, Passarello J, De Hemptinne J, Tobaly P. Modelling LLE and VLE of methanol + n-alkane series using GC-PC-SAFT with a group contribution kij. Fluid Phase Equilib. 2010;298:154–168. doi: 10.1016/j.fluid.2010.07.013.
- 21. Fredenslund A, Jones RL, Prausnitz JM. Group-contribution estimation of activity coefficients in nonideal liquid mixtures. AIChE J. 1975;21:1086–1099. doi: 10.1002/aic.690210607.
- 22. Eckert F, Klamt A. Fast solvent screening via quantum chemistry: COSMO-RS approach. AIChE J. 2002;48:369–385. doi: 10.1002/aic.690480220.
- 23. Tatar A, et al. Prediction of carbon dioxide solubility in ionic liquids using MLP and radial basis function (RBF) neural networks. J. Taiwan Inst. Chem. Eng. 2016;60:151–164. doi: 10.1016/j.jtice.2015.11.002.
- 24. Faúndez CA, Fierro EN, Valderrama JO. Solubility of hydrogen sulfide in ionic liquids for gas removal processes using artificial neural networks. J. Environ. Chem. Eng. 2016;4:211–218. doi: 10.1016/j.jece.2015.11.008.
- 25. Mulero Á, Cachadiña I, Valderrama JO. Artificial neural network for the correlation and prediction of surface tension of refrigerants. Fluid Phase Equilib. 2017;451:60–67. doi: 10.1016/j.fluid.2017.07.022.
- 26. Sun J, Sato Y, Sakai Y, Kansha Y. A review of ionic liquids design and deep eutectic solvents for CO2 capture with machine learning. J. Clean. Prod. 2023;414:137695. doi: 10.1016/j.jclepro.2023.137695.
- 27. Eslamimanesh A, Gharagheizi F, Mohammadi AH, Richon D. Artificial neural network modelling of solubility of supercritical carbon dioxide in 24 commonly used ionic liquids. Chem. Eng. Sci. 2011;66:3039–3044. doi: 10.1016/j.ces.2011.03.016.
- 28. Venkatraman V, Alsberg BK. Predicting CO2 capture of ionic liquids using machine learning. J. CO2 Util. 2017;21:162–168.
- 29. Soleimani R, Saeedi Dehaghani AH, Bahadori A. A new decision tree based algorithm for prediction of hydrogen sulfide solubility in various ionic liquids. J. Mol. Liq. 2017;242:701–713. doi: 10.1016/j.molliq.2017.07.075.
- 30. Song Z, Shi H, Zhang X, Zhou T. Prediction of CO2 solubility in ionic liquids using machine learning methods. Chem. Eng. Sci. 2020;223:115752. doi: 10.1016/j.ces.2020.115752.
- 31. Deng T, Liu F-H, Jia G-Z. Prediction carbon dioxide solubility in ionic liquids based on deep learning. Mol. Phys. 2020;118:e1652367. doi: 10.1080/00268976.2019.1652367.
- 32. Tian Y, Wang X, Liu Y, Hu W. Prediction of CO2 and N2 solubility in ionic liquids using a combination of ionic fragments contribution and machine learning methods. J. Mol. Liq. 2023;383:122066. doi: 10.1016/j.molliq.2023.122066.
- 33. Liu Z, Bian X-Q, Duan S, Wang L, Fahim RI. Estimating CO2 solubility in ionic liquids by using machine learning methods. J. Mol. Liq. 2023;391:123308. doi: 10.1016/j.molliq.2023.123308.
- 34. Samra MNA, Abed BEE-D, Zaqout HAN, Abu-Naser SS. ANN model for predicting protein localization sites in cells. Int. J. Acad. Appl. Res. (IJAAR). 2020;4.
- 35. Mirarab M, Sharifi M, Behzadi B, Ghayyem MA. Intelligent prediction of CO2 capture in propyl amine methyl imidazole alanine ionic liquid: An artificial neural network model. Sep. Sci. Technol. 2015;50:26–37. doi: 10.1080/01496395.2014.946145.
- 36. Zhou G-S, et al. Hydrophilic interaction chromatography combined with ultrasound-assisted ionic liquid dispersive liquid–liquid microextraction for determination of underivatized neurotransmitters in dementia patients’ urine samples. Anal. Chim. Acta. 2020;1107:74–84. doi: 10.1016/j.aca.2020.02.027.
- 37. Baghban A, Ahmadi MA, Shahraki BH. Prediction carbon dioxide solubility in presence of various ionic liquids using computational intelligence approaches. J. Supercrit. Fluids. 2015;98:50–64. doi: 10.1016/j.supflu.2015.01.002.
- 38. Baghban A, Mohammadi AH, Taleghani MS. Rigorous modelling of CO2 equilibrium absorption in ionic liquids. Int. J. Greenh. Gas Control. 2017;58:19–41. doi: 10.1016/j.ijggc.2016.12.009.
- 39. Zhang X, Wang J, Song Z, Zhou T. Data-driven ionic liquid design for CO2 capture: Molecular structure optimization and DFT verification. Ind. Eng. Chem. Res. 2021;60:9992–10000. doi: 10.1021/acs.iecr.1c01384.
- 40. Daryayehsalameh B, Nabavi M, Vaferi B. Modelling of CO2 capture ability of [Bmim][BF4] ionic liquid using connectionist smart paradigms. Environ. Technol. Innov. 2021;22:101484. doi: 10.1016/j.eti.2021.101484.
- 41. Goodfellow I, Bengio Y, Courville A. Deep Learning (Adaptive Computation and Machine Learning Series), 321–359 (MIT Press, Cambridge, Massachusetts, 2017).
- 42. Hochreiter S. The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int. J. Uncertain. Fuzziness Knowl. Based Syst. 1998;6:107–116. doi: 10.1142/S0218488598000094.
- 43. Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9:1735–1780. doi: 10.1162/neco.1997.9.8.1735.
- 44. Siami-Namini S, Tavakoli N, Namin AS. The performance of LSTM and BiLSTM in forecasting time series. In 2019 IEEE International Conference on Big Data (Big Data), 3285–3292 (IEEE, 2019).
- 45. Xiang Z, Yan J, Demir I. A rainfall-runoff model with LSTM-based sequence-to-sequence learning. Water Resour. Res. 2020;56:e2019WR025326. doi: 10.1029/2019WR025326.
- 46. Sobol IM. Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math. Comput. Simul. 2001;55:271–280. doi: 10.1016/S0378-4754(00)00270-6.
- 47. Homma T, Saltelli A. Importance measures in global sensitivity analysis of nonlinear models. Reliab. Eng. Syst. Saf. 1996;52:1–17. doi: 10.1016/0951-8320(96)00002-6.
- 48. Morris MD. Factorial sampling plans for preliminary computational experiments. Technometrics. 1991;33:161–174. doi: 10.1080/00401706.1991.10484804.
- 49. van Griensven A, et al. A global sensitivity analysis tool for the parameters of multi-variable catchment models. J. Hydrol. 2006;324:10–23. doi: 10.1016/j.jhydrol.2005.09.008.
- 50. Ruder S. An overview of gradient descent optimization algorithms. arXiv preprint arXiv:1609.04747 (2016).
- 51. Roeder L. Netron: Visualizer for Neural Network, Deep Learning and Machine Learning Models. https://www.lutzroeder.com/ai
- 52. Abhishek K, Singh M, Ghosh S, Anand A. Weather forecasting model using artificial neural network. Proc. Technol. 2012;4:311–318. doi: 10.1016/j.protcy.2012.05.047.
- 53. Krogh A. What are artificial neural networks? Nat. Biotechnol. 2008;26:195–197. doi: 10.1038/nbt1386.
- 54. Gal Y, Ghahramani Z. A theoretically grounded application of dropout in recurrent neural networks. Adv. Neural Inf. Process. Syst. 2016;29.
- 55. Mesbah M, Shahsavari S, Soroush E, Rahaei N, Rezakazemi M. Accurate prediction of miscibility of CO2 and supercritical CO2 in ionic liquids using machine learning. J. CO2 Util. 2018;25:99–107. doi: 10.1016/j.jcou.2018.03.004.
- 56. Aghaie M, Zendehboudi S. Estimation of CO2 solubility in ionic liquids using connectionist tools based on thermodynamic and structural characteristics. Fuel. 2020;279:117984. doi: 10.1016/j.fuel.2020.117984.
- 57. Ghaslani D, Gorji ZE, Gorji AE, Riahi S. Descriptive and predictive models for Henry’s law constant of CO2 in ionic liquids: A QSPR study. Chem. Eng. Res. Des. 2017;120:15–25. doi: 10.1016/j.cherd.2016.12.020.
- 58. Dashti A, Riasat Harami H, Rezakazemi M, Shirazian S. Estimating CH4 and CO2 solubilities in ionic liquids using computational intelligence approaches. J. Mol. Liq. 2018;271:661–669. doi: 10.1016/j.molliq.2018.08.150.
- 59. Xia L, Wang J, Liu S, Li Z, Pan H. Prediction of CO2 solubility in ionic liquids based on multi-model fusion method. Processes. 2019;7:258. doi: 10.3390/pr7050258.
- 60. Moosanezhad-Kermani H, Rezaei F, Hemmati-Sarapardeh A, Band SS, Mosavi A. Modelling of carbon dioxide solubility in ionic liquids based on group method of data handling. Eng. Appl. Comput. Fluid Mech. 2021;15:23–42.
- 61. Blanchard LA, Hancu D, Beckman EJ, Brennecke JF. Green processing using ionic liquids and CO2. Nature. 1999;399:28–29. doi: 10.1038/19887.
- 62. Jerng SE, Park YJ, Li J. Machine learning for CO2 capture and conversion: A review. Energy AI. 2024;16:100361. doi: 10.1016/j.egyai.2024.100361.