Abstract
Nanofiltration (NF) membranes are increasingly being used to achieve precise solute–solute separation. These membranes are commonly synthesized using interfacial polymerization, offering great potential to separate lithium from magnesium. In this study, we have developed machine learning models that relate fabrication conditions, membrane properties, and operational conditions of NF membranes to predict water permeability and lithium/magnesium selectivity. Morgan fingerprints (MFs) and molecular descriptors (MDs) are used to represent the chemical and physical properties of the monomers. Explainable artificial intelligence tools such as Shapley additive explanations (SHAP) and partial dependence plots are used to evaluate the effects of the synthesis conditions and membrane properties on membrane performance. Based on the insights obtained from SHAP analysis, we developed a material screening approach to find promising monomers from a list of amines and cation-based ionic liquids. We construct a reference MF using the functional groups that positively contribute to membrane performance and compute a screening score that favors potential candidates with more desirable MDs. Finally, the synthesizability of these monomers is assessed using the synthetic accessibility score to find the most promising candidates. We compared the performance of screened monomers against traditional ones to validate the reliability of our approach. The results of this study provide critical insights into the relationships between synthesis conditions, membrane properties, and performance and establishes a novel, strategic framework for rational screening of monomers for NF membrane synthesis. This approach holds promise to accelerate the discovery of high-performance membranes tailored for specific separation challenges, thereby advancing the field of membrane technology.
Keywords: lithium, membranes, nanofiltration, machine learning, material screening, and inverse design


1. Introduction
The rapid rise in industrialization and global warming has led to an increased demand for cleaner energy production. In this context, lithium (Li) has become a critical metal since it is essential to produce rechargeable metal-ion batteries commonly used in electric vehicles, portable electronics, and grid-scale energy storage systems. , This growing demand for Li has intensified global interest in efficient recovery methods from natural resources such as mineral ores and salt-lake brine. Conventional mining practices used to recover Li from mineral ores pose a threat to the natural ecosystem due to large levels of toxic waste being released into the environment. It has been estimated that over 2.3 × 1011 t of Li is found in natural water bodies, making its extraction and recovery a highly desirable area of research and development.
Techniques such as solar evaporation, ion exchange, and chemical precipitation are commonly used to recover Li from saline water. , These techniques present challenges from a chemical and environmental engineering standpoint. Ion exchange is complicated due to exhaustion of resins at highly concentrated metal brines. On the other hand, postrecovery in solar evaporation and chemical precipitation is complex due to the existence of co-ions such as sodium, potassium, magnesium (Mg), and calcium. Mg specifically poses a great difficulty in Li recovery since it coprecipitates with Li as MgCO3. As a result, these traditional techniques are more suitable for brines with a low Mg-to-Li ratio (MLR). Nanofiltration (NF) is a pressure-driven membrane technology that demonstrates promising performance in the separation of Li and Mg in liquid streams with high MLR. − NF has several advantages over conventional methods, including high selectivity, scalability, and good process and energy efficiency, leading to improved engineering outcomes. Interfacial polymerization (IP) is widely used to fabricate NF membranes by reacting an amine monomer in the aqueous phase with an acyl chloride monomer in the organic phase, forming a thin polyamide (PA) layer on top of a porous substrate. The physical and chemical properties of this thin PA film can be tuned by selecting different types of monomers, incorporating additives (e.g., nanomaterials and surfactants), or adjusting the synthesis conditions, such as reaction time, heat curing time, and heat curing temperature. − An in-depth understanding of these synthesis-property-performance relationships is critical to the design of high-performance PA-NF membranes for Li separation.
The commonly used metrics to evaluate the performance of PA-NF membranes for Li separation are water permeability and Li/Mg selectivity. These two metrics exhibit a well-documented trade-off due to the intrinsic material limitations wherein more permeable membranes are less selective and vice versa. This inherent trade-off continues to drive efforts toward the development of advanced membranes that achieve high water permeability and salt selectivity. Achieving this goal requires innovative strategies capable of efficiently navigating the expansive design space, which includes diverse types of monomers and several combinations of synthesis conditions. The traditional trial-and-error approach of membrane synthesis often requires multiple design cycles to fabricate high-performance membranes. It is impractical for humans to explore this large design space through iterative experimentation due to its expensive and cumbersome nature.
Machine learning (ML) is a data-driven approach that has made significant advances in the field of membrane science and technology. − By leveraging experiments conducted over the past few decades, ML enables the development of powerful tools to address complex, multidimensional problems, while accurately predicting membrane performance and properties. − Researchers have used ML to predict water permeability, salt rejection, fouling decline ratio, and flux recovery ratio based on their synthesis conditions and properties. However, due to the “black box” nature of these ML models, it is challenging to understand the decision-making process behind the predictions made by them.
Explainable artificial intelligence (XAI) has recently emerged as a critical framework to elucidate the rationale behind the decision-making process of ML models. Shapley additive explanations (SHAP) and partial dependence plots (PDPs) are commonly used XAI tools that can reveal the impact of various features on the ML models, providing insights into model predictions. , Deng et al. used univariate and bivariate PDPs to investigate the influence of synthesis conditions of PA membranes critical to achieving high selectivity in solute–solute separation. Jeong et al. implemented SHAP to understand the underlying features governing ion transport across PA-NF and reverse osmosis (RO) membranes. By unveiling the impact of polymer properties, synthesis conditions, and the underlying transport mechanisms, XAI not only guides the membrane design process but also facilitates the discovery of new materials for specific applications. The widely used approach for material screening is to implement SHAP analysis to capture key chemical functional groups and screen promising candidates through similarity metrics such as the Tanimoto coefficient. This approach has proven effective; however, it overlooks the structural and physical characteristics of the polymer within the screening process. Gong et al. demonstrated the importance of using such molecular descriptors (MDs) while representing monomers to develop predictive ML models for the synthesis of PA-NF membranes. Thus, inclusion of these MDs in the material search process will streamline the screening pipeline and assist in the identification of novel candidates for NF membrane synthesis.
Some previous studies provide an ML perspective relating membrane properties and operational conditions to understand Li separation. ,− Our work builds on the existing literature by using data-driven tools to develop synthesis-property-performance relationships to improve the overall understanding of PA-NF membranes for Li separation and to devise a screening methodology that accelerates the discovery of high-performance materials for membrane synthesis. In this study, we have curated a database containing synthesis conditions, membrane properties, and membrane performance through literature mining. The primary objective of our study is (1) to develop predictive ML models relating the input features to performance metrics such as permeability and Li/Mg selectivity, (2) to study the impact of synthesis conditions and membrane properties on its separation performance using XAI tools, and (3) to develop high throughput virtual screening setup combining both Morgan fingerprints (MFs) and MDs of monomers to screen promising candidates with the potential for superior membrane performance.
2. Materials and Methods
2.1. Data Collection and Data Featurization
The data set for this study was collected from different sources and publishing aggregators such as Google Scholar, Elsevier, and ACS. The data exclusively focused on PA-NF membranes synthesized using IP for Li/Mg separation. The input data matrix consisted of 19 variables that were split into 3 categories: (1) synthesis conditions, (2) membrane properties, and (3) operational conditions (Table S1, Supporting Information). Membrane properties were represented using surface hydrophilicity (water contact angle), membrane pore characteristics (pore size), and surface charge (zeta potential). The values collected for the features were exhaustively mined from tables and text. For data from graphs, WebPlotDigitizer was used to extract graphical data. The final data set for our study consisted of 256 data points for water permeability (LMHbar–1) and 215 data points for Li/Mg salt selectivity. Importantly, we only considered the publications that reported selectivity values measured from systems containing multiple salts (containing both LiCl and MgCl2) as feed in lieu of systems that measure selectivity in 2 different systems containing a single salt (consisting either of LiCl or MgCl2). This criterion was applied because multi-ion systems represent the salt selectivity more accurately compared to singular ion systems and give a better representation of the separation process between Li and Mg.
The data set consisted of both numerical and categorical features, and no missing data were imputed since every feature has its physicochemical contribution to the membrane performance. Amine monomers used to synthesize NF membranes are reported in publications as a categorical variable i.e., names such as piperazine (PIP) or polyethylenimine (PEI). For better representation of these monomers for ML model training, we generated additional features that allow us to capture their chemical and physical attributes. To consider the chemical structure of monomers/polymers, we used the simplified molecular-input line-entry system (SMILES) and converted them into MFs using the RDKit package. The presence and absence of a specific atomic group/structural feature of molecules is encoded as 1 and 0, respectively. Additionally, we also used MDs to capture the various structural, chemical, electronic, and topological properties of the monomers (Table S2, Supporting Information).
2.2. ML Model Development and Characteristics
ML models such as Random Forest (RF), XGBoost, CatBoost, and LightGBM have showcased strong capabilities, providing good prediction accuracy with superior interpretability for evaluating membrane performance. , To prepare the data for model training, StandardScaler was used to scale the numerical feature values, and one-hot encoder was applied to encode categorical variables. We used Bayesian optimization with 5-fold cross-validation to develop ML models and identify the optimal hyperparameters for each ML algorithm (Table S3, Supporting Information). After these hyperparameters were optimized, we retrained the models to obtain the final performance metrics. For model training, 80% of the data set was used, and the remaining 20% of unseen data was set apart for testing. A shuffle split cross-validation strategy with 5 splits was implemented to ensure model generalizability and reduce the risk of overfitting. This resampling process was repeated across 8 random seed initializations, and the average performance metrics were computed to ensure robustness. The performance metrics being tested were coefficient of determination (R 2), root mean squared error (RMSE), and mean absolute error (MAE).
2.3. Model Interpretation Using XAI
Due to the inherent “black box” nature of ML models, it is challenging to explain the decision-making process behind their predictions. XAI can help humans understand how ML models are making their decisions and, more importantly, how certain input features influence model output.
2.3.1. PDPs
PDP is a graphical representation illustrating the relationship between specific features and the predicted outcome of a model while keeping all other features constant. They help in understanding the marginal effects of a single or multiple features on membrane performance. We constructed single-variable PDP (capturing the effect of a single feature) and multivariable PDP (capturing the effect of two variables) to visualize the impact of the input variables on water permeability and Li/Mg selectivity. The average partial dependence function for a feature S, f s can be calculated using
| 1 |
where feature C is the complement of S; x c and x s are their feature vectors, respectively.
2.3.2. SHAP
SHAP analysis is based on cooperative game theory, which is used to explain the contribution of each input variable to the model’s output. , By calculating marginal contribution of each feature, SHAP analysis can reveal whether input variables have positive or negative impact on model output. It was also used to represent the changes in the membrane permeability and salt selectivity over the range of input feature. The SHAP values for an input feature x (of n total features) give prediction p as
| 2 |
where S is the subsets of all features with feature x, p(S ∪ x) are the predictions by the built ML model considering feature x, and p(S) are the predictions without considering feature x. The differences among all possible subsets of S ⊆ N\x are calculated due to the dependency of the effect of withholding a feature on other features in the ML model.
2.4. Potential Monomer Screening
We developed a 3-step screening process to identify monomers for the synthesis of high-performance membranes with exceptional solute–solute performance. Over 500 candidate monomers were used to find the best candidates, which were obtained from the National Institute of Materials Science (NIMS) materials database and research publications focused on ionic liquids; a new class of candidates showing promise in the synthesis of membranes showing exceptional Li/Mg separation performance. −
2.4.1. Reference MF Using the Tanimoto Coefficient
Based on the results obtained from SHAP analysis of input MFs, we constructed a reference MF by adding the chemical functional groups having positive contributions to membrane permeability and salt selectivity. Using this reference MF, we aim to find compounds with similar chemical properties since they will show greater potential for the synthesis of high-performance NF membranes. We calculated the Tanimoto coefficient (SA,B) to find the similarity between a candidate monomer and the reference. This is computed as the number of bits in common divided by the total number of bits, represented as
| 3 |
where a is the number of bits in molecule A, b is the number of bits in molecule B, and c denotes the number of bits that are in both molecules. Tanimoto coefficient of 1 represents an identical molecule, whereas 0 represents no similarity between the molecules.
2.4.2. Screening Score Using MDs
After the chemically similar monomers to the reference MF were obtained from the candidate database, we subjected the monomers to a screening test based on the relative impact of each MD on the ML models. This impact was a product of the relative importance of the MD to model output and the correlation of its SHAP values to permeability and Li/Mg selectivity. Once this impact was calculated for both models, the MD values of potential monomer candidates were scaled based on it. The final metric is referred to as the screening score (S.S.), which is calculated as
| 4 |
where feature impactp and feature impacts refer to the product between the SHAP importance and Spearman’s correlation of the SHAP values of the MD to the ML model for permeability and Li/Mg selectivity, respectively. x i refers to the MD value of the potential monomer. The SHAP importance and correlation values for the best-performing ML model are calculated for 8 random seeds. The median values of importance and correlation for each descriptor across the random seeds are used as the final value to compute the feature impact and the resulting screening score. The idea is to ensure that candidate monomers having desirable properties score higher as compared to the other monomers, which will aid in the search for better candidates for NF membrane synthesis.
2.4.3. Synthetic Accessibility Score (SAS)
In addition to using reference MF and scaled MD values for the screening process, we calculated the SAS of candidate monomers to find the hypothetical ease of synthesizing certain monomers compared to others. , SAS ranges from 1 (easily synthesizable) to 10 (very difficult to synthesize). We used RDKit for the computation of the SAS of the potential monomeric candidates.
2.5. Membrane Fabrication and Experimental Validation
Three screened amine candidates, 2-methylpiperazine (M-PIP), 1-(2-aminoethyl)piperazine (A-PIP), and trans(2,5)-dimethylpiperazine (T-PIP), and two commonly used monomers, i.e., PEI (MW–800 Da) and PIP, were used to fabricate PA-NF membranes. The membranes were fabricated at modal conditions obtained from the data set. In short, a 0.5 wt % solution of monomer A1 was dissolved in water and allowed to contact a poly(ether sulfone) (PES) substrate. The excess solution was removed from the surface of the substrate using a rubber roller. 0.1 wt % solution of trimesoyl chloride (TMC) and n-hexane was then poured onto the substrate and allowed to react for 60 s. The resultant membrane was cured at a temperature of 60 °C for 10 min to allow the formation of a thin PA layer. The resultant membrane was stored in DI water for further use.
NF membrane performance experiments were conducted using a membrane module consisting of a crossflow testing cell with an effective area of 4.1 cm2 and a flow rate of 450 g/min. The membranes were compacted at 8 bar for 1 h before running the tests. The water permeability of each membrane was calculated as
where V is the volume of water collected (L), A is the effective filtration area (m2), ΔT is the time interval (h), and P is the operation pressure (bar).
To calculate the separation performance of Li with respect to Mg, a salt solution was prepared by mixing appropriate amounts of MgCl2 and LiCl to achieve the desired mass ratio of 20:1. The membrane performance for solute–solute separation can be quantified using a selectivity factor S Li/Mg.
Here, C Li,p, C Li,f and C Mg,p, C Mg,f are the concentrations of Li and Mg in permeate and feed, respectively. The concentration of the Li and Mg ions in the solution was analyzed using ICP-OES (PerkinElmer AA900Z).
3. Results and Discussion
3.1. Data Description, Analysis, and ML Model Performance
In Figure S1 (Supporting Information), we observe the permeability-selectivity trade-off across the entire data set, comparing membranes with additives and modifications to pristine NF membranes. Figures S2 and S3 (Supporting Information) present the data characteristics of the numerical and categorical features of the data set. The most commonly used amine monomers/polymers are PEI and PIP. PEI is specifically favored because of its positive charge, whereas PIP shows great tunability while synthesizing PA-NF membranes. , Sodium dodecyl sulfate (SDS) is the most commonly used surfactant, with most of its data distribution centered at 0.01 wt %. SDS facilitates better reaction control between the amine-based monomer and TMC by forming a self-assembled network at the water/organic solvent interface. This allows for the formation of a compact PA layer with uniform pore sizes. Studies often used acid acceptors such as sodium carbonate and sodium phosphate. These acceptors modulate the diffusive flux of amine monomers, resulting in the formation of thin PA layers with structural homogeneity, allowing for stable membrane performance. Over 88% of the data set consists of studies about standard IP, whereas 11.5% of the data points pertain to a relatively new method known as reverse interfacial polymerization (RIP). Researchers have found several advantages in RIP over IP, the biggest of which is the existence of residual positively charged amine monomers on the surface after membrane synthesis, allowing for better Li/Mg separation performance.
The data for water permeability and Li/Mg selectivity were collected and utilized to train four ML models (i.e., RF, XGBoost, CatBoost, and LightGBM). The prediction accuracy for the ML models was evaluated using MAE, RMSE, and R 2. The hyperparameters used to build the final ML models were obtained using Bayesian optimization. The ML model performance for predicting water permeability and salt selectivity is shown in Figure . CatBoost showed the best performance while predicting permeability (R 2 = 0.66), and Li/Mg salt selectivity (R 2 = 0.65). The MAE errors are ∼1.5 and ∼11%, while the RMSE values are ∼2.5 and ∼17.5% for permeability and Li/Mg selectivity, respectively (Table ). Only minor performance losses were observed in the model built without using membrane properties. This was done to check the predictive performance of ML models using synthesis conditions only.
1.

CatBoost-based ML models to predict membrane performance using (a,b) the entire data set, and (c,d) without membrane properties.
1. ML Model Performance.
| |
with membrane
properties |
without membrane properties |
|||||||
|---|---|---|---|---|---|---|---|---|---|
| RF | XGBoost | Catboost | LightGBM | RF | XGBoost | Catboost | LightGBM | ||
| permeability (LMH/bar) | MAE | 1.72 ± 0.27 | 1.81 ± 0.11 | 1.43 ± 0.25 | 2.40 ± 0.22 | 1.76 ± 0.27 | 1.82 ± 0.14 | 1.45 ± 0.26 | 2.19 ± 0.30 |
| RMSE | 2.73 ± 0.57 | 2.28 ± 0.50 | 2.42 ± 0.25 | 3.29 ± 0.34 | 2.73 ± 0.57 | 2.88 ± 0.53 | 2.47 ± 0.57 | 3.17 ± 0.55 | |
| R 2 | 0.61 ± 0.10 | 0.58 ± 0.11 | 0.66 ± 0.09 | 0.41 ± 0.06 | 0.60 ± 0.11 | 0.58 ± 0.11 | 0.65 ± 0.09 | 0.40 ± 0.12 | |
| Li/Mg selectivity | MAE | 11.66 ± 2.76 | 12.25 ± 2.23 | 11.21 ± 2.26 | 14.82 ± 2.76 | 11.86 ± 2.69 | 12.30 ± 3.10 | 12.08 ± 2.68 | 14.97 ± 3.306 |
| RMSE | 19.66 ± 2.79 | 19.19 ± 3.23 | 17.58 ± 2.26 | 22.85 ± 3.88 | 19.16 ± 4.85 | 19.19 ± 3.23 | 19.25 ± 3.55 | 22.29 ± 5.03 | |
| R 2 | 0.57 ± 0.12 | 0.58 ± 0.11 | 0.65 ± 0.08 | 0.42 ± 0.10 | 0.57 ± 0.14 | 0.58 ± 0.14 | 0.64 ± 0.10 | 0.41 ± 0.08 | |
Data leakage occurs when information from the training data set unintentionally influences the model, leading to an overestimation of predictive performance. Jeong et al. have demonstrated the issue of data leakage causing a falsely high prediction accuracy in NF and RO membranes. In our investigation, researchers within the same study frequently varied MLR as an experimental parameter (i.e., 1, 5, 10, 20, 50) when reporting water permeability and salt selectivity. We showcase the variation in R 2, MAE, and RMSE for the CatBoost model for water permeability and salt selectivity under these conditions (Table S4, Supporting Information). It can be observed that the model performs much better compared to the reported ML model in this case. To mitigate this, the authors chose only one data point from publications where multiple values of permeability and selectivity were reported for varying MLR, ensuring a more realistic and generalizable model.
3.2. Feature Analysis Using XAI
It is important to ensure that the knowledge obtained by the ML model is consistent with the fundamental understanding of design and synthesis of PA-NF membranes for Li/Mg separation. To validate this, we investigated the understanding gained by the ML models using XAI tools such as SHAP and PDP analyses. Figure S4 (Supporting Information) presents the SHAP plots that show the contributions of key features toward the prediction of permeability and selectivity. These plots reveal that membrane properties and the synthesis conditions play a key role in determining membrane performance.
3.2.1. Membrane Properties
To show the impact of membrane properties on the permeability and selectivity, we developed SHAP dependence plots for water contact angle (°), zeta potential (mV), and pore size (Å). Figure a,d show the effect of water contact angle on water permeability and selectivity. All the membranes possess a hydrophilic surface (water contact angle is less than 90°), which is considered ideal for water permeability. However, an increase in contact angle within the subrange has a negative impact on water permeability due to enhanced hydrophobic character, which increases the resistance for water transport through the membrane. On the other hand, the SHAP dependence plot for selectivity shows a sparse distribution throughout the data set range, which indicates that water contact angle has no clear impact on the selectivity of the membrane. Figure b,e show the impact of zeta potential on permeability and Li/Mg selectivity, respectively. Selectivity improves significantly as zeta potential increases beyond 0 mV, suggesting that a positively charged surface allows for easier permeation of Li+ ions as compared to Mg2+ ions. This is due to the dielectric and Donnan effects, which result in stronger repulsion forces faced by divalent ions (Mg2+) as compared to monovalent ions (Li+). ,, Tuning membrane surface charge is thus a useful strategy to improve the selective behavior of membranes to achieve separation of Li and Mg. However, in the case of permeability, negative values of zeta potential are more favorable for water transport due to the formation of a hydration layer under negatively charged conditions, promoting water passage. , In Figure c,f, the influence of pore size on membrane permeability and selectivity was illustrated. To understand the influence of pore size on Li/Mg selectivity, we need to consider two major phenomena, i.e., dehydration energy and size exclusion. Mg (−1830 kJ/mol) requires much more energy to shed its hydration shell as compared to Li (−475 kJ/mol), which favors Li transport through NF membrane pores. However, ions can partially strip their hydration shells to force a passage through membranes. Mg possesses a hydration radius of 4.28 Å and a Stokes radius of 3.47 Å, whereas Li possesses a hydration radius of 3.84 Å and a Stokes radius of 2.40 Å. Peng et al. suggested that the mean pore radius of the membrane pore sizes should lie between the hydration radius of Mg and the Stokes radius for Li (i.e., between 2.4 and 4.28 Å) for exceptional separation performance of both the elements. However, as per Figure f, there is a variation of SHAP values from negative to positive for Li/Mg selectivity within this subrange, possibly due to material-specific interactions with Li and Mg. This warrants a deeper investigation into pore size control of NF membranes to synthesize membranes with high selectivity. The broader trend, however, still shows that membrane permeability is compromised when designing selective membranes.
2.
SHAP analysis of membrane properties for water permeability: (a) water contact angle, (b) zeta potential, (c) pore size; SHAP analysis of membrane properties for Li/Mg selectivity: (d) water contact angle, (e) zeta potential, (f) pore size. Blue and red colors represent positive and negative SHAP values, respectively.
3.2.2. Impact of Synthesis Conditions
To gain a deeper understanding of the impact of synthesis conditions on membrane permeability and selectivity, we used our data set trained on CatBoost model to construct single-variable and multivariable PDP (Figure ). In Figure a,b, the impact of monomer A1 and TMC concentration on water permeability and salt selectivity is illustrated. An increase in the concentration of monomer A1 and TMC in the synthesis solution results in a compact polymer network, which inhibits water permeability and promotes Li/Mg salt selectivity. Zhu et al. also found a linear increase in active layer thickness with increasing concentrations of amine-based monomer. This increases the mass transfer resistance across the membrane, having negative effects on water permeability. An excess of monomer A1 compared to TMC also results in unreacted monomers on the membrane surface, enhancing its positive charge and promoting Donnan exclusion, which directly contributes to membrane selectivity. Low concentrations of monomers can lead to a lesser degree of cross-linking and larger pore sizes, which have a negative impact on the selective behavior of the membrane. Figure c,f show the effect of polymerization time on membrane permeability and selectivity. As the polymerization time increases, monomers A1 and TMC get more time to diffuse and react, enhancing the degree of cross-linking and thus generating a denser active layer. The improved degree of cross-linking decreases membrane permeability while improving Li and Mg separation performance. Figure d,e,g,h show the impact of the variation of heat curing time and heat curing temperature. Heat curing is an essential step in the fabrication of NF membranes since it facilitates polymerization between unreacted monomers and removal of excess solvents. Longer heat curing times are associated with the formation of a denser PA layer since it promotes cross-linking, which improves membrane selectivity. The trends regarding curing temperature are less consistent. Researchers have found that even though heat-curing temperature increases the rate of polymerization, it can also damage the active layer of the NF membrane, resulting in surface defects.
3.
PDP analysis for membrane performance. PDP showing the effect of monomer A1 concentration (wt %) as a function of TMC concentration (wt %) on (a) water permeability and (b) Li/Mg selectivity. In (a,b), lighter regions correspond to feature values associated with higher membrane performance. PDP of (c) polymerization time, (d) heat curing time, and (e) heat curing temperature for water permeability. PDP of (f) polymerization time, (g) heat curing time, and (h) heat curing temperature for Li/Mg selectivity.
The results obtained in Figures and highlight the general trends and the stark contrast in how these synthesis conditions and membrane properties impact water permeability and selectivity. Factors that are conducive to membrane permeability negatively affect selectivity and vice versa. These results also highlight the significant influence of monomer A1 on membrane performance, underscoring the need to develop innovative material screening strategies to enhance design efficiency.
3.3. Potential Monomer Screening and Experimental Validation
Based on the SHAP values of input MFs, we identified 15 functional groups having a positive influence on membrane permeability and 12 functional groups having a positive influence on membrane selectivity. As shown in Figure S5a (Supporting Information), the presence of hydrophilic carbonyl and sulfonate groups can contribute positively to water permeability. As per Figure S5b (Supporting Information), the presence of positively charged nitrogen centers can improve the surface charge of the membrane. This can positively impact Li and Mg separation in membranes, directly contributing to its selectivity. Additionally, the presence of primary, secondary, and tertiary amines is necessary since it forms the polymeric backbone during the synthesis of PA-NF membranes. Using these features, we constructed a reference MF that was used to screen monomers from the monomer database using the Tanimoto coefficient. We selected the top 20% of the monomers from this list for further investigation.
For the second stage of screening, we used radius of gyration, topological polar surface area (TPSA), BalabanJ, sphericity index, eccentricity, inertial shape factor, maximum partial charge, and logarithm of partition coefficient (logP) as MDs of choice. Importance and correlation analyses on the SHAP values of the MDs were carried out for different purposes. Importance was calculated to quantify the contribution of the MD to the ML models’ predictive performance (Figure a,c). On the other hand, correlation was calculated to evaluate the directionality and strength of the relationship between the SHAP values of the MD and membrane performance (Figure b,d). The SHAP values of the MD that are positively correlated to membrane performance are more desirable and vice versa for negative features (Figure S6, Supporting Information).
4.
SHAP Analysis of MDs for membrane performance. (a,b) correspond to water permeability, and (c,d) to Li/Mg selectivity. Green bars in correlation plots imply a positive correlation, whereas red bars imply a negative correlation.
The separation performance of Li and Mg can be traced to the effect of these descriptors on membrane permeability and selectivity. Max_Partial_Charge is related to the reactivity of the monomer, wherein a greater value is associated with the electron richness of the molecule. IP is a reaction where the amine monomer reacts with the acyl chloride monomer at the water–hexane interface. Increasing Max_Partial_Charge can accelerate the reaction, resulting in the formation of a loose, porous structure that negatively impacts solute separation performance. It is desired to have a slower controlled reaction such that a uniform membrane layer is formed. This can enhance size sieving effects, which allows for better separation performance. TPSA is associated with increasing molecule complexity due to the presence of reactive sites, resulting in the formation of membranes with loose structures. Thus, TPSA also has a negative contribution toward the separation performance of the membrane.
After the correlation and importance of the features were calculated, we evaluated the feature impact by multiplying the median correlation and importance obtained across 8 seeds. This would ensure that both directionality and contribution of the MDs are considered when screening potential monomers. We scale the MD of the potential monomers by the feature impact for both permeability and selectivity and add those values to receive a screening score (Figure ). Monomeric candidates having a higher score would have more desirable MDs, taking both permeability and selectivity into account. Thus, we can streamline the screening process and identify materials that are more likely to succeed as potential monomers to synthesize high-performance NF membranes for Li separation. We can potentially tweak the outcomes from this process and find monomers that either showcase more selectivity or more permeability by assigning different weights to both the ML models while calculating the screening score. This will ensure more desirable outputs are favored. The top 20% of the monomers having the highest screening factor were checked for their synthesis feasibility.
5.

Screening score of the potential monomer candidates.
The final stage of the screening process involves the calculation of SAS. The SAS contains fragment contributions and complexity penalty as two major components. The fragment contributions present in molecules that are more difficult to synthesize are scored higher as compared to fragments of commonly known molecules that are easier to synthesize. Complexity penalty assigns a penalty to large rings, ring fusions, and overall molecular size. After the SAS was calculated for our screened monomers having the highest screening factor, we report five monomers with low SAS to find the monomer candidates that show potential for membrane synthesis (Figure ).
6.
Monomer candidates screened using the ML model.
As observed, all these monomers have nitrogen centers, which are essential for the IP reaction with TMC. In the case of 1-acrylol-4-methylpiperazine, it has a carbonyl group on its surface, which can enhance water permeability. A positively charged nitrogen is present in 1-benzyl-3-methyl-1H-imidazole-3-ium that can impart a positive charge to the membrane surface, resulting in the improvement of charge screening effects, promoting better separation of Li and Mg. Out of the five reported monomers, membranes were synthesized using three candidates: A-PIP, M-PIP, and T-PIP. Among these, A-PIP and M-PIP are superior to PIP in terms of membrane permeability and salt selectivity at similar synthesis conditions (Figure ). Specifically, M-PIP exhibited a greater membrane flux as compared to PEI, although it has lower salt selectivity, whereas A-PIP surpasses PEI on both water permeability and salt selectivity. T-PIP, however, did not demonstrate stable performance during membrane testing. These results highlight the promise of the screening approach but also underscore the importance of experimental validation to confirm the practical viability of the selected monomer candidates.
7.
Performance characteristics of screened monomers.
4. Implications and Future Perspectives
In this study, we present the design considerations and a novel material screening approach to synthesize high-performance NF membranes for Li separation. We conduct a comprehensive ML analysis to relate the input synthesis conditions, membrane properties, and operating conditions to membrane performance and elucidate the underlying principles governing the separation of Li and Mg using PA-NF membranes. XAI (i.e., PDP and SHAP analysis of the synthesis conditions) revealed the importance of amine and chloride monomer concentrations, reaction time, heat curing time, and heat curing temperature on membrane performance. Additionally, membrane properties such as pore size and zeta potential are critical to understanding the water transport and separation behavior of Li from Mg in NF membranes. These findings highlight the potential for research on positively charged materials that increase the surface charge and techniques to achieve a sharp pore size distribution, potentially improving the outcomes for Li and Mg separation. In general, there was an inverse relation between parameters favoring permeability compared to those favoring selectivity. This is in line with the literature on the permeability-selectivity trade-off commonly observed in polymeric membranes. Researchers need to make use of advanced toolkits, such as those lying within the inverse-design umbrella, to tackle this trade-off. In that context, we developed a high-throughput virtual screening pipeline that leverages the physical and chemical descriptors used to represent monomers. For this purpose, we used a 3-step setup that combines MFs, MDs, and SAS to screen candidates for Li–Mg separation. The goal was to find the promising candidates from a given list by maximizing their physical and chemical characteristics with the help of the Tanimoto coefficient and our in-house derived screening score. We reported five candidates in our study using this methodology and tested the performance of three candidates against PIP and PEI, which are commonly used materials in the literature. Two candidates had better permeability and selectivity as compared to PIP at similar experimental conditions, showing the potential of this screening process.
In an ideal scenario, models should be able to learn from high-quality, clean, labeled data encompassing a diverse chemical space and experimental conditions that are representative of real-life scenarios. However, models trained for material screening in membrane technology are often trained on smaller data sets, where such a type of data diversity is not available. This raises concerns regarding the transferability of these models across different chemical spaces and experimental conditions, which are unseen in the data set. It becomes essential to test and validate the performance of novel materials screened with the help of ML models in environmentally relevant conditions to alleviate these concerns. Further, the quality of data in experimentally derived data sets is prone to noisy observations due to variability in measurement techniques across laboratories, measurement device inaccuracy, and random human error. Using tools such as WebPlotDigitizer to mine data from graphs can reduce the inconsistencies in data collection as compared to human observations, however its results are dependent on graph quality, image resolution, and human accuracy. These tools also often lack the provision to measure uncertainty using error bars. Data needs to be rigorously checked, preprocessed, and post-training cross-validation techniques need to be implemented to reliably assess model performance and ensure its robustness while minimizing bias. Even though our model shows satisfactory performance over a wide range of parameters, there are several ways in which it can be improved. The most important step is to find methods to represent categorical features, such as additives in the aqueous phase and the additive layer, in a similar fashion to the monomers used in the study. This would ensure that we can capture chemical and physical details from those modifications, which can be further used to enhance the screening process. Furthermore, there were only 12 types of different amine monomers present in the data set. This results in variations in how the ML model captures the chemical fingerprints and MDs while carrying out the screening process (Figures S7 and S8, Supporting Information). Expanding the data set with more monomers and data points will help in improving the chemical diversity of the data set.
Material screening is only the first step in the design hierarchy required to synthesize high-performance membranes. Inverse design has revolutionized materials design and discovery by fundamentally flipping the research paradigm by selecting desired properties of the material and working backward to identify ideal candidates. , Generative ML allows for autonomous design of new chemicals based on previously trained data. Researchers can produce hypothetical candidates with user-defined criteria and incorporate constraints into the generative process targeted to optimize materials performance, thereby accelerating materials discovery. Ultimately, the final performance of membranes is dependent on the close interplay of synthesis conditions and the resultant membrane properties. A deep understanding of these relationships is essential to carefully balance the “permeability-selectivity” trade-off. Being reliant on the traditional trial-and-error approach to do this is costly and time-consuming, thus requiring alternate approaches to navigate this infinite search space consisting of thousands of monomer/synthesis condition combinations. Active learning and Bayesian optimization are ML strategies that employ surrogate models to predict the properties of unexplored materials and guide experiments in a way that maximizes learning, reduces model uncertainty, and optimizes outcomes. , This approach is particularly useful since we can gain information with limited experimental resources. Researchers have relied on using Pareto optimization to identify a set of materials (“Pareto Fronts”) to find balanced performance among the chosen metrics. Even though Pareto optimization provides an interesting solution to the multiobjective design problem, its success hinges on the predictive accuracy of the underlying ML models. Research advances are most likely to occur when experimental work is integrated with ML and computational approaches. This will transform the traditional discovery process and allow efficient exploration of large chemical spaces, leading to the development of a superior class of NF membranes.
Supplementary Material
Acknowledgments
This work was partially supported by the U.S. Department of Agriculture (Awards 2018-68011-28371, 2021-67021-34499, 2021-67021-38585, and 2024-67021-41534); National Science Foundation (Awards 2112533, 2345543, and 2419122).
All the data used in the study are presented in the Supporting Dataset.
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsestengg.5c00444.
Variables used for data set construction; morgan fingerprint and molecular descriptors used for this study; hyperparameter ranges of the ML model; performance metrics for the CatBoost model when data leakage occurs; permeability–selectivity plot for the data set, data characteristics for numerical features; data description for categorical features; SHAP plots for permeability and Li/Mg selectivity; chemical bonds with positive contribution to membrane permeability and Li/Mg selectivity; SHAP plots relating molecular descriptors to membrane selectivity; SHAP plots showing the variation in correlation and importance for multiple seeds (PDF)
Supporting Dataset (XLSX)
The authors declare no competing financial interest.
References
- Chen T., Jin Y., Lv H., Yang A., Liu M., Chen B., Xie Y., Chen Q.. Applications of Lithium-Ion Batteries in Grid-Scale Energy Storage Systems. Trans. Tianjin Univ. 2020;26(3):208–217. doi: 10.1007/s12209-020-00236-w. [DOI] [Google Scholar]
- Speirs J., Contestabile M., Houari Y., Gross R.. The Future of Lithium Availability for Electric Vehicle Batteries. Renew. Sustain. Energy Rev. 2014;35:183–193. doi: 10.1016/j.rser.2014.04.018. [DOI] [Google Scholar]
- Butylskii D. Yu., Troitskiy V. A., Smirnova N. V., Pismenskaya N. D., Wang Y., Jiang C., Xu T., Nikonenko V. V.. Review of Recent Progress on Lithium Recovery and Recycling from Primary and Secondary Sources with Membrane-Based Technologies. Desalination. 2024;586:117826. doi: 10.1016/j.desal.2024.117826. [DOI] [Google Scholar]
- Li X., Mo Y., Qing W., Shao S., Tang C. Y., Li J.. Membrane-Based Technologies for Lithium Recovery from Water Lithium Resources: A Review. J. Membr. Sci. 2019;591:117317. doi: 10.1016/j.memsci.2019.117317. [DOI] [Google Scholar]
- Alkhadra M. A., Su X., Suss M. E., Tian H., Guyes E. N., Shocron A. N., Conforti K. M., de Souza J. P., Kim N., Tedesco M., Khoiruddin K., Wenten I. G., Santiago J. G., Hatton T. A., Bazant M. Z.. Electrochemical Methods for Water Purification, Ion Separations, and Energy Conversion. Chem. Rev. 2022;122(16):13547–13635. doi: 10.1021/acs.chemrev.1c00396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang P., Li J., Xing S., Jia Q., Wang J., Bi J., Guo Z., Wang L., Ji Z., Qu L.. Solar Evaporators for Saline Water: Sustainable Clean Water Harvesting and Critical Mineral Resources Extraction. ACS Nano. 2025;19(12):11625–11647. doi: 10.1021/acsnano.5c00564. [DOI] [PubMed] [Google Scholar]
- Foo Z. H., Lienhard J. H.. Emerging Membrane Technologies for Sustainable Lithium Extraction from Brines and Leachates: Innovations, Challenges, and Industrial Scalability. Desalination. 2025;598:118411. doi: 10.1016/j.desal.2024.118411. [DOI] [Google Scholar]
- Zhang Y., Hu Y., Wang L., Sun W.. Systematic Review of Lithium Extraction from Salt-Lake Brines via Precipitation Approaches. Miner. Eng. 2019;139:105868. doi: 10.1016/j.mineng.2019.105868. [DOI] [Google Scholar]
- Zheng R., Xu S., Zhong S., Tong X., Yu X., Zhao Y., Chen Y.. Enhancing Ion Selectivity of Nanofiltration Membranes via Heterogeneous Charge Distribution. Environ. Sci. Technol. 2024;58(51):22818–22828. doi: 10.1021/acs.est.4c08841. [DOI] [PubMed] [Google Scholar]
- Peng H., Zhao Q.. A Nano-Heterogeneous Membrane for Efficient Separation of Lithium from High Magnesium/Lithium Ratio Brine. Adv. Funct. Mater. 2021;31(14):2009430. doi: 10.1002/adfm.202009430. [DOI] [Google Scholar]
- Soyekwo F., Liu C., Mao X., Nie R.. Piperidinium-Incorporated Fabrication of High-Performance Polyamide Nanofiltration Membrane with High Free Volume for Magnesium/Lithium Separation. J. Membr. Sci. 2024;707:122940. doi: 10.1016/j.memsci.2024.122940. [DOI] [Google Scholar]
- M.s S., Elmakki T., Zavahir S., Tariq H., Abdulhameed A., Park H., Shon H. K., Han D. S.. Enhanced Lithium Separation from Brines Using Nanofiltration (NF) Technology: A Review. Desalination. 2024;592:118148. doi: 10.1016/j.desal.2024.118148. [DOI] [Google Scholar]
- Lu X., Elimelech M.. Fabrication of Desalination Membranes by Interfacial Polymerization: History, Current Efforts, and Future Directions. Chem. Soc. Rev. 2021;50(11):6290–6307. doi: 10.1039/D0CS00502A. [DOI] [PubMed] [Google Scholar]
- Wang Y., Chang H., Jiang S., Chen J., Wang J., Liang H., Li G., Tang X.. An Efficient Co-Solvent Tailoring Interfacial Polymerization for Nanofiltration: Enhanced Selectivity and Mechanism. J. Membr. Sci. 2023;677:121615. doi: 10.1016/j.memsci.2023.121615. [DOI] [Google Scholar]
- Wang Z., Gao Y., Zhang J., Peng Z., Wang X., Liu S., Huang X.. Simultaneously Modulating Nanofiltration Membrane Surface Charges and Pore Size in Pursuit of Effective Cation Separation via a Bi-Functional Monomers-Interfered IP Reaction. ACS EST Eng. 2024;4(5):1113–1122. doi: 10.1021/acsestengg.3c00574. [DOI] [Google Scholar]
- Saha N. K., Joshi S. V.. Performance Evaluation of Thin Film Composite Polyamide Nanofiltration Membrane with Variation in Monomer Type. J. Membr. Sci. 2009;342(1):60–69. doi: 10.1016/j.memsci.2009.06.025. [DOI] [Google Scholar]
- Seah M. Q., Lau W. J., Goh P. S., Ooi B. S., Lai G. S., Ismail A. F.. Improving Properties of Thin Film Nanocomposite Membrane via Temperature-Controlled Interfacial Polymerization for Nanofiltration Process. Desalination. 2023;545:116091. doi: 10.1016/j.desal.2022.116091. [DOI] [Google Scholar]
- Park H. B., Kamcev J., Robeson L. M., Elimelech M., Freeman B. D.. Maximizing the Right Stuff: The Trade-off between Membrane Permeability and Selectivity. Science. 2017;356(6343):eaab0530. doi: 10.1126/science.aab0530. [DOI] [PubMed] [Google Scholar]
- Gao H., Zhong S., Zhang W., Igou T., Berger E., Reid E., Zhao Y., Lambeth D., Gan L., Afolabi M. A., Tong Z., Lan G., Chen Y.. Revolutionizing Membrane Design Using Machine Learning-Bayesian Optimization. Environ. Sci. Technol. 2022;56(4):2572–2581. doi: 10.1021/acs.est.1c04373. [DOI] [PubMed] [Google Scholar]
- Dangayach R., Jeong N., Demirel E., Uzal N., Fung V., Chen Y.. Machine Learning-Aided Inverse Design and Discovery of Novel Polymeric Materials for Membrane Separation. Environ. Sci. Technol. 2025;59:993. doi: 10.1021/acs.est.4c08298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barnett J. W., Bilchak C. R., Wang Y., Benicewicz B. C., Murdock L. A., Bereau T., Kumar S. K.. Designing Exceptional Gas-Separation Polymer Membranes Using Machine Learning. Sci. Adv. 2020;6(20):eaaz4301. doi: 10.1126/sciadv.aaz4301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu A., Liu Y., Wang X., Xia S., Van der Bruggen B.. A Machine Learning Based Framework to Tailor Properties of Nanofiltration and Reverse Osmosis Membranes for Targeted Removal of Organic Micropollutants. Water Res. 2025;268:122677. doi: 10.1016/j.watres.2024.122677. [DOI] [PubMed] [Google Scholar]
- Cao Z., Barati Farimani O., Ock J., Barati Farimani A.. Machine Learning in Membrane Design: From Property Prediction to AI-Guided Optimization. Nano Lett. 2024;24(10):2953–2960. doi: 10.1021/acs.nanolett.3c05137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baig N., Usman J., Abba S. I., Benaafi M., Aljundi I. H.. Fractionation of Dyes/Salts Using Loose Nanofiltration Membranes: Insight from Machine Learning Prediction. J. Clean. Prod. 2023;418:138193. doi: 10.1016/j.jclepro.2023.138193. [DOI] [Google Scholar]
- Ignacz G., Bader L., Beke A. K., Ghunaim Y., Shastry T., Vovusha H., Carbone M. R., Ghanem B., Szekely G.. Machine Learning for the Advancement of Membrane Science and Technology: A Critical Review. J. Membr. Sci. 2025;713:123256. doi: 10.1016/j.memsci.2024.123256. [DOI] [Google Scholar]
- Batra R., Song L., Ramprasad R.. Emerging Materials Intelligence Ecosystems Propelled by Machine Learning. Nat. Rev. Mater. 2021;6(8):655–678. doi: 10.1038/s41578-020-00255-y. [DOI] [Google Scholar]
- Tran H., Gurnani R., Kim C., Pilania G., Kwon H.-K., Lively R. P., Ramprasad R.. Design of Functional and Sustainable Polymers Assisted by Artificial Intelligence. Nat. Rev. Mater. 2024;9:866–886. doi: 10.1038/s41578-024-00708-8. [DOI] [Google Scholar]
- Dansawad P., Li Y., Li Y., Zhang J., You S., Li W., Yi S.. Machine Learning toward Improving the Performance of Membrane-Based Wastewater Treatment: A Review. Adv. Membr. 2023;3:100072. doi: 10.1016/j.advmem.2023.100072. [DOI] [Google Scholar]
- Jeong N., Epsztein R., Wang R., Park S., Lin S., Tong T.. Exploring the Knowledge Attained by Machine Learning on Ion Transport across Polyamide Membranes Using Explainable Artificial Intelligence. Environ. Sci. Technol. 2023;57(46):17851–17862. doi: 10.1021/acs.est.2c08384. [DOI] [PubMed] [Google Scholar]
- Ignacz G., Alqadhi N., Szekely G.. Explainable Machine Learning for Unraveling Solvent Effects in Polyimide Organic Solvent Nanofiltration Membranes. Adv. Membr. 2023;3:100061. doi: 10.1016/j.advmem.2023.100061. [DOI] [Google Scholar]
- Wang M., Shi G. M., Zhao D., Liu X., Jiang J.. Machine Learning-Assisted Design of Thin-Film Composite Membranes for Solvent Recovery. Environ. Sci. Technol. 2023;57:15914. doi: 10.1021/acs.est.3c04773. [DOI] [PubMed] [Google Scholar]
- Yang M., Zhu J.-J., McGaughey A. L., Priestley R. D., Hoek E. M. V., Jassby D., Ren Z. J.. Machine Learning for Polymer Design to Enhance Pervaporation-Based Organic Recovery. Environ. Sci. Technol. 2024;58(23):10128–10139. doi: 10.1021/acs.est.4c00060. [DOI] [PubMed] [Google Scholar]
- Deng H., Luo Z., Imbrogno J., Swenson T. M., Jiang Z., Wang X., Zhang S.. Machine Learning Guided Polyamide Membrane with Exceptional Solute–Solute Selectivity and Permeance. Environ. Sci. Technol. 2023;57:17841. doi: 10.1021/acs.est.2c05571. [DOI] [PubMed] [Google Scholar]
- Wang C., Wang L., Yu H., Soo A., Wang Z., Rajabzadeh S., Ni B.-J., Shon H. K.. Machine Learning for Layer-by-Layer Nanofiltration Membrane Performance Prediction and Polymer Candidate Exploration. Chemosphere. 2024;350:140999. doi: 10.1016/j.chemosphere.2023.140999. [DOI] [PubMed] [Google Scholar]
- Gong W., Xu H., Lu J., Kim J., Zhao Y., Li N., Zhang Y., Yang J., Xu D., Liang H.. Gradient Boosting Decision Tree Algorithms for Accelerating Nanofiltration Membrane Design and Discovery. Desalination. 2024;592:118072. doi: 10.1016/j.desal.2024.118072. [DOI] [Google Scholar]
- Ji Z., Guan H., Dong Y.. Performance and Mechanism of Lithium Extraction from Water via Machine Learning-Powered Nanofiltration. J. Membr. Sci. 2025;733:124344. doi: 10.1016/j.memsci.2025.124344. [DOI] [Google Scholar]
- Sun J.-O., Hua T.-W., Guan Y.-F., Yu H.-Q.. Predicting and Understanding the Performance of Polyamide Nanofiltration Membrane for Li/Mg Selective Separation Based on Machine Learning. Water Res. 2025;285:124140. doi: 10.1016/j.watres.2025.124140. [DOI] [PubMed] [Google Scholar]
- Lu D., Ma X., Lu J., Qian Y., Geng Y., Wang J., Yao Z., Liang L., Sun Z., Liang S., Zhang L.. Ensemble Machine Learning Reveals Key Structural and Operational Features Governing Ion Selectivity of Polyamide Nanofiltration Membranes. Desalination. 2023;564:116748. doi: 10.1016/j.desal.2023.116748. [DOI] [Google Scholar]
- WebPlotDigitizer-Copyright 2010–2024 Ankit Rohatgi. https://apps.automeris.io/wpd4/ (accessed 08 23, 2024).
- Wang R., Zhang J., Tang C. Y., Lin S.. Understanding Selectivity in Solute–Solute Separation: Definitions, Measurements, and Comparability. Environ. Sci. Technol. 2022;56(4):2605–2616. doi: 10.1021/acs.est.1c06176. [DOI] [PubMed] [Google Scholar]
- Gao H., Zhong S., Dangayach R., Chen Y.. Understanding and Designing a High-Performance Ultrafiltration Membrane Using Machine Learning. Environ. Sci. Technol. 2023;57(46):17831–17840. doi: 10.1021/acs.est.2c05404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- RDKit. https://www.rdkit.org/ (accessed 08 19, 2023).
- Mohammed A., Alshraideh H., Alsuwaidi F. A.. Holistic Framework for Improving the Prediction of Reverse Osmosis Membrane Performance Using Machine Learning. Desalination. 2024;574:117253. doi: 10.1016/j.desal.2023.117253. [DOI] [Google Scholar]
- Ma X., Lu D., Lu J., Qian Y., Zhang S., Yao Z., Liang L., Sun Z., Zhang L.. Revealing Key Structural and Operating Features on Water/Salts Selectivity of Polyamide Nanofiltration Membranes by Ensemble Machine Learning. Desalination. 2023;548:116293. doi: 10.1016/j.desal.2022.116293. [DOI] [Google Scholar]
- Niu C., Li B., Wang Z.. Using Artificial Intelligence-Based Algorithms to Identify Critical Fouling Factors and Predict Fouling Behavior in Anaerobic Membrane Bioreactors. J. Membr. Sci. 2023;687:122076. doi: 10.1016/j.memsci.2023.122076. [DOI] [Google Scholar]
- Jeong N., Chung T., Tong T.. Predicting Micropollutant Removal by Reverse Osmosis and Nanofiltration Membranes: Is Machine Learning Viable? Environ. Sci. Technol. 2021;55(16):11348–11359. doi: 10.1021/acs.est.1c04041. [DOI] [PubMed] [Google Scholar]
- Sutariya B., Sarkar P., Indurkar P. D., Karan S.. Machine Learning-Assisted Performance Prediction from the Synthesis Conditions of Nanofiltration Membranes. Sep. Purif. Technol. 2025;354:128960. doi: 10.1016/j.seppur.2024.128960. [DOI] [Google Scholar]
- Hao S., Wang M., Guan H., Zhao Y., Ji Z., Dai C.. Machine Learning-Guided Prediction of Polymeric Membrane Performance in Forward Osmosis. Sep. Purif. Technol. 2025;379:135037. doi: 10.1016/j.seppur.2025.135037. [DOI] [Google Scholar]
- Wu H., Lin Y., Feng W., Liu T., Wang L., Yao H., Wang X.. A Novel Nanofiltration Membrane with [MimAP][Tf2N] Ionic Liquid for Utilization of Lithium from Brines with High Mg2+/Li+ Ratio. J. Membr. Sci. 2020;603:117997. doi: 10.1016/j.memsci.2020.117997. [DOI] [Google Scholar]
- Soyekwo F., Wen H., Liao D., Liu C.. Nanofiltration Membranes Modified with a Clustered Multiquaternary Ammonium-Based Ionic Liquid for Improved Magnesium/Lithium Separation. ACS Appl. Mater. Interfaces. 2022;14(28):32420–32432. doi: 10.1021/acsami.2c03650. [DOI] [PubMed] [Google Scholar]
- Ni Y., Peng H., Zhao Q.. Ultrathin Poly(Ionic Liquid) Nanomembranes for High Performance Mg2+/Li+ Separation. Adv. Mater. Interfaces. 2022;9(32):2201797. doi: 10.1002/admi.202201797. [DOI] [Google Scholar]
- Ertl P., Schuffenhauer A.. Estimation of Synthetic Accessibility Score of Drug-like Molecules Based on Molecular Complexity and Fragment Contributions. J. Cheminf. 2009;1(1):8. doi: 10.1186/1758-2946-1-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen S., Jung Y.. Estimating the Synthetic Accessibility of Molecules with Building Block and Reaction-Aware SAScore. J. Cheminf. 2024;16(1):83. doi: 10.1186/s13321-024-00879-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang L., Rehman D., Sun P.-F., Deshmukh A., Zhang L., Han Q., Yang Z., Wang Z., Park H.-D., Lienhard J. H., Tang C. Y.. Novel Positively Charged Metal-Coordinated Nanofiltration Membrane for Lithium Recovery. ACS Appl. Mater. Interfaces. 2021;13(14):16906–16915. doi: 10.1021/acsami.1c02252. [DOI] [PubMed] [Google Scholar]
- Li L., Zhu G., Tong Y., Ding K., Wang Z., Meng C., Gao C.. Polyethyleneimine Modified Polyamide Composite Nanofiltration Membrane for Separation of Lithium and Magnesium. J. Water Process Eng. 2023;54:103894. doi: 10.1016/j.jwpe.2023.103894. [DOI] [Google Scholar]
- Gao Y., Zhao Y., Wang X., Tang C., Huang X.. Modulating the Asymmetry of the Active Layer in Pursuit of Nanofiltration Selectivity via Differentiating Interfacial Reactions of Piperazine. Environ. Sci. Technol. 2022;56(19):14038–14047. doi: 10.1021/acs.est.2c04124. [DOI] [PubMed] [Google Scholar]
- Liang Y., Zhu Y., Liu C., Lee K.-R., Hung W.-S., Wang Z., Li Y., Elimelech M., Jin J., Lin S.. Polyamide Nanofiltration Membrane with Highly Uniform Sub-Nanometre Pores for Sub-1 Å Precision Separation. Nat. Commun. 2020;11(1):2015. doi: 10.1038/s41467-020-15771-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen L., Cheng R., Yi M., Hung W.-S., Japip S., Tian L., Zhang X., Jiang S., Li S., Wang Y.. Polyamide-Based Membranes with Structural Homogeneity for Ultrafast Molecular Sieving. Nat. Commun. 2022;13(1):500. doi: 10.1038/s41467-022-28183-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y., Wang S., Wu W., Yu H., Che R., Kang G., Cao Y.. Fabrication of Positively Charged Nanofiltration Membrane with Uniform Charge Distribution by Reversed Interfacial Polymerization for Mg2+/Li+ Separation. J. Membr. Sci. 2022;659:120809. doi: 10.1016/j.memsci.2022.120809. [DOI] [Google Scholar]
- Zhai X., Lin S., Li X., Wang Z.. The Hidden Role of the Dielectric Effect in Nanofiltration: A Novel Perspective to Unravel New Ion Separation Mechanisms. Environ. Sci. Technol. 2024;58(35):15874–15884. doi: 10.1021/acs.est.4c07510. [DOI] [PubMed] [Google Scholar]
- Chen K., Zhao S., Lan H., Xie T., Wang H., Chen Y., Li P., Sun H., Niu Q. J., Yang C.. Dual-Electric Layer Nanofiltration Membranes Based on Polyphenol/PEI Interlayer for Highly Efficient Mg2+/Li+ Separation. J. Membr. Sci. 2022;660:120860. doi: 10.1016/j.memsci.2022.120860. [DOI] [Google Scholar]
- Nickerson T. R., AntonioMcNallyToney E. N. D. P. M. F., Ban C., Straub A. P., Ban C., Straub P.. Unlocking the potential of polymeric desalination membranes by understanding molecular-level interactions and transport mechanisms. Chem. Sci. 2023;14(4):751–770. doi: 10.1039/D2SC04920A. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang M., Guan K., Ji Y., Liu G., Jin W., Xu N.. Controllable Ion Transport by Surface-Charged Graphene Oxide Membrane. Nat. Commun. 2019;10:1253. doi: 10.1038/s41467-019-09286-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao Y., Tong T., Wang X., Lin S., Reid E. M., Chen Y.. Differentiating Solutes with Precise Nanofiltration for Next Generation Environmental Separations: A Review. Environ. Sci. Technol. 2021;55(3):1359–1376. doi: 10.1021/acs.est.0c04593. [DOI] [PubMed] [Google Scholar]
- Peng Q., Wang R., Zhao Z., Lin S., Liu Y., Dong D., Wang Z., He Y., Zhu Y., Jin J., Jiang L.. Extreme Li-Mg Selectivity via Precise Ion Size Differentiation of Polyamide Membrane. Nat. Commun. 2024;15(1):2505. doi: 10.1038/s41467-024-46887-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu Q.-Y., Wu L.-K., Li L.-Q., Zhuang L.-W., Xue S.-M., Xu Z.-L., Tang Y.-J.. Novel Insight on the Effect of the Monomer Concentration on the Polypiperazine-Amide Nanofiltration Membrane. Ind. Eng. Chem. Res. 2022;61(17):5843–5852. doi: 10.1021/acs.iecr.2c00478. [DOI] [Google Scholar]
- Tang K., Zhu L., Lan P., Chen Y., Chen Z., Lan Y., Lan W.. Regulating the Thickness of Nanofiltration Membranes for Efficient Water Purification. Nanoscale Adv. 2023;5(18):4770–4781. doi: 10.1039/D3NA00110E. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang H., He Q., Luo J., Wan Y., Darling S. B.. Sharpening Nanofiltration: Strategies for Enhanced Membrane Selectivity. ACS Appl. Mater. Interfaces. 2020;12(36):39948–39966. doi: 10.1021/acsami.0c11136. [DOI] [PubMed] [Google Scholar]
- Xu P., Wang W., Qian X., Wang H., Guo C., Li N., Xu Z., Teng K., Wang Z.. Positive Charged PEI-TMC Composite Nanofiltration Membrane for Separation of Li+ and Mg2+ from Brine with High Mg2+/Li+ Ratio. Desalination. 2019;449:57–68. doi: 10.1016/j.desal.2018.10.019. [DOI] [Google Scholar]
- Zhan Z.-M., Xu Z.-L., Zhu K.-K., Tang Y.-J.. How to Understand the Effects of Heat Curing Conditions on the Morphology and Performance of Polypiperazine-Amide NF Membrane. J. Membr. Sci. 2020;597:117640. doi: 10.1016/j.memsci.2019.117640. [DOI] [Google Scholar]
- Xiang K., Ping H.-H., Xu Z.-L., Zhuang L., Tang Y.-J.. Revealing the Effect of Heat Curing on the Uniformity of the Nanofiltration Membrane: Experiment and Simulation Studies. Ind. Eng. Chem. Res. 2024;63(23):10256–10264. doi: 10.1021/acs.iecr.4c00880. [DOI] [Google Scholar]
- Zhao G., Sun J., Tang G., Pan G., Yu H., Li Y., Zhang Y., Liu Y.. Highly Selective Mg2+/Li+ Separation Membranes Prepared by Surface Grafting of a Novel Quaternary Ammonium Bromide. Sep. Purif. Technol. 2024;335:126184. doi: 10.1016/j.seppur.2023.126184. [DOI] [Google Scholar]
- Bajusz D., Rácz A., Héberger K.. Why Is Tanimoto Index an Appropriate Choice for Fingerprint-Based Similarity Calculations? J. Cheminf. 2015;7(1):20. doi: 10.1186/s13321-015-0069-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang J., Tao L., He J., McCutcheon J. R., Li Y.. Machine Learning Enables Interpretable Discovery of Innovative Polymers for Gas Separation Membranes. Sci. Adv. 2022;8(29):eabn9545. doi: 10.1126/sciadv.abn9545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jabbar R., Jabbar R., Kamoun S.. Recent Progress in Generative Adversarial Networks Applied to Inversely Designing Inorganic Materials: A Brief Review. Comput. Mater. Sci. 2022;213:111612. doi: 10.1016/j.commatsci.2022.111612. [DOI] [Google Scholar]
- Jiao S., Katz L. E., Shell M. S.. Inverse Design of Pore Wall Chemistry To Control Solute Transport and Selectivity. ACS Cent. Sci. 2022;8(12):1609–1617. doi: 10.1021/acscentsci.2c01011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang M., Jiang J.. Accelerating Discovery of Polyimides with Intrinsic Microporosity for Membrane-Based Gas Separation: Synergizing Physics-Informed Performance Metrics and Active Learning. Adv. Funct. Mater. n/a. 2024;34(23):2314683. doi: 10.1002/adfm.202314683. [DOI] [Google Scholar]
- Mannodi-Kanakkithodi A., Pilania G., Ramprasad R., Lookman T., Gubernatis J. E.. Multi-Objective Optimization Techniques to Design the Pareto Front of Organic Dielectric Polymers. Comput. Mater. Sci. 2016;125:92–99. doi: 10.1016/j.commatsci.2016.08.018. [DOI] [Google Scholar]
- Tan C. S., Gupta A., Ong Y.-S., Pratama M., Tan P. S., Lam S. K.. Pareto Optimization with Small Data by Learning across Common Objective Spaces. Sci. Rep. 2023;13(1):7842. doi: 10.1038/s41598-023-33414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All the data used in the study are presented in the Supporting Dataset.





