Abstract
A machine learning strategy is presented for the rapid discovery of new polymeric materials satisfying multiple desirable properties. Of particular interest is the design of high refractive index polymers. Our in silico approach employs a series of quantitative structure–property relationship models that facilitate rapid virtual screening of polymers based on relevant properties such as the refractive index, glass transition and thermal decomposition temperatures, and solubility in standard solvents. Exploration of the chemical space is carried out using an evolutionary algorithm that assembles synthetically tractable monomers from a database of existing fragments. Selected monomer structures that were further evaluated using density functional theory calculations agree well with model predictions.
Keywords: computer-aided design, high refractive index polymer, QSPR, machine learning
1. Introduction
Polymeric thin films with high refractive indices and high optical clarity are highly sought after in various applications such as optical data storage [1], lenses [2], anti-reflective coatings [3], immersion lithography [4] and complementary metal-oxide-semiconductor (CMOS) image sensors [5].
The trends and developments in the field have been summarized in recent reviews [6,7] and the references therein. In addition to a high refractive index, other properties of interest are the thermal stability related parameters, such as the glass transition temperatures ( C [8]) and thermal decomposition temperatures ( C) that play a critical role in the optical device fabrication. Given the performance requirements that need to be satisfied for different applications, the polymer chemist is faced with the challenge of finding a balance between several complementary properties.
A popular strategy for developing high refractive index polymers (HRIP) is based on using inorganic nanoparticle-filled polymer composites [9]. Although promising, they pose significant processing challenges and suffer from high optical losses. A more effective approach has been to alter the chemical structure of the polymer by incorporating high-molar refraction groups, such as sulfur atoms and aromatic structures [7]. With a view to rapidly explore the vast chemical space and thereby fine tune experimental efforts, there has been increased focus on using computational approaches. Semi-empirical electronic structure methods, for instance, have been used to create a database of porous polymer networks that can be used as methane adsorbents [10]. In other studies, ab initio approaches were used to search for polymer dielectrics [11] and also in the design of polymers for photovoltaic applications [12]. However, the computational costs associated with ab initio methods limit large scale explorations of the chemical space. Inverse Quantitative Structure-Property Relationship (QSPR) approaches such as those based on the signature molecular descriptor [13] have also been employed but require significant computational effort to solve constraint equations.
More computationally expedient alternatives to standard quantum chemistry (QC) based schemes for property prediction include those based on data-driven approaches that make use of machine learning (ML), multivariate statistics and chemometrics. These methods typically rely on establishing quantitative structure–property relationships [14] (QSPR) and can be used for modelling diverse properties including those that at present cannot be directly computed using QC (such as the solar power conversion efficiency [15]). Starting with a representative set of molecules, each structure is encoded as a vector using a wide range of descriptors that capture geometrical and electronic properties. Each vector can be associated with one or more responses or class labels that are to be predicted. The vectors and their responses are stored in matrices that are submitted to ML algorithms [14] that perform regression or classification to establish a mathematical model between the structure descriptor variables and the responses. The purpose of the QSPR is two-fold: to discover interesting relationships between the structure descriptors and the responses and, secondly, as a way to obtain rapid estimations of the responses. The latter is of particular interest in combinatorial [16] and evolutionary [15,17] searches in chemical space as the number of possible structures to investigate would -be intractable for most QC approaches. Thus, using QSPR models to predict relevant responses facilitates searching a much larger chemical space using fewer computational resources than traditional QC approaches.
In this article, we adopt an in silico molecular design approach that is based on principles of Darwinian evolution to drive the search for HRIPs. In the past, approaches based on evolutionary algorithms have been successfully applied to the design of drug-like molecules [18], dyes for solar cells [15] and olefin metathesis catalysts [19]. In recent years, a number of polymer properties relevant for optoelectronic applications such as the refractive index [20,21,22], glass transition temperature [23] and thermal decomposition temperature [24] have been modelled using QSPR methods wherein descriptors calculated from the monomer structures have been correlated with the property of interest. Continuing with this approach, we make use of such models to carry out a multiple criteria-based virtual screening of polymers. As opposed to the largely intuition-driven trial and error approaches, we adopt a systems approach advocated by Bicerano [25]. The computational search strategy used in this study facilitates the accelerated discovery of advanced optical materials in a cost-effective manner. The most promising repeat units are further subjected to rigorous validation based on density functional theory [26] (DFT). Application of this multi-pronged strategy has yielded a number of promising monomers with advantageous properties and may be of considerable value in the development of photonic materials.
2. Materials and Methods
An overview of the molecular design approach based on Darwinian evolution is depicted in Figure 1. The process has been discussed in detail in previous articles [15,19,27,28] and only a short description is provided here. Each proposed structure (repeating unit) is defined as a combination of fragments attached to a scaffold that has an available set of attachment points (shown as “A” in the figure). The building blocks are connected with respect to a set of pre-defined rules so as to maximize the probability of synthesis [29,30]. The molecular fragments were generated by applying the Breaking of Retrosynthetically Interesting Chemical Substructures (BRICS) [31] fragmentation algorithm to existing monomer structures taken from various literature sources. Figure 2 lists the various scaffolds used as starting points to create different monomers. Each structure output by the evolutionary protocol is then evaluated in terms of the value of the refractive index (the primary fitness estimate), which is obtained here using an ML model. Over several iterations of the evolutionary algorithm, the population of monomer structures undergoes computational crossover and mutation operations involving fragment exchange or substitution. In addition to the refractive index, other properties such as the , and solubility in standard solvents (N-methyl-2-pyrrolidone (NMP), chloroform), etc. are also evaluated using different machine learning models. Finally, selected candidates that pass initial criteria are further analyzed using density functional theory approaches [26].
Figure 1.
Schematic shows an outline of the design approach. The “A” on the fragments indicate points of attachment. The crossover operations (indicated by blue arrows) involve random selection of fragments (building blocks highlighted by the circles) in the two parent structures and swaps them, thus producing typically two offspring. In a given structure, the mutation operator (red arrow) may either replace or delete a randomly selected fragment.
Figure 2.
The building block scaffolds used in this study. The attachment points are indicated by the letter “A”. The fragment recombination proceeds according to a set of fragment compatibility rules [29].
2.1. Polymer Properties
The refractive index (n) of a polymer is often expressed in terms of the Lorentz–Lorenz [32,33] equation given by:
| (1) |
where is the linear molecular polarizability, the polymer density, the molecular weight of the monomer and the Avogadro number [34,35]. The equation thus enables the prediction of the refractive index, using the polarizability of an isolated molecule.
An important parameter in optics and design of lenses is the Abbe number () [8], which is a measure of the refractive index dispersion (larger numbers correspond to a lower dispersion) and is given by:
| (2) |
where , , and are the refractive indices of the material at the wavelengths of the Fraunhofer D (589.3 nm), F (486.1 nm) and C (656.3 nm) spectral lines, respectively.
In addition to the n and , other properties to be considered include birefringence and optical transparency [7]. The birefringence, calculated as (where and are the in-plane and out-of-plane refractive indices, respectively), is caused by the orientation of polymer molecular chains and is required to be minimal, in order to achieve fine focusing in lenses. Furthermore, for use in optoelectronic materials, a high transparency in the visible range is desirable [36]. Thus, the spectra of new monomer structures should have minimal absorbance in the visible region (400–700 nm).
2.2. Machine Learning
Experimental values of the refractive index (n at 589 nm), density ( at room temperature), glass transition temperatures () and thermal decomposition temperatures ( 10% weight loss temperatures recorded in N atmosphere) for a diverse set of polymers were collated from existing literature [20,23,25,37,38]. A complete list of the monomer structures and corresponding experimental values are provided in the Supplementary Materials (see Tables S2–S6). Table 1 summarizes the available data for the properties studied. The chemistry spans several classes including polyimides, polyethylenes, polyphosphazenes, polyacrylates, polyarylene sulfides, phenylquinoxalines, polystyrenes and polycarbonates.
Table 1.
Summary of the experimental data available for refractive index (n), density (), glass transition temperatures () and decomposition temperatures ( for 10% weight loss). is the number of available samples, while and are the respective numbers in the calibration and test sets (based on a random 50:50 split of the data).
| Property | Range | |||
|---|---|---|---|---|
| n | 237 | 1.34–1.71 | 120 | 117 |
| 195 | 0.84–2.1 | 99 | 96 | |
| (C) | 601 | −143–399 | 304 | 297 |
| (C) | 175 | 125–563 | 90 | 85 |
Given the previous success of quantum chemical descriptors in modelling polymer properties [39,40,41] based on the structure of the monomer, we employ these to model the different properties. For each monomer, various geometrical and molecular orbital-based descriptors such as the Highest Occupied Molecular Orbital/Lowest Unoccupied Molecular Orbital (HOMO/LUMO) energies, charges, polarizabilities, superdelocalizabilities, and radial distribution function (RDF) indices were calculated using the software KRAKENX (version 0.1.3, www.krakenminer.com) [42]. These descriptors have earlier been shown to be well suited for predicting diverse properties such as power conversion efficiencies of dyes in solar cells [17,43], densities/viscosities [42], p [27] and thermal decomposition temperatures of ionic liquids [44]. A total of 828 descriptors was calculated for each monomer, which was reduced to around 818, after the removal of low variance columns and those containing missing values. Table S1 in the Supplementary Materials provides a description of the variables.
The data fitting was carried out using both linear partial least squares regression [45] (PLSR) and the ensemble tree-based random forests (RF) [46] method. In order to assess the predictive abilities of the ML models, the data was split (50:50) randomly for each model. As part of the preprocessing, a pairwise correlation analysis was performed and only one among the highly correlated pair of variables () was retained. The remaining variables (<400) were then autoscaled to zero mean and unit variance. Multiple metrics: the root mean squared error (), mean absolute error (), and 10-fold cross-validated correlation coefficient were used to evaluate the model performances. In addition, variable selection was also carried out to improve the predictive ability and, where possible, reduce model complexity (see previous papers [15,42,44]). The generated models are constrained by the response and chemical structure space within which they are assumed to reliable. To establish reliability estimates for the PLSR model predictions, the distance to the model [47] and the bootstrap variance [15] based on 500 models was computed, while, for the random forests, the conditional quantiles [48] were used. Predictions for which the estimated variability is small can in general be trusted while those with large values need to be treated with caution.
2.3. Computational Details
The structures of the monomers were drawn using the MarvinSketch [49] program version 5.9.3 from ChemAxon, https://chemaxon.com/ (or alternatively taken from literature when available) were converted to 3D using OpenBabel (version 2.4.1, http://openbabel.org/docs/current/) [50] (based on the Universal Force Field [51]). The initial geometries were further optimized using the semi-empirical AM1 Hamiltonian in MOPAC (version 16.220L, http://openmopac.net/) [52]. For the refractive index calculations, the MOPAC optimized structures were further subjected to full geometry optimizations at the DFT level (without symmetry constraints) using the B3LYP [53] functional and the 6-311G(d,p) basis set. The wavelength-dependent linear polarizabilities were computed using the range-corrected CAM-B3LYP [54] functional along with the 6-311++G(d,p) [55] basis set (containing both polarisation and diffuse functions). To assess optical transparency, the UV-Vis absorption spectra [56] of the monomers was computed using time-dependent DFT (TDDFT) carried out at the CAM-B3LYP/6-311G(d,p) level of theory. The DFT calculations were performed using the Gaussian 09 [57] software package. The regression models were developed using the statistical software R version 3.4.2, https://www.r-project.org/ [58] with the packages pls [59] and randomForest [48,60].
3. Results and Discussion
3.1. Analysis of Regression Models
Table 2 summarizes the results for the regression models corresponding to the n, , and . The complete list of experimental and predicted values is provided in Tables S2–S6 in the Supplementary Materials. The PLSR model applied to the refractive index prediction yields a low-complexity 4 latent variable (LV) model, which performs quite well for both the calibration and independent test sets with . While similar metrics for the are achieved, the PLSR model performance for () is considerably poorer. In comparison, the random forests’ (based on 100 trees) regression models perform well on most properties. In general, tree-based models are less useful for extrapolation as they employ boxing in selected regions of the variable space. Given the interest in identifying HRIPs (with n > 1.70) for which the models are required to extrapolate beyond the calibration range of the response variable, we make use of PLSR models for driving the search for suitable polymers. Since both the experimental and values span a desirable temperature range, the RF models act as filters by excluding those that have predicted values below a given threshold. Predictive models for the density were found to be comparatively poorer with the RF model yielding calibration statistics of and a test set , while PLSR failed to produce models with . Attempts to improve the performance using other methods such as support vector machines [61], however, did not meet with success.
Table 2.
Summary of the regression model performances for the refractive index (n), glass transition temperatures () and decomposition temperatures (). Here, is the mean absolute error, is the root mean squared error and the squared correlation between the observed and predicted values.
| Model | Property | Calibration | Testing | ||
|---|---|---|---|---|---|
| PLSR | n | 0.79 | 0.04 (0.03) | 0.79 | 0.04 (0.03) |
| (C) | 0.81 | 52 (34) | 0.83 | 49 (38) | |
| (C) | 0.61 | 49 (24) | 0.62 | 51 (41) | |
| RF | n | 0.83 | 0.03 (0.01) | 0.88 | 0.03 (0.02) |
| (C) | 0.86 | 44 (14) | 0.88 | 40 (30) | |
| (C) | 0.80 | 35 (12) | 0.72 | 45 (30) | |
| 0.64 | 0.13 (0.04) | 0.66 | 0.14 (0.08) | ||
Figure 3 shows the variable importance plots for the regression models and is based on the contribution the predictor variables make to the construction of the models. For PLSR, the ranking is based on the variable importance in projection score [62] (VIP), while, for RF models, the importance is calculated based on the increase in the mean square error of predictions as a result of a given descriptor being randomly permuted [63]. The PLSR model for the refractive index n show the most important variables to be the heat of formation (at the AM1 level of theory) that reflects the thermodynamic stability of the polymer, the global softness (the inverse of the HOMO-LUMO energy gap), the nucleophilic (DNR) and electrophilic (DER) delocalizabilities that are dynamic reactivity indices and signify intermolecular interaction, and the static hyperpolarizability that influences the electric susceptibility [64]. A small HOMO-LUMO energy gap may also indicate that the molecule is easily polarized. In addition, a number of charge surface area descriptors that emphasize the charge distribution can be linked to the size related bulk properties of the repeating unit [39]. For both and models, the prominent features are dominated by variables that emphasize electrophilic and nucleophilic attack along with other values such as the heat of formation, the global electrophilicity index [65] (measures the energy stabilization) and the HOMO-LUMO gap that are seen as standard indicators of stability. The charge based descriptors, on the other hand, may reflect the electrostatic interactions between the polymer chains. Many of the highlighted variables mirror the findings in previous studies [39,40,41].
Figure 3.
Variable importance plots for the (A) Partial Least Squares Regression (PLSR) model for n, and Random Forest (RF) models for (B) and (C) . In all cases, only the top 10 most important variables are shown.
3.2. Molecular Evolution Analysis
The evolutionary algorithm was configured to run with a population of 100 structures for a maximum of 100 generations with crossover and mutation probabilities set to 0.5. In order to prevent the monomers from becoming too large, a molecular weight restriction was imposed wherein structures above 1000 daltons were discarded. Over 4000 unique structures were produced from five runs of the evolutionary algorithm initialized with different starting seeds. For these monomers, the predicted values of n (at = 589.3 nm) ranged between to (see Figure S1 in the Supplementary Materials), with approximately 40% of the structures yielding , the maximum n value in the calibration data set. In order to examine the trends in the predicted response, an analysis of the PLSR latent variable scores was performed. Figure 4 shows the increasing trend of the refractive indices along the first two latent vectors. While the presence of conjugated ring structures and sulphur content have been shown to increase n, other factors such as the higher molecular weight of monomers have also been seen to influence n (see Figures S2 and S3 in the Supplementary Materials). Similar trends are seen for the designed structures with nearly one-fourth of the monomers having n > 1.70 and . Although the molecular weight descriptor was removed during the model building (highly correlated with other variables and hence excluded), it is interesting to note that the model nonetheless captures the variations in the refractive index with respect to the chemical composition reasonably well.
Figure 4.
Scores plot for the first two latent variables shows the spread of the predicted refractive indices for the designed monomers. The dashed arrow in the centre of the plot shows the direction of the increasing refractive indices as indicated by the PLSR model. Structures of selected monomers along this line reflect the chemical diversity in the population.
Analysis of the monomers based on the different scaffolds (shown in Figure 2) used show that structures based on nitrogen or sulfur-containing substituents generally yielded high refractive indices (n > 1.7) [7,66]. The thiazole moiety (comprising of a sulfur atom and a C=N–C bond) not only increases the sulphur content but also leads to low molar volumes, thereby yielding high refractive indices [67,68]. Furthermore, scaffolds based on the diketopyrrolopyrrole (see Figure 5) (linked with thiazole, furan, thiophene and thienothiophene), -conjugated benzodithiophene and thianthrene moieties are seen to have high refractive indices. Incorporating these units has been seen to improve thermal stability and solubility.
Figure 5.
The box plot shows the range of refractive index values with respect to each scaffold (see Figure 2).
Although the molecular assembly attempts to create structures that are likely to be synthetically tractable, it is interesting to assess it quantitatively with a synthetic accessibility (SA) score. Here, we use the ease of synthesis ranging between 1 (easy to make) to 10 (difficult) as a metric to guage the synthetic accessibility of the proposed monomers [69]. A Python-based implementation [70] that combines fragment contributions and complexity penalties was used to estimate the SA score. For a significant majority of the structures, the score was found to be around 5 or less (see Figure S4 in the Supplementary Materials).
Another important criterion to consider is that of the solubility of the polymer in common organic solvents [71]. Poor solubility makes processing difficult and limits their applications. Solubility data for polymers with respect to five commonly used solvents—chloroform (CHCl), dimethylacetamide (DMAc), N-methylpyrrolidine (NMP), tetrahydrofuran (THF) and dimethyl sulfoxide (DMSO)—were collated from the literature. The available data was divided into three classes, which include: S, soluble, PS, partially soluble/swelling/soluble on heating and I, insoluble. For each solvent, the data were equally divided into independent calibration and test sets across the different classes. Random forests classification models were created to predict the solubility class (I/PS/S) using the descriptors as described above. Since the classes are unevenly distributed for all solvents, we make use of Cohen’s Kappa statistic [72], which can be applied to both multi-class and imbalanced class problems. Model performances summarized in Table 3 show that the typical values are in the range 0.50–0.60 and fall in the moderate agreement range [72]. Using these models, the solubility classes for the designed monomers were predicted. As can be seen from Figure 6, a majority (>70%) of the proposed structures are potentially soluble in DMAc, DMSO and NMP solvents.
Table 3.
Summary of the random forest classification performances for the polymer solubilities in different solvents. See Tables S7–S12 in the Supplementary Materials for performances with respect to each solvent. The 10-fold cross-validated and values are reported for each solvent. Here, S, soluble, PS, partially soluble/swelling/soluble on heating and I, insoluble.
| Solvent | #Samples | I | PS | S | ||
|---|---|---|---|---|---|---|
| CHCl | 136 | 53 | 34 | 48 | 0.56 | 0.50 |
| NMP | 145 | 10 | 42 | 93 | 0.62 | 0.36 |
| DMAc | 105 | 8 | 41 | 56 | 0.52 | 0.48 |
| DMSO | 154 | 19 | 56 | 79 | 0.53 | 0.58 |
| THF | 120 | 15 | 59 | 46 | 0.49 | 0.62 |
Figure 6.
Predicted solubilities for the de novo designed monomers.
3.3. Comparison with DFT
In order to apply Equation (1) to estimate polymer refractive indices, the polarizability and density are required. Since the polarizability is assumed to be additive, the monomer polarizability () has been used in a number of studies [26,34,73]. Although this does not sufficiently hold true at a theoretical level, for computational ease, all calculations were carried out only for the monomers terminated by hydrogen atoms. The second component of density typically requires molecular dynamics simulations but is computationally demanding. Hence, for computational ease, the density is estimated using a QSPR () model. To evaluate the efficacy of this approach, polarizability calculations were carried out for a number of polymers for which experimental refractive indices recorded at different wavelengths (589, 633, 1324 nm) were available (see Table S13 in the Supplementary Materials) and spanned a range of 1.34–1.79. The higher values were included in particular to assess the ability of the - driven approach to estimate the predicted reflective indices that are extrapolated by the model.
Analysis of the results of 160 randomly selected polymers shows that, for most polymers studied, the absolute deviations are less than 0.10. Figure 7 shows the scatter plot of the experimental vs. calculated refractive indices. For this data, an overall correlation of 0.81 was obtained. The - scheme in particular tends to overestimate n for nearly two-thirds of the samples with the average deviation around 0.07. Further examination of polymers with n > 1.71 (the maximum value in the calibration) shows that the - based n estimations were again overestimated with errors ranging between 0.01 and 0.09. The deviations could be attributed to the density predictions, since the QSPR model was not found to have a significantly high performance. To investigate this further, only cases for which experimental refractive index and density data were available were considered. For a set of 51 polymers, the mean absolute deviations were around 0.15 with errors in the range –0.21 (see Table S14 in the Supplementary Materials). Here again, the - estimates were found to be larger than the experimental n. These results suggest that the errors in the polarizability estimates for the hydrogen-terminated monomer units could also be contributing to the errors. Given these results, caution must therefore exercised when comparing the QSPR predictions for n with the - based estimates.
Figure 7.
Plot shows the observed vs. the Density Functional Theory (DFT)-predicted refractive indices. The density in Equation (1) is calculated using a QSPR model. An overall correlation of 0.81 was obtained. See Table S13 in the Supplementary Materials for additional details.
3.4. Analysis of Selected Monomers
Table 4 summarizes the predicted properties for selected monomers shown in Figure 8. The structures are likely to show good thermal stability as seen from the glass transition temperatures that are around 200 C along with relatively high weight loss temperatures ( > 350 C). The inclusion of groups such as cyclohexane have been shown to improve stability [73]. The predicted refractive indices are typically high and can be attributed to the presence of aromatic heterocycles and high sulfur content [74]. Substituents such as thioethers, thiazoles and nitro groups are also seen to increase refractive indices [67]. Analysis of the TDDFT calculations suggests that for a majority of the proposed structures, the absorption wavelengths peak at less than 300 nm (see plots in Figure S5 in the Supplementary Materials), and we expect these to have good optical transparency. Comparison of the QSPR and DFT-calculated refractive index estimates shows that—for the monomers: M0002, M0003, M0006, M0008, M0010—the deviation is not significantly high. The high deviation with respect to M0001 clearly suggests that there are limits to the extent of extrapolation that can be performed using the existing model. Table S15 in the Supplementary Materials list additional cases where there is a significant discrepancy between the QSPR and DFT predictions. Abbe numbers (listed in Table 4) for monomers M0002, M0009 and M0010 are relatively high, which should correspond to low wavelength dispersion. Birefringence depends on a number of factors such as the preferred orientations of the polymer chains, as well as the polarizability and van der Waals volume of the repeating units [75]. While low values are desirable, for the selected monomers, the calculated birefringence () is somewhat high. For a few cases, negative birefringences are observed. We attribute this to largely to the incorrect estimations of the DFT-calculated polarizabilities. Methyl-terminated structures in place of the standard hydrogen have been shown to improve the accuracy [26] and therefore could be used.
Table 4.
Summary of the calculated properties for selected monomers. For the refractive index , , and , the prediction uncertainties are also provided. is the refractive index calculated according to Equation (1) with polarizabilities obtained from DFT. The Abbe number is calculated according to Equation (2) and makes use of the DFT-calculated polarizabilities and QSPR based density estimation. Absorption maxima (in chloroform solvent) are calculated using Time-dependent Density Functional Theory (TD-DFT). MW, molecular weight.
| Structure | MW | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| M0001 | 927 | 1.98 ± 0.11 | 256 ± 27 | 438 ± 65 | 1.35 ± 0.23 | 1.79 | 7.79 | 0.09 | 367 |
| M0002 | 570 | 1.75 ± 0.05 | 226 ± 62 | 456 ± 56 | 1.37 ± 0.32 | 1.72 | 22.85 | 0.07 | 356 |
| M0003 | 571 | 1.74 ± 0.15 | 210 ± 51 | 398 ± 84 | 1.29 ± 0.16 | 1.67 | 5.85 | 0.09 | 420 |
| M0004 | 663 | 1.79 ± 0.10 | 242 ± 50 | 408 ± 65 | 1.36 ± 0.31 | 1.65 | 7.45 | 0.38 | 411 |
| M0005 | 801 | 1.80 ± 0.04 | 222 ± 47 | 466 ± 50 | 1.36 ± 0.22 | 1.98 | 1.98 | 0.05 | 429 |
| M0006 | 716 | 1.84 ± 0.06 | 206 ± 41 | 396 ± 74 | 1.27 ± 0.16 | 1.80 | 10.88 | −0.12 | 299 |
| M0007 | 637 | 1.78 ± 0.14 | 223 ± 42 | 439 ± 81 | 1.37 ± 0.25 | 1.76 | 3.49 | −0.05 | 455 |
| M0008 | 596 | 1.78 ± 0.09 | 180 ± 64 | 370 ± 87 | 1.33 ± 0.32 | 1.70 | 13.13 | −0.03 | 347 |
| M0009 | 649 | 1.72 ± 0.04 | 198 ± 85 | 387 ± 79 | 1.63 ± 0.44 | 1.90 | 33.36 | 0.03 | 257 |
| M0010 | 935 | 1.77 ± 0.11 | 226 ± 38 | 428 ± 60 | 1.44 ± 0.35 | 1.73 | 24.80 | −0.13 | 305 |
Figure 8.
Monomers selected from de novo runs.
4. Conclusions
Herein, a series of QSPR models are employed, which, in combination with a Darwinian evolution based search algorithm, facilitates the discovery of novel polymers that are able to satisfy several complementary properties. The designed monomers are generally seen to be synthesizable with polymerization for some candidates requiring specific conditions such as elevated temperatures or the presence of metal catalysts. For some monomers, the calculated birefringences were found to be high or negative. While low values are desirable, for the selected monomers, the calculated birefringence () is somewhat high. Birefringence depends on a number of factors such as the preferred orientations of the polymer chains, as well as the polarizability and van der Waals volume of the repeating units [75]. For a few cases, negative birefringences were observed. We attribute some of this largely to the incorrect estimations of the DFT-calculated polarizabilities. Methyl-terminated structures in place of the standard hydrogen have been shown to improve the accuracy [26] and may help to address the issue. Alternatively, methods such as copolymerization of monomers with different birefringences, or the addition of small birefringent crystals, may also be employed [76].
Although the models are sufficiently predictive, there still exist inherent discrepancies between property estimations and the experimental values. Incorporating information relating to the experimental uncertainties in combination with nonlinear methods such as kernel-based PLS regression [77] may help to address some of these issues. Since experimental data are somewhat limited and model extrapolation is not always reliable, future work will focus on methods such as semi-supervised learning [78] that aim to build better predictive models using unlabelled data as additional data. Performing DFT calculations at a high level theory for the numerous candidates produced is still a computational bottleneck. It is hoped that methods such as deep learning can help approximate such calculations in shorter timeframes [79].
Acknowledgments
The Norwegian Research Council (NFR) is acknowledged for financial support from the CLIMIT (Grant No. 233776) and for CPU resources granted through the NOTUR supercomputing programme. Rajesh Raju is thanked for helpful discussions on synthetic accessibility and polymerizability. We also thank ChemAxon (http://www.chemaxon.com) for free academic use of the Marvin package.
Abbreviations
The following abbreviations are used in this manuscript:
| QC | Quantum Chemistry |
| QSPR | Quantitative Structure Property Relationship |
| ML | Machine Learning |
| DFT | Density Functional Theory |
| TD-DFT | Time-Dependent Density Functional Theory |
Supplementary Materials
The following are available online at http://www.mdpi.com/2073-4360/10/1/103/s1. Table S1: Description of the structure variables used in the models; Tables S2–S12: Experimental (data taken from literature) and model predicted values for n, , , and solubility; Table S13: Experimental and predicted n measured at given wavelengths for different polymers; Table S14: Experimental and predicted n using DFT-based polarizability estimates and ; Table S15: Cases where large deviations between QSPR and DFT estimates for n are observed; Figure S1: Histogram of the predicted n for the designed monomers; Figure S2: Scatter plot of the molecular weights vs. the experimental n; Figure S3: Scatter plot of the molecular weights vs. the predicted n of designed monomers; Figure S4: Histogram of the synthetic accessibility scores for the designed monomers; Figure F5: Calculated UV-Vis spectra for different polymers.
Author Contributions
V.V. and B.K.A. conceived the study; V.V. designed and performed the experiments and analyzed the data; V.V. and B.K.A. wrote the paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Yetisen A.K., Montelongo Y., Butt H. Rewritable three-dimensional holographic data storage via optical forces. Appl. Phys. Lett. 2016;109:061106. doi: 10.1063/1.4960710. [DOI] [Google Scholar]
- 2.Kim K.C. Effective graded refractive-index anti-reflection coating for high refractive-index polymer ophthalmic lenses. Mater. Lett. 2015;160:158–161. doi: 10.1016/j.matlet.2015.07.108. [DOI] [Google Scholar]
- 3.Li X., Yu X., Han Y. Polymer thin films for antireflection coatings. J. Mater. Chem. C. 2013;1:2266–2285. doi: 10.1039/c2tc00529h. [DOI] [Google Scholar]
- 4.Sanders D.P. Advances in Patterning Materials for 193 nm Immersion Lithography. Chem. Rev. 2010;110:321–360. doi: 10.1021/cr900244n. [DOI] [PubMed] [Google Scholar]
- 5.Suwa M., Niwa H., Tomikawa M. High Refractive Index Positive Tone Photo-sensitive Coating. J. Photopolym. Sci. Technol. 2006;19:275–276. doi: 10.2494/photopolymer.19.275. [DOI] [Google Scholar]
- 6.Macdonald E.K., Shaver M.P. Intrinsic high refractive index polymers. Polym. Int. 2014;64:6–14. doi: 10.1002/pi.4821. [DOI] [Google Scholar]
- 7.Higashihara T., Ueda M. Recent Progress in High Refractive Index Polymers. Macromolecules. 2015;48:1915–1929. doi: 10.1021/ma502569r. [DOI] [Google Scholar]
- 8.Suzuki Y., Higashihara T., Ando S., Ueda M. Synthesis and Characterization of High Refractive Index and High Abbe’s Number Poly(thioether sulfone)s based on Tricyclo[5.2.1.02,6]decane Moiety. Macromolecules. 2012;45:3402–3408. doi: 10.1021/ma300379w. [DOI] [Google Scholar]
- 9.Balazs A.C., Emrick T., Russell T.P. Nanoparticle Polymer Composites: Where Two Small Worlds Meet. Science. 2006;314:1107–1110. doi: 10.1126/science.1130557. [DOI] [PubMed] [Google Scholar]
- 10.Martin R.L., Simon C.M., Smit B., Haranczyk M. In Silico Design of Porous Polymer Networks: High-Throughput Screening for Methane Storage Materials. J. Am. Chem. Soc. 2014;136:5006–5022. doi: 10.1021/ja4123939. [DOI] [PubMed] [Google Scholar]
- 11.Sharma V., Wang C., Lorenzini R.G., Ma R., Zhu Q., Sinkovits D.W., Pilania G., Oganov A.R., Kumar S., Sotzing G.A., et al. Rational design of all organic polymer dielectrics. Nat. Commun. 2014;5:4845. doi: 10.1038/ncomms5845. [DOI] [PubMed] [Google Scholar]
- 12.Bérubé N., Gosselin V., Gaudreau J., Côté M. Designing Polymers for Photovoltaic Applications Using ab Initio Calculations. J. Phys. Chem. C. 2013;117:7964–7972. doi: 10.1021/jp309800f. [DOI] [Google Scholar]
- 13.Martin S. Lattice Enumeration for Inverse Molecular Design Using the Signature Descriptor. J. Chem. Inf. Model. 2012;52:1787–1797. doi: 10.1021/ci3001748. [DOI] [PubMed] [Google Scholar]
- 14.Le T., Epa V.C., Burden F.R., Winkler D.A. Quantitative Structure-Property Relationship Modeling of Diverse Materials Properties. Chem. Rev. 2012;112:2889–2919. doi: 10.1021/cr200066h. [DOI] [PubMed] [Google Scholar]
- 15.Venkatraman V., Foscato M., Jensen V.R., Alsberg B.K. Evolutionary de novo design of phenothiazine derivatives for dye-sensitized solar cells. J. Mater. Chem. A. 2015;3:9851–9860. doi: 10.1039/C5TA00625B. [DOI] [Google Scholar]
- 16.Wang C., Pilania G., Boggs S., Kumar S., Breneman C., Ramprasad R. Computational strategies for polymer dielectrics design. Polymer. 2014;55:979–988. doi: 10.1016/j.polymer.2013.12.069. [DOI] [Google Scholar]
- 17.Venkatraman V., Alsberg B.K. A quantitative structure–property relationship study of the photovoltaic performance of phenothiazine dyes. Dyes Pigments. 2015;114:69–77. doi: 10.1016/j.dyepig.2014.10.026. [DOI] [Google Scholar]
- 18.Lameijer E.W., Kok J.N., Bäck T., IJzerman A.P. The Molecule Evoluator. An Interactive Evolutionary Algorithm for the Design of Drug-Like Molecules. J. Chem. Inf. Model. 2006;46:545–552. doi: 10.1021/ci050369d. [DOI] [PubMed] [Google Scholar]
- 19.Chu Y., Heyndrickx W., Occhipinti G., Jensen V.R., Alsberg B.K. An Evolutionary Algorithm for de Novo Optimization of Functional Transition Metal Compounds. J. Am. Chem. Soc. 2012;134:8885–8895. doi: 10.1021/ja300865u. [DOI] [PubMed] [Google Scholar]
- 20.Duchowicz P.R., Fioressi S.E., Bacelo D.E., Saavedra L.M., Toropova A.P., Toropov A.A. QSPR studies on refractive indices of structurally heterogeneous polymers. Chemom. Intell. Lab. Syst. 2015;140:86–91. doi: 10.1016/j.chemolab.2014.11.008. [DOI] [Google Scholar]
- 21.Katritzky A.R., Lobanov V.S., Karelson M. QSPR: The correlation and quantitative prediction of chemical and physical properties from structure. Chem. Soc. Rev. 1995;24:279–287. doi: 10.1039/cs9952400279. [DOI] [Google Scholar]
- 22.Astray G., Cid A., Moldes O., Ferreiro-Lage J.A., Gaĺvez J.F., Mejuto J.C. Prediction of Refractive Index of Polymers Using Artificial Neural Networks. J. Chem. Eng. Data. 2010;55:5388–5393. doi: 10.1021/je100885f. [DOI] [Google Scholar]
- 23.Liu W., Cao C. Artificial neural network prediction of glass transition temperature of polymers. Colloid Polym. Sci. 2009;287:811–818. doi: 10.1007/s00396-009-2035-y. [DOI] [Google Scholar]
- 24.Toropova A.P., Toropov A.A., Kudyshkin V.O., Leszczynska D., Leszczynski J. Optimal descriptors as a tool to predict the thermal decomposition of polymers. J. Math. Chem. 2014;52:1171–1181. doi: 10.1007/s10910-014-0323-3. [DOI] [Google Scholar]
- 25.Bicerano J. Prediction of Polymer Properties. 3rd ed. CRC Press; Boca Raton, FL, USA: 2002. [Google Scholar]
- 26.Maekawa S., Moorthi K. Polymer Optical Constants from Long-Range Corrected DFT Calculations. J. Phys. Chem. B. 2016;120:2507–2516. doi: 10.1021/acs.jpcb.5b10203. [DOI] [PubMed] [Google Scholar]
- 27.Venkatraman V., Gupta M., Foscato M., Svendsen H.F., Jensen V.R., Alsberg B.K. Computer-aided molecular design of imidazole-based absorbents for CO2 capture. Int. J Greenh. Gas Control. 2016;49:55–63. doi: 10.1016/j.ijggc.2016.02.023. [DOI] [Google Scholar]
- 28.Venkatraman V., Abburu S., Alsberg B.K. Artificial evolution of coumarin dyes for dye sensitized solar cells. Phys. Chem. Chem. Phys. 2015;17:27672–27682. doi: 10.1039/C5CP04624F. [DOI] [PubMed] [Google Scholar]
- 29.Foscato M., Occhipinti G., Venkatraman V., Alsberg B.K., Jensen V.R. Automated Design of Realistic Organometallic Molecules from Fragments. J. Chem. Inf. Model. 2014;54:767–780. doi: 10.1021/ci4007497. [DOI] [PubMed] [Google Scholar]
- 30.Foscato M., Venkatraman V., Occhipinti G., Alsberg B.K., Jensen V.R. Automated Building of Organometallic Complexes from 3D Fragments. J. Chem. Inf. Model. 2014;54:1919–1931. doi: 10.1021/ci5003153. [DOI] [PubMed] [Google Scholar]
- 31.Degen J., Wegscheid-Gerlach C., Zaliani A., Rarey M. On the Art of Compiling and Using ‘Drug-Like’ Chemical Fragment Spaces. Chem. Med. Chem. 2008;3:1503–1507. doi: 10.1002/cmdc.200800178. [DOI] [PubMed] [Google Scholar]
- 32.Lorentz H.A. Ueber die Beziehung zwischen der Fortpflanzungsgeschwindigkeit des Lichtes und der Körperdichte. Ann. Phys. Chem. 1880;245:641–665. doi: 10.1002/andp.18802450406. [DOI] [Google Scholar]
- 33.Lorenz L. Ueber die Refractionsconstante. Ann. Phys. Chem. 1880;247:70–103. doi: 10.1002/andp.18802470905. [DOI] [Google Scholar]
- 34.Terui Y., Ando S. Coefficients of molecular packing and intrinsic birefringence of aromatic polyimides estimated using refractive indices and molecular polarizabilities. J. Polym. Sci. Polym. Phys. 2004;42:2354–2366. doi: 10.1002/polb.20114. [DOI] [Google Scholar]
- 35.Nakabayashi K., Imai T., Fu M.C., Ando S., Higashihara T., Ueda M. Poly(phenylene thioether)s with Fluorene-Based Cardo Structure toward High Transparency, High Refractive Index, and Low Birefringence. Macromolecules. 2016;49:5849–5856. doi: 10.1021/acs.macromol.6b01182. [DOI] [Google Scholar]
- 36.Xiao X., Qiu X., Kong D., Zhang W., Liu Y., Leng J. Optically transparent high temperature shape memory polymers. Soft Matter. 2016;12:2894–2900. doi: 10.1039/C5SM02703A. [DOI] [PubMed] [Google Scholar]
- 37.Mark J.E. The Polymer Data Handbook. 2nd ed. Oxford University Press; Oxford, UK: 2009. [Google Scholar]
- 38.Otsuka S., Kuwajima I., Hosoya J., Xu Y., Yamazaki M. PoLyInfo: Polymer Database for Polymeric Materials Design; Proceedings of the 2011 International Conference on Emerging Intelligent Data and Web Technologies, Institute of Electrical and Electronics Engineers; Tirana, Albania. 7–9 September 2011. [Google Scholar]
- 39.Katritzky A.R., Sild S., Karelson M. Correlation and Prediction of the Refractive Indices of Polymers by QSPR. J. Chem. Inf. Model. 1998;38:1171–1176. doi: 10.1021/ci980087w. [DOI] [Google Scholar]
- 40.Katritzky A.R., Sild S., Lobanov V., Karelson M. Quantitative Structure-Property Relationship (QSPR) Correlation of Glass Transition Temperatures of High Molecular Weight Polymers. J. Chem. Inf. Model. 1998;38:300–304. doi: 10.1021/ci9700687. [DOI] [Google Scholar]
- 41.Yu X., Yi B., Wang X. Prediction of refractive index of vinyl polymers by using density functional theory. J. Comput. Chem. 2007;28:2336–2341. doi: 10.1002/jcc.20752. [DOI] [PubMed] [Google Scholar]
- 42.Venkatraman V., Alsberg B.K. KRAKENX: Software for the generation of alignment-independent 3D descriptors. J. Mol. Model. 2016;22:1–8. doi: 10.1007/s00894-016-2957-5. [DOI] [PubMed] [Google Scholar]
- 43.Venkatraman V., Åstrand P.O., Alsberg B.K. Quantitative structure–property relationship modeling of Grätzel solar cell dyes. J. Comput. Chem. 2013;35:214–226. doi: 10.1002/jcc.23485. [DOI] [PubMed] [Google Scholar]
- 44.Venkatraman V., Alsberg B.K. Quantitative structure–property relationship modelling of thermal decomposition temperatures of ionic liquids. J. Mol. Liq. 2016;223:60–67. doi: 10.1016/j.molliq.2016.08.023. [DOI] [Google Scholar]
- 45.Abdi H. Partial least squares regression and projection on latent structure regression (PLS Regression) Wiley Interdiscip. Rev. Comput. Stat. 2010;2:97–106. doi: 10.1002/wics.51. [DOI] [Google Scholar]
- 46.Ziegler A., König I.R. Mining data with random forests: Current options for real-world applications. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2013;4:55–63. doi: 10.1002/widm.1114. [DOI] [Google Scholar]
- 47.Wold S., Sjöström M., Eriksson L. PLS-regression: A basic tool of chemometrics. Chemom. Intell. Lab. Syst. 2001;58:109–130. doi: 10.1016/S0169-7439(01)00155-1. [DOI] [Google Scholar]
- 48.Meinshausen N. quantregForest: Quantile Regression Forests. 2016. R package version 1.3-5.
- 49.ChemAxon Marvin 5.9.3. [(accessed on 22 January 2018)];2012 Available online: http://www.chemaxon.com/products/marvin/marvinsketch.
- 50.O’Boyle N.M., Banck M., James C.A., Morley C., Vandermeersch T., Hutchison G.R. Open Babel: An open chemical toolbox. J. Chem. 2011;3:33. doi: 10.1186/1758-2946-3-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Rappe A.K., Casewit C.J., Colwell K.S., Goddard W.A., III, Skiff W.M. UFF, a full periodic table force field for molecular mechanics and molecular dynamics simulations. J. Am. Chem. Soc. 1992;114:10024–10035. doi: 10.1021/ja00051a040. [DOI] [Google Scholar]
- 52.Stewart J.J.P. MOPAC2016. Stewart Computational Chemistry; Colorado Springs, CO, USA: 2016. [(accessed on 22 January 2018)]. Available online: http://OpenMOPAC.net. [Google Scholar]
- 53.Stephens P.J., Devlin F.J., Chabalowski C.F., Frisch M.J. Ab Initio Calculation of Vibrational Absorption and Circular Dichroism Spectra Using Density Functional Force Fields. J. Phys. Chem. 1994;98:11623–11627. doi: 10.1021/j100096a001. [DOI] [Google Scholar]
- 54.Yanai T., Tew D.P., Handy N.C. A new hybrid exchange-correlation functional using the Coulomb-attenuating method (CAM-B3LYP) Chem. Phys. Lett. 2004;393:51–57. doi: 10.1016/j.cplett.2004.06.011. [DOI] [Google Scholar]
- 55.Krishnan R., Binkley J.S., Seeger R., Pople J.A. Self-consistent molecular orbital methods. XX. A basis set for correlated wave functions. J. Chem. Phys. 1980;72:650–654. doi: 10.1063/1.438955. [DOI] [Google Scholar]
- 56.Reish M.E., Nam S., Lee W., Woo H.Y., Gordon K.C. A Spectroscopic and DFT Study of the Electronic Properties of Carbazole-Based D-A Type Copolymers. J. Phys. Chem. C. 2012;116:21255–21266. doi: 10.1021/jp307552z. [DOI] [Google Scholar]
- 57.Frisch M.J., Trucks G.W., Schlegel H.B., Scuseria G.E., Robb M.A., Cheeseman J.R., Scalmani G., Barone V., Mennucci B., Petersson G.A., et al. Gaussian 09 Revision D.01. Gaussian Inc.; Wallingford, CT, USA: 2009. [Google Scholar]
- 58.R Core Team . R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; R Core Team; Vienna, Austria: 2016. [Google Scholar]
- 59.Mevik B.H., Wehrens R. The pls Package: Principal Component and Partial Least Squares Regression in R. J. Stat. Softw. 2007;18:1–24. doi: 10.18637/jss.v018.i02. [DOI] [Google Scholar]
- 60.Liaw A., Wiener M. Classification and Regression by randomForest. R News. 2002;2:18–22. [Google Scholar]
- 61.Smola A.J., Schölkopf B. A tutorial on support vector regression. Stat. Comput. 2004;14:199–222. doi: 10.1023/B:STCO.0000035301.49549.88. [DOI] [Google Scholar]
- 62.Chong I.G., Jun C.H. Performance of some variable selection methods when multicollinearity is present. Chemom. Intell. Lab. Syst. 2005;78:103–112. doi: 10.1016/j.chemolab.2004.12.011. [DOI] [Google Scholar]
- 63.Breiman L. Random Forests. Mach. Learn. 2001;45:5–32. doi: 10.1023/A:1010933404324. [DOI] [Google Scholar]
- 64.Jang S.H., Jen A.Y. Comprehensive Nanoscience and Technology. Elsevier; Amsterdam, The Netherlands: 2011. Structured Organic Non-Linear Optics; pp. 143–187. [Google Scholar]
- 65.Vleeschouwer F.D., Speybroeck V.V., Waroquier M., Geerlings P., Proft F.D. Electrophilicity and Nucleophilicity Index for Radicals. Org. Lett. 2007;9:2721–2724. doi: 10.1021/ol071038k. [DOI] [PubMed] [Google Scholar]
- 66.Groh W., Zimmermann A. What is the lowest refractive index of an organic polymer? Macromolecules. 1991;24:6660–6663. doi: 10.1021/ma00025a016. [DOI] [Google Scholar]
- 67.Javadi A., Shockravi A., Koohgard M., Malek A., Shourkaei F.A., Ando S. Nitro-substituted polyamides: A new class of transparent and highly refractive materials. Eur. Polym. J. 2015;66:328–341. doi: 10.1016/j.eurpolymj.2015.02.032. [DOI] [Google Scholar]
- 68.Liu J., Ueda M. High refractive index polymers: fundamental research and practical applications. J. Mater. Chem. 2009;19:8907–8919. doi: 10.1039/b909690f. [DOI] [Google Scholar]
- 69.Ertl P., Schuffenhauer A. Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J. Chem. 2009;1:8. doi: 10.1186/1758-2946-1-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Ertl P., Schuffenhauer A. SA_Score. [(accessed on 01 March 2017)]; Available online: https://github.com/rdkit/rdkit/tree/master/Contrib/SA_Score.
- 71.Miller-Chou B.A., Koenig J.L. A review of polymer dissolution. Prog. Polym. Sci. 2003;28:1223–1270. doi: 10.1016/S0079-6700(03)00045-5. [DOI] [Google Scholar]
- 72.Viera A., Garrett J. Understanding interobserver agreement: The kappa statistic. Fam. Med. 2005;37:360–363. [PubMed] [Google Scholar]
- 73.You N.H., Higashihara T., Yasuo S., Ando S., Ueda M. Synthesis of sulfur-containing poly(thioester)s with high refractive indices and high Abbe numbers. Polym. Chem. 2010;1:480–484. doi: 10.1039/b9py00326f. [DOI] [Google Scholar]
- 74.Zhang G., Ren H.H., Li D.S., Long S.R., Yang J. Synthesis of highly refractive and transparent poly(arylene sulfide sulfone) based on 4,6-dichloropyrimidine and 3,6-dichloropyridazine. Polymer. 2013;54:601–606. doi: 10.1016/j.polymer.2012.12.008. [DOI] [Google Scholar]
- 75.Song Y., Wang J., Li G., Sun Q., Jian X., Teng J., Zhang H. Synthesis, characterization and optical properties of fluorinated poly(aryl ether)s containing phthalazinone moieties. Polymer. 2008;49:4995–5001. doi: 10.1016/j.polymer.2008.09.003. [DOI] [Google Scholar]
- 76.Tagaya A., Koike Y. Compensation and control of the birefringence of polymers for photonics. Polym. J. 2012;44:306–314. doi: 10.1038/pj.2011.141. [DOI] [Google Scholar]
- 77.Rosipal R. Kernel Partial Least Squares for Nonlinear Regression and Discrimination. Neural Netw. World. 2003;13:291–300. [Google Scholar]
- 78.Levatić J., Ceci M., Kocev D., Džeroski S. Semi-supervised Learning for Multi-target Regression. In: Appice A., Ceci M., Loglisci C., Manco G., Masciari E., Ras Z.W., editors. New Frontiers in Mining Complex Patterns: Third International Workshop, NFMCP 2014, Held in Conjunction with ECML-PKDD 2014, Nancy, France, 19 September 2014, Revised Selected Papers. Springer International Publishing; Cham, Switzerland: 2015. pp. 3–18. [Google Scholar]
- 79.Smith J.S., Isayev O., Roitberg A.E. ANI-1: An extensible neural network potential with DFT accuracy at force field computational cost. Chem. Sci. 2017;8:3192–3203. doi: 10.1039/C6SC05720A. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








