Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2025 Dec 3;17(50):67906–67913. doi: 10.1021/acsami.5c17708

Forecasting Battery Electrode Performance via Electrochemical Fluorescence Microscopy and Machine-Learning

Karla Negrete , Marco-Tulio Fonseca Rodrigues , Daniel P Abraham , Maureen H Tang §,*
PMCID: PMC12723639  PMID: 41332278

Abstract

Predicting lithium-ion battery performance is hindered by microscale electrode heterogeneities invisible to conventional diagnostics. Here, we combine electrochemical fluorescence microscopy (EFM), which maps electronic connectivity by visualizing an electrofluorophore reaction distribution, with a multitask ElasticNet regression to forecast discharge capacity from spatial heterogeneity. Analyzing 196 images from six pilot-scale LiNi0.5Mn0.3Co0.2O2 cathodes with varying carbon loadings, we extract 62 descriptors that capture morphology and texture. A compact five-feature model predicts capacity across eight discharge rates, achieving a per-target R 2 of up to 0.63 and an overall R 2 of 0.92, with a mean absolute percentage error of less than 2%. This performance rivals impedance-based approaches while avoiding their reliance on postformation data and incomplete electronic network information. Our facile and rapid, image-driven method may enable electrode quality control upstream of costly cell assembly to offer a transformative tool for data-driven battery research and manufacturing.

Keywords: electrochemical fluorescence microscopy, machine learning, battery electrodes, performance predictions, data-driven manufacturing


graphic file with name am5c17708_0007.jpg


graphic file with name am5c17708_0005.jpg

1. Introduction

Despite tremendous importance to electric vehicles and consumer devices, predicting the performance of lithium-ion batteries (LIBs) remains challenging due to microscale heterogeneities in electrode structure that evade conventional diagnostics. , Nominally identical cells can diverge widely in capacity and lifespan, with the weakest units limiting module safety and reliability. Variations in the spatial distribution of active material, conductive carbon, and binder within the cathode influence electronic connectivity but remain invisible to bulk electrochemical measurements. While machine learning (ML) has shown promise for state estimation and degradation forecasting, most approaches rely on time-series voltage or current data that are path-dependent, low-resolution, and unavailable until after significant cycling. , This dependence on time-series data limits their utility for early stage quality control. A predictive framework that can link electrode-level heterogeneity to downstream performance, before cell assembly and formation, which account for over 60% of manufacturing cost, could transform battery production by enabling proactive image screening of suboptimal electrodes before they enter costly electrochemical workflows.

To address this gap, we introduce an image-based framework that couples electrochemical fluorescence microscopy (EFM) with a penalized multitask regression model to forecast discharge capacity directly from mesoscale spatial heterogeneity. In EFM, an electrofluorophore emits light only where continuous electronic networks exist, revealing connectivity with high spatial resolution. We applied this approach to six pilot-scale LiNi0.5Mn0.3Co0.2O2 cathodes with carbon black loadings from 1–5 wt %, yielding 196 fluorescence images. From each image, we extracted 62 statistical descriptors capturing spatial autocorrelation, texture, and morphology. Trained solely on these descriptors, our model achieves per-target R 2 values up to 0.63 and an overall R 2 of 0.92 with mean absolute percent error below 2%, rivaling more complex, multimodal methods that depend on post-formation electrochemical or impedance data. The consistent selection of a compact, five-feature set across all targets underscores the model’s stability and interpretability. By relying solely on intensity-based texture and blob-derived morphological features, our framework enables early stage, facile, and rapid quality control, paving the way for predictive diagnostics upstream of formation and accelerating the path toward data-driven manufacturing.

Figure illustrates the operating principle of EFM. The cross-section of a composite cathode depicts suboptimal manufacturing, in which some particles of active material form strong electronic connections with conductive carbon and binder (left) while others are electronically isolated (right). Electronically isolated regions of electrodes are not only dead weight/volume within the battery, which negatively effects overall performance metrics, but also regions which can experience extreme potentials during cycling, leading to side reactions that can generate gases and other detrimental products. In EFM, the optical cell is filled with an electrolyte containing a reversible electrofluorophore that fluoresces only upon reduction at 1.95 V vs Li/Li+, well below typical cathode lithiation (∼3.0 V vs Li/Li+). Polarization at −7.6 mA/cm2 selectively reduces the fluorophore in electronically connected regions, allowing spatial mapping of electronic accessibility.

1.

1

The EFM mechanism. Cross-sectional schematic of an electrode (a): homogeneous regions within the composite transfer electrons to the electrofluorophore in the electrolyte, producing fluorescence. Bright areas in top-down images indicate strong electronic connectivity; dark regions reveal electronically isolated or disconnected particles. EFM images comparing the top surface of two NMC electrodes: (b) high-performance electrode from CAMP pilot-scale facility and (c) low-performance electrode fabricated in-house via doctor blade.

Under the microscope, fluorescence intensity directly reports the concentration of reduced fluorophore, which corresponds to the accessibility of the electronic network. Bottom panels of Figure contrast EFM images of high- and low-quality NMC electrodes. An electrode made with a pilot-scale coater at Argonne’s cell analysis modeling and prototyping (CAMP) facility exhibits near-uniform fluorescence with subtle gradients. In contrast, a doctor-bladed, lab-scale electrode shows pronounced dark regions, previously linked to electronically isolated particles. Because EFM probes only the top surface, where electronic connectivity is likely lowest, observed heterogeneity therefore provides a conservative metric of network quality.

While qualitative variations in Figure b,c reflect differences in manufacturing scale and process control, our goal was to move beyond visual inspection and quantitatively compare heterogeneity among high-quality, pilot-scale electrodes. Although our prior work correlated EFM fluorescence with capacity fade in lab-scale composites, its application to industrially relevant electrodes and its potential to yield predictive heterogeneity descriptors remains untested. Here, we extract 62 image-derived heterogeneity features from the CAMP library of pilot-scale electrodes and pair them with half-cell rate-performance data. We show that heterogeneity metrics alone can predict discharge capacity, establishing a new approach to performance evaluation in electrode manufacturing.

2. Results and Discussion

Six pilot-scale electrodes were fabricated at Argonne’s CAMP facility using BASF TODA NMC532 active material and SOLVAY 5130 PVDF binder, varying only in carbon black type: Timcal Super C45 (t) or Cabot LITX 200 (c), at loadings from 1 to 5 wt % (Figure ). Formulation labels such as “90:5­(t):5” denote 90 wt % NMC532, 5 wt % carbon, and 5 wt % PVDF. The medium-structure Super C45 network typically requires 2 to 5 wt % to percolate reliably. In contrast, the high-structure LITX 200 achieves percolation at 1 to 3 wt %, potentially enabling higher active-material content and improved energy density. , These differences are evident when comparing equivalent carbon content (96:2:2): cells with LITX 200 consistently outperform Super C45 in discharge capacity (Supporting Information Figure S1). EFM images (lower panels, Figure a) reveal uniform fluorescence at 5 wt %, with increasing fragmentation (“peppered” dark regions) at 2 to 4 wt %, evolving into discrete disconnected islands below 1.5 wt %.

2.

2

Contrasting views of electronic transport in composite electrodes. (a) Six NMC532 cathodes were fabricated with Timcal Super C45 (diamond) or Cabot LITX 200 (circle) at 1–5 wt % carbon. EFM images quantify a fluorescence-derived heterogeneity index that decreases with carbon loading, whereas charge-transfer resistance (R ct) from EIS paradoxically increases. (b) Schematic illustration of parallel resistors shows why EIS underestimates heterogeneity: only particles well connected to the carbon binder domain contribute to the measured R ct, while disconnected regions remain invisible. EFM, by contrast, resolves both connected and disconnected domains, directly capturing transport inhomogeneities.

We quantified heterogeneity using Shannon entropy, here termed the heterogeneity index, computed from the normalized 256-bin intensity histogram of each image. Entropy values spanned 6.6 bits (most uniform, 5 wt %) to 7.3 bits (most heterogeneous, 1.5 wt %), where increments of 0.1 bit correspond to perceptible shifts from continuous fluorescence to isolated dead zones (Figure a). By contrast, charge-transfer resistance (R ct) measured by EIS exhibits the opposite trend, in which the 5 wt % formulation shows nearly double the R ct of lower-loading electrodes. This discrepancy occurs because EIS is a bulk, low-perturbation probe that lacks spatial resolution. It preferentially reports the lowest-resistance pathways, akin to the brightest hotspots in EFM, while overlooking inactive regions. Consequently, it systematically underestimates the true extent of transport limitations. As Newman has shown, standard interpretations of R ct in porous composites are misleading without explicit percolation models. Figure b illustrates this concept: active particles act as resistors in parallel, but only those well connected to the carbon binder domain contribute to the measured R ct. This measurement inflates the apparent connectivity of the film and conceals resistive regions of the underlying electronic network. By contrast, the fluorescence-derived heterogeneity index from EFM captures both connected and disconnected domains and directly reveals electronic transport inhomogeneities that EIS cannot access.

From 196 high-magnification EFM images, we extracted 62 quantitative heterogeneity descriptors spanning global spatial statistics, patch-wise variation, radiomic texture, and blob metrics, indexed by electrode formulation. To remove redundancy, we computed Pearson correlation coefficients across descriptors and discarded one member of each pair with |r| > 0.85, yielding 15 independent descriptors (Supporting Information Figure S4). For formulation-resolved analysis, each feature was Z-score normalized across formulations to enable direct comparison of heterogeneous metrics. Figure highlights the most diagnostic descriptors across carbon loadings: blob count (connected-component analysis of dark particle-like regions), gray-level run length matrix (GLRLM, low gray-level run emphasis), gray-level co-occurrence matrix (GLCM, sum average), heterogeneity index (Shannon entropy), Moran’s I (global spatial autocorrelation), and local binary pattern with radius = 1 and 8 neighbors (LBP R=1,P=8, energy statistic). We further examine the individual and cumulative contributions of these top features to model predictions using SHAP (SHapley Additive exPlanations) analysis, where features with higher absolute mean SHAP values are deemed more important. As shown in Supporting Information Figure S5, blob count is the most influential single feature, with an absolute mean SHAP value of 0.4, whereas the combined effect of the top six features reaches 0.9.

3.

3

Feature variability across electrode formulations. Violin plots display the distribution of selected image-derived features after z-scoring, highlighting formulation-dependent trends. Features include blob count (disconnected active material particle density), GLRLM (gray-level run-length measures of texture), GLCM (co-occurrence-based texture metrics), heterogeneity index (Shannon entropy of the intensity histogram), Moran’s I (spatial autocorrelation), and LBP (local binary patterns, pixel neighborhood descriptors). Each color corresponds to a distinct electrode formulation, with carbon loading and source specified in the legend. Shifts in median position capture systematic differences between formulations. For example, blob count, heterogeneity index, and Moran’s I vary strongly with carbon loading, whereas other descriptors such as GLRLM reveal more subtle but consistent formulation-dependent trends.

The top three descriptors are blob count, the heterogeneity index, and Moran’s I. Blob count quantifies discrete low-intensity regions after thresholding; higher blob counts at low carbon loadings are consistent with a weaker electronic network and fewer actively connected particles. The heterogeneity index and Moran’s I are global descriptors that capture complementary aspects of large-scale intensity structure. A higher heterogeneity index denotes a broader, more disordered intensity distribution, indicating increased electronic heterogeneity. Moran’s I measures global spatial autocorrelation: values near +1 indicate spatially continuous domains, near 0 randomness, and near −1 checkerboard-like alternation. In our images, elevated Moran’s I at low carbon loadings reflects clustered, fluorescence-rich “islands” of connected active particles separated by dark regions of isolated active particles. Texture descriptors provide complementary, scale-dependent information. LBP R=1, P=8 energy measures the concentration of local binary patterns: high energy indicates dominance of a few repeating microtextures, whereas low energy indicates diverse local motifs. Experimentally, higher LBP energy at increased carbon content corresponds to the fine “peppering” of intensity that we establish as the baseline homogeneity. GLRLM quantifies consecutive runs of identical gray levels along multiple directions (0°, 45°, 90°, 135°); the predominant GLRLM metric here is low gray-level run emphasis (LGRE), which up-weights runs at darker intensities. Because LGRE pools contributions from both short and long low-intensity runs, its median may be less sharply separated across loadings even though it explains substantial image variability. Finally, the GLCM SumAverage (averaged over 0°, 45°, 90°, 135°) captures the expected summed intensity of neighboring pixel pairs: differences in SumAverage arise from the bright-connected islands and dark isolated regions at low carbon versus the more uniformly peppered intensity field at higher carbon. Defining such formulation-resolved trends in image-derived heterogeneity features offers substantial value for quality control in electrode fabrication. Given that fabrication accounts for ∼45% of cell-production cost and that a 5.3 GWh yr–1 plant incurs $140–180 million USD GWh–1 annually, even minor improvements in quality control can yield substantial savings.

Although formulation-resolved trends are informative, our primary aim was to establish a proof-of-concept: can a compact set of image-derived heterogeneity features predict discharge capacity under stringent modeling constraints? From 196 EFM images, each paired with the mean capacity of its electrode formulation (six formulations total), we regressed 15 image features against capacities at eight protocol stages (C/25 to 2C). With only six distinct formulations, the problem is underdetermined for ordinary least-squares (OLS), since the number of predictors exceeds the number of independent inputs. To address this, we implemented a penalized multitask ElasticNet, which jointly learns across all eight discharge rates and enforces shared sparsity to stabilize estimates. Nested, stratified 5-fold cross-validation with a held-out test partition was used, with all preprocessing, feature selection, and hyperparameter tuning confined to the inner folds (Supporting Information Figure S6). The procedure selected α = 0.0336 and l1_ratio = 0.9, where the l 1 term eliminates noninformative features by driving coefficients to zero. The l 2 term shrinks the remaining coefficients toward each other to stabilize estimates and preserve correlated predictors. With l1_ratio = 0.9, the model strongly favors sparsity while retaining some grouping.

Despite the demanding regime, the multitask ElasticNet model consistently selected only 3–5 features (well below the number of formulations) and outperformed single-task OLS in cross-validation by ΔR 2 ≈ 0.10, while reducing fold-to-fold variance 4-fold (Supporting Information Figure S7). On the independent hold-out set, per-target R 2 reached up to 0.63 (C/25) and 0.61 (second C/25 stage) with relative error <1.2% and RMSE < 2.6. Across faster rates the model achieved R 2 = 0.51 to −0.60 (C/10 to 2.0C) (Figure ). Aggregated across all eight targets, the final model yields R 2 = 0.92, RMSE = 3.53, and mean absolute error <2%. Supplementary Table S3 reports the fully fitted equations along with the corresponding feature weights. This level of performance is consistent with or surpasses impedance-based approaches. Zhang et al. achieved R 2 ≈ 0.90 with mean absolute percent error ∼2.0% using full EIS spectra. Liu et al. reported lower RMSE (∼1.1%) using variational autoencoders on impedance data. In contrast, Jones et al. demonstrated probabilistic capacity forecasts with higher test error (∼8.2%) from single-scan EIS. Our image-based regression achieves comparable or superior accuracy while avoiding the limitations of EIS, which probes only part of the electrode’s electronic network.

4.

4

Predicted vs measured discharge capacities across rate protocols. Each point shows a formulation-level prediction from the multitask ElasticNet model, trained on 15 image features. Despite only six unique formulations, the model selects 3 to 5 predictors and achieves test R 2 up to 0.63, with errors under 3% and RMSE from 2.06 to 4.97 mAh. Overall, the model attains an aggregate R 2 = 0.92, RMSE of 3.53 mAh, and mean absolute error below 2% across all rate conditions.

To address concerns of information leakage due to shared formulation-level capacities, we also attempt a leave-one-formulation-out analysis. In this approach, all images from one formulation are held out for testing while training is conducted on the remaining formulations, ensuring strict independence between training and test data. While conceptually the most conservative approach, leave-one-formulation-out is less suited to this compact six-formulation data set, as it reduces variability within the training folds. Comparing the workflow (Supporting Information Figure S8) and results (Supporting Information Figure S10) confirm that, as expected, the leave-one-formulation-out framework can predict performance when interpolating for intermediate carbon loadings (1.5%, 2%, and 4%), but accuracy degrades when extrapolating to carbon loadings (5% and 1%).

3. Conclusion

In summary, this work has demonstrated that EFM can predict electrode performance from ex situ visualizations of local electronic connectivity. EFM requires relatively inexpensive and accessible equipment and can be obtained before cell assembly or cycling. Here, we show that under stringent formulation constraints, a small set of image-derived features can predict discharge capacity with accuracy comparable to state-of-the-art electrochemical methods. Our proof-of-concept offers promising evidence that expanding this framework to broader formulation diversity and true cell-to-cell variability could generalize relationships across materials, architectures, and manufacturing conditions. Performance predictions via EFM may expedite battery R&D by reducing the number of cell experiments required to determine the viability of electrode materials or formulations. For manufacturing quantity control, EFM may offer robust classification to detect defected electrodes upstream of cell assembly for faster feedback and reduced scrap rates. Scaling and validating this approach will require collaboration among national laboratories, academia, and industry, establishing the foundation for performance-guided optimization in electrode manufacturing. Ultimately, a closed-loop data-driven manufacturing design will rely on frameworks such as this one to connect electrode fabrication, electronic structure, and battery performance.

4. Experimental Section

4.1. Electrode Preparation and Battery Cycling

4.1.1. Electrode Formulations

Six pilot-scale cathode formulations were fabricated at Argonne’s CAMP Facility using BASF TODA NMC532 active material and Solvay 5130 PVDF binder. The formulations varied only in carbon black type, either Timcal Super C45 (BET ≈ 45 m2/g, OAN ≈ 36 mL/100 g) or Cabot LITX 200 (BET 150–1500 m2/g, OAN 100–200 mL/100 g), and in loading, ranging from 1 to 5 wt %. Slurries were prepared in N-methylpyrrolidone (NMP), cast onto 20 μ m aluminum foil, dried, and calendered to a final thickness of 53–71 μ m, corresponding to 34.8–35.6% porosity. Final blends spanned a formulation range from an additive-heavy 90:5:5 to a lean 98:1:1 (active/carbon/binder, wt %) blend.

4.1.2. Coin Cell Assembly

Cathodes were punched into 14 mm disks and dried under dynamic vacuum at 120 °C overnight. 2032-format coin cells were assembled using Li metal anodes (15 mm), Celgard 2500 separators (16 mm), and 40 μL of the electrolyte (1.2 M LiPF6 in EC/EMC, 3:7 wt %). All components were handled in a dry room to minimize moisture uptake.

4.1.3. Electrochemical Measurements

Cathode capacities were evaluated on a Maccor Series 4100 cycler between 2.5 and 4.3 V. For each formulation (10–16 cells), rate capability was assessed with two cycles each at C/25, C/10, C/5, C/2, 1, 1.5, and 2C, followed by two final C/25 cycles to re-establish baseline. Each charge half-cycle terminated with a hold at 4.3 V until the current had decayed to below C/25. Discharge was always performed under constant current. A 2 min rest followed each half-cycle. Capacities are reported per gram of active material.

Electrochemical impedance spectroscopy (EIS) was performed on triplicate samples for each cathode. These tests used full-cells, obtained by pairing each cathode with the same type of CAMP-fabricated anode (91.8 wt % Superior Graphite SLC1506T, 2 wt % Timcal C45, 6 wt % PVDF, 0.17 wt % oxalic acid). Spectra were collected after five cycles (3x C/10, 2x C/2), and after equilibration of the cells at 3.8 V.

4.2. Electrochemical Fluorescent Microscopy

4.2.1. Optical Cell Assembly

Electrochemical fluorescent microscopy (EFM) experiments were conducted in a modified ECC-Opto-10 optical cell (El-Cell) fitted with a 1.1 mm-thick FTO conductive window (Saida Glass Co.). The cell was assembled face-to-face, with the cathode at the bottom. A 25 μ m PTFE separator (McMaster-Carr), laser-cut with nine 1 mm openings, was placed above the cathode. The cell was flooded with an electro-fluorophore electrolyte (1 mM 9,10-anthraquinone in 0.1 M TEABF4/propylene carbonate; Millipore Sigma) and sealed in an Ar-filled glovebox (LC Tech).

4.2.2. In Situ Imaging

Fluorescence imaging was performed on a ZEISS Axio Observer wide-field microscope coupled to a BioLogic potentiostat. A custom filter cube (375 nm excitation, 515 nm emission) and an LD Plan-Neofluar 20×/0.4 objective were used. A cathodic current of 60 μA activated the fluorophore, and nine regions of interest (ROI, 1 mm apertures) were imaged with 20 s exposures to minimize photobleaching. For each formulation, three to five electrodes were imaged, resulting in a maximum of 45 images per formulation. The ROIs were selected across the electrode surface to capture representative spatial variation and minimize sampling bias, ensuring that the quantitative fluorescence measurements reflected the overall heterogeneity of the electrode.

4.2.3. ML Framework

The workflow included five sequential steps: (1) feature extraction, (2) data set generation, (3) feature reduction, (4) model training and prediction, and (5) model evaluation. All computations were implemented in Python 3.10 using standard libraries.

4.2.3.1. Feature Extraction

Feature extraction was timed to identify bottlenecks and ensure scalability. Timing data and extracted features were organized by sample and archived for transparency and reproducibility. Features incorporated into the ML models are highlighted in bold in the descriptions that follow. A comprehensive overview of the extracted features with corresponding references and timing results is provided in Supporting Information Tables S1 and S2.

4.2.4. Preprocessing

Grayscale TIFF images were normalized to the range [0, 1], resized to 429 × 563 px via bicubic interpolation, and downconverted to 8 bit depth for uniform spatial resolution. A full-frame binary mask defined the analysis region. Features were standardized to have a mean of zero and a variance of one before modeling.

4.2.5. Global Spatial Statistics

Four whole-image descriptors quantified spatial heterogeneity.

Moran’s I measured spatial autocorrelation across 8-connected neighborhoods

I=Ni,jwij·i,jwij(xi)(xj)i(xi)2

where N is the number of pixels, x i the intensity at pixel i, the mean image intensity, and w ij the spatial weight (1 if pixels i, j are neighbors, 0 otherwise). Values near +1 indicate strong clustering, while values near −1 indicate dispersion.

Shannon entropy quantified grayscale disorder using a 256-bin histogram

H=k=1256pklog2pk

where p k is the normalized probability of intensity level k. Higher H reflects greater grayscale heterogeneity.

Global mean intensity μ and standard deviation σ summarized overall brightness and variability

μ=1Nixi,σ=1Ni(xiμ)2

4.2.6. Patch-wise Variability

To probe mesoscale heterogeneity, images were divided into nonoverlapping 70 × 70 px tiles (22 × 22 μm). For each valid tile, we computed

μt=1ntitxi,σt2=1ntit(xiμt)2,Ht=k=1256pk,tlog2pk,t

where n t is the number of pixels in tile t, and p k,t is the normalized histogram of intensities. Global patch descriptors are averages across all tiles: μpatch=1Ttμt , σpatch2=1Ttσt2 , Hpatch=1TtHt .

4.2.7. Radiomic Texture Descriptors

Second-order and frequency-domain descriptors were extracted using PyFeats:

  • GLCM (Gray-Level Co-occurrence Matrix): For a given offset (Δx, Δy), the co-occurrence matrix entry is

P(i,j)=#{(x,y)|I(x,y)=i,I(x+Δx,y+Δy)=j}

where I(x, y) is the gray level at pixel (x, y). The matrix is normalized such that

i=1Gj=1GP(i,j)=1

where G is the number of gray levels.

Haralick features are derived from P(i, j). For example, the sum average is

fsum_avg=k=22Gkpx+y(k),px+y(k)=i+j=kP(i,j)

Features were computed for four directions (0°, 45°, 90°, 135°) and averaged.

  • GLRLM (Gray-Level Run Length Matrix): The run-length matrix R(i, j) records the number of contiguous runs of gray level i with run length j along a given direction. Formally,

R(i,j)=#{runsofgrayleveliwithlengthj}

here i ∈ {1, ..., G} indexes gray levels and j ∈ {1, ..., R} indexes run lengths, where G is the number of gray levels and R the maximum run length. The total number of runs is

Nr=i=1Gj=1RR(i,j)

From R(i, j), several statistics are defined. , The low gray-level run emphasis (LGRE) emphasizes contributions from low-intensity pixels

LGRE=1Nri=1Gj=1RR(i,j)i2

LGRE attains larger values when runs of darker pixels dominate, irrespective of run length, thus capturing the prevalence of low-intensity regions.

  • LBP (Local Binary Patterns): For radius R and P neighbors,

LBPR,P(xc)=p=0P1s(I(xp)I(xc))2p,s(z)={1,z00,z<0

where x c is the center pixel. Histogram-based statistics, such as energy, were computed.

  • NGTDM, GLDS, SFM: Capturing neighborhood differences, gray-level differences, and submatrix statistics, respectively, each defined per.

  • LTE (Laws’ Texture Energy): Convolution with 7-tap Laws’ kernels (e.g., L5, E5, S5) yields energy maps; the mean energy per filter characterizes primitive textures (edges, spots, waves).

  • Fourier Power Spectrum (FPS): The 2D Fourier transform F(u, v) yields radial energy

P(r)=1|Ω(r)|(u,v)Ω(r)|F(u,v)|2

averaged across annuli Ω­(r), capturing periodicity and frequency content.

4.2.7.1. Data Set Generation

Extracted feature vectors were compiled into a data set indexed by cathode formulation, with discharge capacities from eight protocol steps serving as multivariate targets. Given the relatively small sample size and diversity across electrode types, a stratified split was used to partition the data into training (70%) and validation (30%) subsets while preserving the distribution of cathode formulations. This ensured that performance metrics reflected generalization across all sample types, rather than overfitting to dominant classes.

4.2.7.2. Feature Reduction

To mitigate multicollinearity, features with a Pearson correlation coefficient exceeding |r| > 0.85 were removed based on the upper triangle of the correlation matrix computed from the training set. This reduced the original 62 features to a set of 15 statistically independent predictors. No additional feature selection or engineering was required, as the sparsity-inducing regularization of the learning algorithm automatically pruned redundant or noninformative inputs during training.

4.2.7.3. Model Training and Prediction

Although the data set contained 196 image samples, these represented only six unique cathode formulations, limiting the effective number of independent inputs. After feature reduction, 15 predictors remained, exceeding the number of unique inputs, and preventing standard linear regression methods such as ordinary least-squares (OLS) from reliably finding stable solutions. While ElasticNet regularization reduces the number of active features, fitting separate models for each electrochemical protocol step can lead to overfitting.

To address this, we adopted a multitask learning framework in which discharge capacities from all protocol steps were modeled jointly. By learning multiple related outputs simultaneously using a shared set of predictors, the model captures commonalities across tasks, thereby improving stability and generalization. All features and targets were standardized to zero mean and unit variance; target scaling was reversed after prediction to report performance in the original units.

The multitask elastic net estimates the coefficient matrix WRp×T by solving

argminWi=1nt=1T(yi,tŷi,t)2+λ[αW2,1+(1α)WF2]

where y i,t and ŷi,t are the observed and predicted values for sample i and task t. The term ∥W2,1 is a group-lasso norm promoting shared sparsity across tasks, while ∥W F is the squared Frobenius norm providing ridge regularization. The parameters λ and α control overall regularization strength and the balance between sparsity and shrinkage, respectively.

Hyperparameters λ and α were optimized via grid search using stratified cross-validation to preserve the distribution of cathode formulations across folds. The final model was trained on the whole training set using the best-performing parameters and evaluated on a held-out test set to assess generalization.

4.2.7.4. Model Evaluation

Model performance was assessed using nested 5-fold stratified cross-validation, where inner folds were used for hyperparameter tuning and outer folds evaluated generalization on held-out data. For each output, we report the coefficient of determination

R2=1i=1N(yiŷi)2i=1N(yi)2

the root-mean-square error (RMSE)

RMSE=1Ni=1N(yiŷi)2

and the mean absolute percent error (MAPE)

MAPE=100Ni=1N|yiŷiyi|

These metrics were computed separately per output and then averaged across outer folds.

To provide a summary of overall model accuracy across all targets, predictions, and actual values were concatenated across outputs and evaluated jointly. This approach produces global R 2, RMSE, and MAPE scores that reflect performance across the entire output space, accounting for the relative variance and scale of each target. Per-target and overall results are presented in Supporting Information Table S3.

Supplementary Material

am5c17708_si_001.pdf (2.7MB, pdf)

Acknowledgments

K.N. was supported by the US Department of Education GAANN program, fund #P200A190036. The submitted manuscript has been created in part by UChicago Argonne, LLC, Operator of Argonne National Laboratory (“Argonne”). Argonne, a U.S. Department of Energy Office of Science laboratory, is operated under Contract No. DE-AC02-06CH11357. The U.S. Government retains for itself, and others acting on its behalf, a paid-up nonexclusive, irrevocable worldwide license in said article to reproduce, prepare derivative works, distribute copies to the public, and perform publicly and display publicly, by or on behalf of the Government.

196 TIFF images, parameter metrics, and Python scripts are openly available under an open-source license at: https://github.com/karnegre/efm-ml-framework.git.

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsami.5c17708.

  • Supplementary electrochemical measurements (EIS and rate performance); feature group definitions and computation benchmarks; dimensionality reduction; cross-validation; LOFO scheme details, regression coefficients, and model performance across rates and formulations; supplementary references (PDF)

The authors declare no competing financial interest.

References

  1. Cano Z. P., Banham D., Ye S., Hintennach A., Lu J., Fowler M., Chen Z.. Batteries and fuel cells for emerging electric vehicle markets. Nat. Energy. 2018;3:279–289. doi: 10.1038/s41560-018-0108-1. [DOI] [Google Scholar]
  2. Liu H., Li C., Hu X., Li J., Zhang K., Xie Y., Wu R., Song Z.. Multi-modal framework for battery state of health evaluation using open-source electric vehicle data. Nat. Commun. 2025;16:1137. doi: 10.1038/s41467-025-56485-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Attia P. M., Moch E., Herring P. K.. Challenges and opportunities for high-quality battery production at scale. Nat. Commun. 2025;16:611. doi: 10.1038/s41467-025-55861-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Albertus P., Christensen J., Newman J.. Experiments on and Modeling of Positive Electrodes with Multiple Active Materials for Lithium-Ion Batteries. J. Electrochem. Soc. 2009;156:A606–A618. doi: 10.1149/1.3129656. [DOI] [Google Scholar]
  5. Tao R., Gu Y., Du Z., Lyu X., Li J.. Advanced electrode processing for lithium-ion battery manufacturing. Nat. Rev. Clean Technol. 2025;1:116–131. doi: 10.1038/s44359-024-00018-w. [DOI] [Google Scholar]
  6. Li R., Fan Y., Liu X., Li J., Zhang H., Lin L., Wang Z., Wen B.. Visualizing Electron Transport and Interface Evolution of Li3InCl6-Based Composite Cathode for All-Solid-State Batteries. ACS Energy Lett. 2025;10:2793–2803. doi: 10.1021/acsenergylett.5c00174. [DOI] [Google Scholar]
  7. Berecibar M., Gandiaga I., Villarreal I., Omar N., Van Mierlo J., Van den Bossche P.. Critical review of state of health estimation methods of Li-ion batteries for real applications. Renewable Sustainable Energy Rev. 2016;56:572–587. doi: 10.1016/j.rser.2015.11.042. [DOI] [Google Scholar]
  8. Xiong R., Cao J., Yu Q., He H., Sun F.. Critical Review on the Battery State of Charge Estimation Methods for Electric Vehicles. IEEE Access. 2018;6:1832–1843. doi: 10.1109/ACCESS.2017.2780258. [DOI] [Google Scholar]
  9. Severson K. A., Attia P. M., Jin N., Perkins N., Jiang B., Yang Z., Chen M. H., Aykol M., Herring P. K., Fraggedakis D.. et al. Data-driven prediction of battery cycle life before capacity degradation. Nat. Energy. 2019;4:383–391. doi: 10.1038/s41560-019-0356-8. [DOI] [Google Scholar]
  10. Hu X., Che Y., Lin X., Onori S.. Battery Health Prediction Using Fusion-Based Feature Selection and Machine Learning. IEEE Trans. Transp. Electrif. 2021;7:382–398. doi: 10.1109/TTE.2020.3017090. [DOI] [Google Scholar]
  11. Hasib S. A., Islam S., Chakrabortty R. K., Ryan M. J., Saha D. K., Ahamed M. H., Moyeen S. I., Das S. K., Ali M. F., Islam M. R.. et al. A comprehensive review of available battery datasets, RUL prediction approaches, and advanced battery management. IEEE Access. 2021;9:86166–86193. doi: 10.1109/access.2021.3089032. [DOI] [Google Scholar]
  12. Zhu J., Wang Y., Huang Y., Bhushan Gopaluni R., Cao Y., Heere M., Mühlbauer M. J., Mereacre L., Dai H., Liu X., Senyshyn A., Wei X.. et al. Data-driven capacity estimation of commercial lithium-ion batteries from voltage relaxation. Nat. Commun. 2022;13:2261. doi: 10.1038/s41467-022-29837-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Iftikhar M., Shoaib M., Altaf A., Iqbal F., Villar S. G., Dzul Lopez L. A., Ashraf I.. A deep learning approach to optimize remaining useful life prediction for Li-ion batteries. Sci. Rep. 2024;14:25838. doi: 10.1038/s41598-024-77427-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Thelen A., Huan X., Paulson N., Onori S., Hu Z., Hu C.. Probabilistic machine learning for battery health diagnostics and prognosticsreview and perspectives. npj Mater. Sustain. 2024;2:14. doi: 10.1038/s44296-024-00011-1. [DOI] [Google Scholar]
  15. Zhang Z., Gu X., Zhu Y., Wang T., Gong Y., Shang Y.. Data-driven available capacity estimation of lithium-ion batteries based on fragmented charge capacity. Commun. Eng. 2025;4:32. doi: 10.1038/s44172-025-00372-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Sulzer V., Mohtat P., Aitio A., Lee S., Yeh Y. T., Steinbacher F., Khan M. U., Lee J. W., Siegel J. B., Stefanopoulou A. G., Howey D. A.. The challenge and opportunity of battery lifetime prediction from field data. Joule. 2021;5:1934–1955. doi: 10.1016/j.joule.2021.06.005. [DOI] [Google Scholar]
  17. Gervillié-Mouravieff C., Bao W., Steingart D. A., Meng Y. S.. Non-destructive characterization techniques for battery performance and life-cycle assessment. Nat. Rev. Electr. Eng. 2024;1:547–558. doi: 10.1038/s44287-024-00069-y. [DOI] [Google Scholar]
  18. Nelson, P. A. ; Ahmed, S. ; Gallagher, K. G. ; Dees, D. W. . Modeling the performance and cost of lithium-ion batteries for electric-drive vehicles; Argonne National Laboratory, 2019. [Google Scholar]
  19. Zhang Y., Tang Q., Zhang Y., Wang J., Stimming U., Lee A. A.. Identifying degradation patterns of lithium ion batteries from impedance spectroscopy using machine learning. Nat. Commun. 2020;11:1706. doi: 10.1038/s41467-020-15235-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Jones P. K., Stimming U., Lee A. A.. Impedance-based forecasting of lithium-ion battery performance amid uneven usage. Nat. Commun. 2022;13:4806. doi: 10.1038/s41467-022-32422-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Li, Y. ; Maleki, M. ; Banitaana, S. ; Samadib, M. F. In Data-Driven Lithium-ion Battery Degradation Estimation Using Electrochemical Impedance Spectroscopy and Throughput, 2023 2nd International Engineering Conference on Electrical, Energy, and Artificial Intelligence (EICEEAI); IEEE; pp 1–8, 2023. [Google Scholar]
  22. Liu Y., Li Q., Wang K.. Revealing the degradation patterns of lithium-ion batteries from impedance spectroscopy using variational auto-encoders. Energy Storage Mater. 2024;69:103394. doi: 10.1016/j.ensm.2024.103394. [DOI] [Google Scholar]
  23. Negrete K., Tang M. H.. Visualizing and Quantifying Electronic Accessibility in Composite Battery Electrodes Using Electrochemical Fluorescent Microscopy. J. Electrochem. Soc. 2024;171:100512. doi: 10.1149/1945-7111/ad81b6. [DOI] [Google Scholar]
  24. Saraka R. M., Morelly S. L., Tang M. H., Alvarez N. J.. Correlating Processing Conditions to Short- and Long-Range Order in Coating and Drying Lithium-Ion Batteries. ACS Appl. Energy Mater. 2020;3:11681–11689. doi: 10.1021/acsaem.0c01305. [DOI] [Google Scholar]
  25. Morelly S. L., Alvarez N. J., Tang M. H.. Short-Range Contacts Govern the Performance of Industry-Relevant Battery Cathodes. J. Power Sources. 2018;387:49–56. doi: 10.1016/j.jpowsour.2018.03.039. [DOI] [Google Scholar]
  26. Timcal America, Inc . Super C45 Carbon Black Technical Data Sheet; Timcal, 2023. [Google Scholar]
  27. Cabot Corporation LITX 200 . Conductive Carbon Black Product Information; Cabot Corporation, 2024. [Google Scholar]
  28. Mauler L., Duffner F., Leker J.. Economies of scale in battery cell manufacturing: The impact of material and process innovations. Appl. Energy. 2021;286:116499. doi: 10.1016/j.apenergy.2021.116499. [DOI] [Google Scholar]
  29. Orangi S., Manjong N., Clos D. P., Usai L., Burheim O. S., Strømman A. H.. Historical and prospective lithium-ion battery cost trajectories from a bottom-up production modeling perspective. J. Energy Storage. 2024;76:109800. doi: 10.1016/j.est.2023.109800. [DOI] [Google Scholar]
  30. Van der Walt S., Schönberger J. L., Nunez-Iglesias J., Boulogne F., Warner J. D., Yager N., Gouillart E., Yu T.. scikit-image: image processing in Python. PeerJ. 2014;2:e453. doi: 10.7717/peerj.453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Giakoumoglou, N. PyFeats: Open-source software for image feature extraction, 2021. https://github.com/giakou4/pyfeats (accessed May 26, 2025).
  32. Contributors, O. OpenCV Documentation. 2025. https://docs.opencv.org/3.4/ (accessed Feb 26, 2025).
  33. Moran P. A. P.. Notes on Continuous Stochastic Phenomena. Biometrika. 1950;37:17–23. doi: 10.2307/2332142. [DOI] [PubMed] [Google Scholar]
  34. Shannon C. E.. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948;27:379–423. doi: 10.1002/j.1538-7305.1948.tb01338.x. [DOI] [Google Scholar]
  35. Haralick R. M., Shanmugam K., Dinstein I.. Textural Features for Image Classification. IEEE Trans. Syst. Man. Cybern. 1973;SMC-3:610–621. doi: 10.1109/TSMC.1973.4309314. [DOI] [Google Scholar]
  36. Dasarathy B. V., Holder E. B.. Image characterizations based on joint gray levelrun length distributions. Pattern Recognit. Lett. 1991;12:497–502. doi: 10.1016/0167-8655(91)80014-2. [DOI] [Google Scholar]
  37. Zhang H., Hung C. L., Min G., Guo J. P., Liu M., Hu X.. GPU-Accelerated GLRLM Algorithm for Feature Extraction of MRI. Sci. Rep. 2019;9:10883. doi: 10.1038/s41598-019-46622-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Yuan H., Paskov I., Paskov H., González A. J., Leslie C. S.. Multitask learning improves prediction of cancer drug sensitivity. Sci. Rep. 2016;6:31619. doi: 10.1038/srep31619. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

am5c17708_si_001.pdf (2.7MB, pdf)

Data Availability Statement

196 TIFF images, parameter metrics, and Python scripts are openly available under an open-source license at: https://github.com/karnegre/efm-ml-framework.git.


Articles from ACS Applied Materials & Interfaces are provided here courtesy of American Chemical Society

RESOURCES