Skip to main content
Open Research Europe logoLink to Open Research Europe
. 2023 Feb 22;2:96. Originally published 2022 Aug 12. [Version 2] doi: 10.12688/openreseurope.14745.2

Calendar ageing modelling using machine learning: an experimental investigation on lithium ion battery chemistries

Burak Celen 1, Melik Bugra Ozcelik 2, Furkan Metin Turgut 2,a, Cisel Aras 2, Thyagesh Sivaraman 3, Yash Kotak 4, Christian Geisbauer 4, Hans-Georg Schweiger 4
PMCID: PMC10446031  PMID: 37645330

Version Changes

Revised. Amendments from Version 1

- For XGBoost and battery ageing models, more recent references added (8,10,16) - All variables used in the equations made italic and tried to be explained in a nomenclature section at the beginning - Figure 1 deleted - Figure 6 (before Fig7) now has point on the plot lines to show the data sample points - For Figure 1 (before Fig2) “%” sign added for SOC percentage values - In Figures 3,4,5, “%” sign added for all values - For the purpose of giving more details regarding the methodology, the used Python APIs and methodologies have been mentioned

Abstract

Background: The phenomenon of calendar ageing continues to have an impact on battery systems worldwide by causing them to have undesirable operation life and performance. Predicting the degradation in the capacity can identify whether this phenomenon is occurring for a cell and pave the way for placing mechanisms that can circumvent this behaviour.

Methods: In this study, the machine learning algorithms, Extreme Gradient Boosting (XGBoost) and artificial neural network (ANN) have been used to predict the calendar ageing data belonging to six types of cell chemistries namely, Lithium Cobalt Oxide, Lithium Iron Phosphate, Lithium Manganese Oxide, Lithium Titanium Oxide, Nickle Cobalt Aluminum Oxide and Nickle Manganese Cobalt Oxide.

Results: Prediction results with overall Mean Absolute Percentage Error of 0.0126 have been obtained for XGBoost algorithm. Among these results, Nickle Cobalt Aluminum Oxide and Nickle Manganese Cobalt Oxide type cell chemistries stand out with their mean absolute percentage errors of 0.0035 and 0.0057 respectively. Also, algorithm fitting performance is relatively better for these chemistries at 100% state of charge and 60°C temperature compared to ANN results. ANN algorithm predicts with mean absolute error of approximately 0.0472 overall and 0.0238 and 0.03825 for Nickle Cobalt Aluminum Oxide and Nickle Manganese Cobalt Oxide. The fitting performance of ANN for Nickle Manganese Cobalt Oxide at 100% state of charge and 60°C temperature is especially poor compared to XGBoost.

Conclusions: For an electric vehicle battery calendar ageing prediction application, XGBoost can establish itself as the primary choice more easily compared to ANN. The reason is XGBoost’s error rates and fitting performance are more usable for such application especially for Nickel Cobalt Aluminum Oxide and Nickel Manganese Cobalt Oxide chemistries, which are amongst the most demanded cell chemistries for electric vehicle battery packs.

Keywords: lithium-ion batteries, calendar ageing, artificial neural network, machine learning, XGBoost

Nomenclature

Abbreviation Definition
N Number of inputs
w Weights in between the neurons
x Inputs forwarded to the next neuron
i Enumeration of inputs
j Next layer’s enumeration of neurons
o Output of the respected or activated neuron
Q Number of neurons
y Actual output
E Error
α Learning rate

Introduction

Lithium-ion (Li-ion) batteries are becoming the most promising key component for the vision of a sustainable future with the rise of e-mobility. It has earned this well-deserved reputation due to its high power and energy density and long cycle life 1 . Although their proven success, as any other system does, Li-ion batteries also suffer from degradations over time because of different ageing phenomenons such as cyclic and calendar ageing. While both ageing mechanisms have their similarities in terms of the resulting electrochemical reaction, which is, simply put, the growth of solid electrolyte interface on the anode particle of battery 2 , calendar ageing occurs during storage 3 while the cyclic ageing occurs during the deintercalation and intercalation of lithium due to the operation (charging-discharging) of the battery 4 . In the end, both ageing mechanisms will result in decreased capacity 3 and increased resistance 5 . In this paper, the focus will be on the capacity degradation due to the calendar ageing.

As mentioned, the calendar ageing of a battery occurs during storage, but the magnitude of the phenomenon is determined by three main factors: the rest state of charge (SOC), the rest temperature and the duration of the rest time of the Li-ion battery 6 .

The recent studies conducted by Geisbauer et al., proved that despite Li-ion technology offering a wide range of chemistries, such as Nickle Manganese Cobalt Oxide (NMC), Nickle Cobalt Aluminum Oxide (NCA), Lithium Titanium Oxide (LTO), Lithium Cobalt Oxide (LCO) or Lithium Iron Phosphate (LFP), none of them were able to avoid the calendar ageing 7 . The study also clearly demonstrated high temperatures lead to even higher degradations 7 . These factors are important because, e.g. if the application is an electric vehicle, the vehicle will spend most of its lifetime parked and this period of time may include high rest temperatures.

In order to implement functionalities that prolong the lifetime of the battery, the ageing of the Li-ion cells should be accurately estimated. There are physical, empirical and semi-empirical modeling solutions that can describe the behaviour of cyclic and calendar ageing. However, the physical models require a great deal of physical-chemical knowledge, really complex modeling and computational effort 6, 8 . In the meantime, less complex semi-empirical models require lesser physical-chemical knowledge but are not as accurate compared to the physical model 6, 8, 9 . A method that can minimize the computational effort, eliminates the need for physical-chemical knowledge for different cell chemistries but is still as accurate as an electrochemical (physical) model was lacking. However, a solution to this problem has been proposed by many authors in the literature 5, 6, 10 . The overall solution is to employ advanced machine learning algorithms that can accurately estimate calendar ageing. Therefore, this paper considers such algorithms as the main methodology of this research. It also covers the missing aspect of market and literature that is the comparison of machine learning algorithms accuracy for a wide spectrum of chemistries such as NMC, NCA, LTO, LMO, LCO and LFP. The reasons to study Li-ion chemistries in this paper are, (a) due to the significant portion of the current energy storage system applications are based on Li-ion technology, and (b) the study aligns with the previous research of 7. Thus, this study can take full advantage of a calendar ageing dataset of a wide range of Li-ion chemistries. This will enable the validation of algorithms on a much wider sense.

Calendar ageing with the cyclic ageing comprises the complete ageing phenomenon for Li-ion cells, and validation of whether the estimation of the calendar ageing is accurate is highly crucial since the phenomenon does not only have an impact on state of health estimation but also on the SOC estimation as well. The inaccurate estimations of the ageing of the battery will result in inaccurate SOC estimations (e.g limit breaches at full capacity during the charge of the battery due to faulty SOC estimation can occur. SOC estimation algorithm can estimate the accumulated charge more than the actual accumulated charge if the actual usable capacity of the cell is lower due to the ageing. This scenario can easily be realized with an ageing estimation model with low accuracy that underestimates the capacity decrease of the battery). The battery system’s current limitation estimations will be affected by these wrong SOC estimations due to the remaining charge being an important input in defining the current limits in both charge and discharge directions 11 .

Consequently, this chain of wrong estimations along with the mentioned non-accurate current limitations will speed up the ageing effects of the batteries 12 . The resulting impractical system may easily diminish battery operation lifetime and increase the safety risks 13 . Therefore, the variety and the size of the validation data and the variety and comparison of the employed coupled modelling activities play an important role in this paper and help to prove that the chosen methods would yield highly accurate results. Another benefit of this work is that it can shed light on future system implementations that don’t exclude the phenomenon of calendar ageing. Usable accuracy results drawn from this work can realize systems that take into account the other half of the medallion in terms of ageing, without the time-consuming parameterization or characterization test activities (this half of the medallion being the calendar and the other one being the cyclic ageing which is being implemented in various systems).

The coupled modelling activities that will be demonstrated and will be used for the aim of predicting the calendar ageing via capacity degradation of the cells by using some of the early test data are Extreme Gradient Boosting (XGBoost) and Artificial Neural Network (ANN). XGBoost is a decision tree based machine learning algorithm, while the ANN is an artificial adaptive system that uses its base elements, called neurons and connections, to transform its global inputs into a predicted output 14 . The motivation behind the selection of these algorithms is that both methods are known for their ability to yield reliable results 6, 15, 16 as well as, by employing these algorithms, there would be a means of comparing the prediction performances of machine learning algorithms on different chemistries.

Methods

Ageing setup, data acquisition and processing

The complete ageing procedure of the different cell chemistries was performed at Technische Hochschule Ingolstadt (THI). Mainly, the cells were calendar aged in temperature chambers (Vötsch VT 4011 and Vötsch VT 4021) at raised temperatures of 50°C, 60°C and 70°C and high, medium and low voltages. The detailed storage conditions are described in previous research 7 . Before any fast ageing procedure was employed, the cells were validated for their functionality with a capacity check and pulse profile to validate the internal resistance. The cylindrical cells were mounted onto self-constructed wooden shelves to maximize the number of cells per climate chamber. Check-ups, during which the cells were taken out, measured for capacity and internal resistance after a fixed relaxation time, were established in regular intervals for the different storage conditions. Before returning the cells into the climate chambers, they were recharged to their dedicated storage SOC as described in the 7.

The instrumentation equipment (Neware BTS 4008-5V6A cell tester) had a measurement accuracy of ±0.05% 7 . This means that at this level of accuracy, the instrument can induce a capacity measurement error of 10,925 A.s (ampere-second) and subsequently a small relative error of 0.121388889. Because of this reason, in this work, the prediction results from the machine learning algorithms are only trying to fit to the measurement data which may incorporate these small accuracy errors. The performances of the algorithms are also validated according to this measurement data with significantly small accuracy errors.

For a more detailed description of the data acquisition, please refer to 7. After the ageing was finished, the evaluated data 17 was prepared for the modelling to be performed at AVL, where also post-processing methods were applied to make the datasets more usable for the machine learning algorithms.

Extreme Gradient Boosting (XGBoost)

Extreme Gradient Boosting (XGBoost) is a decision tree based, state-of-the-art supervised machine learning algorithm that is widely used in regression or classification problems. It proved its success by being the main solution technique of many machine learning competitions and problems in the industry 18 . A decision tree consists of nodes, branches, and leaves. The algorithm starts with the root node and splits the data by branches according to the condition in it. Thereafter, it checks for the condition in the next node or makes the final decision in the leaves 19 .

Gradient boosting algorithms are also based on decision trees. By gradient boosting methods, some weak learners - e.g. decision trees - can be transformed into strong learners. For regression tasks, a loss function is calculated for the results in the leaves, and the tree is reconstructed by calculating residuals 20 . Therefore, the results are improved in every step of learning.

One of the main superiorities of XGBoost is its capability of being able to automatically handle null values in the dataset. In conventional machine learning tasks, data scientists must clear the null data before starting to build their learning algorithm. However, XGBoost can work on a dataset with null values and reduce the effort for data pre-processing to a great extent 18 .

Also, XGBoost uses a greedy algorithm, weighted quantile sketch for finding the best learning score. It splits the data into instances like the quantile sketch algorithm. The improvement of XGBoost is that every portion of data is weighted 18 .

Furthermore, XGBoost uses parallel computing with storing data in “column blocks”. These blocks are stored in the cache owing to its “cache-aware” system. The efficiency of this method depends on the size of blocks 18 .

For the calendar ageing problem, the Python Application Programming Interface (API) xgboost’s XGBRegressor methodology has been used in an implementation to predict the capacity values. Figure 1 demonstrates the decision tree generated by the XGBoost algorithm.

Figure 1. Calendar ageing XGBoost implementation decision tree.

Figure 1.

It has been demonstrated in earlier studies that it is possible to deploy XGBoost models to embedded processors 21 . This ability ensures the usage of this method for practical scenarios as well. Embedded control systems can take advantage of this ability to actualize the XGBoost implementations for a wider range of real-life problems.

Artificial neural networks

In this work, artificial neural network (ANN) architecture is another method to model the calendar ageing in the battery cells. In ANNs, the modelling is done via adaptation of the individual weights based on the partial derivative of the error between the simulated output and desired output. At each cycle, the weights are adjusted to minimize the mentioned error.

In the end, the goal of this architecture is to diminish the error levels by estimating the function that will result with the desired output by any given input 22 . In this study, the model is something rather conventional that maps a set of inputs to an output. The model’s architecture has five inputs that are listed as important features, five hidden layers with the sequence of 9,9,9,5,5 neurons and lastly, the output layer is predicting the capacity of the given cell considering the calendar ageing as a function of storage conditions like SOC and temperature, exposure duration in days to those conditions, specification of the Li-ion chemistry and brand-new capacity information of the cell. The described structure can be observed in Figure 2.

Figure 2. Implemented artificial neural network (ANN) structure.

Figure 2.

For the model, the results of each neuron can be mathematically described with the summation of the respected weights and inputs at the end activated with a special function known as Rectified Linear Unit (ReLU). This gives an output of 0 while the given input is smaller than zero (f(x) = 0 if x < 0) and output of itself while the input is either equal to zero or higher (f(x) = x if x => 0).

oj=f(i=1Nwi,jxi)(1)

In Equation 1, N is the number of inputs, w is the weights in between the neurons, x is the inputs forwarded to the next neuron. Here, i represents the enumeration of inputs, j represents the next layer’s enumeration of neurons. Finally, the output of the respected neuron is denoted as o.

E=k=1Qokyk(2)

Updating the weights used in the equation or in other terms, estimating the function, is done via a technique called gradient descent. During the training process, the idea is to find the error equation described in Equation 2. Here, Q is the number of neurons, o is the activated neuron output, y is the actual output. The error is a function of weights, taking the partial derivative of the error equation with respect to weights and finally multiplying with an entity known as the learning rate α results in the update of weights as described in Equation 3.

wi,j=αEwi,j(3)

During the training and verification steps, various configurations of hidden layers, neurons, learning rates and learning strategies are employed. Among them, normalizing the inputs around 0 to 1, setting hidden layers to 5 with 9,9,9,5,5 neurons and setting the learning rate ( α) to be constant 0.015 yielded the desirable outputs in the end. The aforementioned parameter set has yielded the mentioned desirable outputs in an implementation that uses the Python API sklearn and specific methodology, the MLPRegressor. Later on, when needed, these results and studies can be used as a baseline to generate a system for embedded boards on battery management systems 23 .

Evaluation of results

In order to evaluate the performances of the algorithms, an evaluation metric has to be chosen. In this context, mean absolute error (MAE) along with its percentile variation mean absolute percentage error (MAPE) have been decided as the key performance indicators to benchmark two algorithms. MAE and MAPE are metrics that usually find applications to themselves in the machine learning field 24 . Both MAE and MAPE determine the average magnitude of errors between predicted and measured values 24 . The only difference is MAE is not unitfree which makes comparisons somewhat difficult, while MAPE doesn’t have this problem since it calculates based on the percentage error 24 . This makes MAPE a better choice in the machine learning field to explain the relative errors 24 . For both metrics, the same rule applies; the smaller the MAE/MAPE value, the higher the accuracy of prediction. The only disadvantage is, even though determining the magnitude of error gives an important insight, MAE or MAPE cannot provide the direction of error 24 . The basic formulas for MAE and MAPE can be seen in equations Equation 4 and Equation 5.

MAE=1n1n|DpreDact|(4)
MAPE=100%ni=1n|DpreDactDact|(5)

Here, D pre represents the predicted value and D act is the measured value. Meanwhile, n represents the number of result points.

Results and discussion

As mentioned in the introduction and methods, the algorithms have not been preferred just for achieving satisfactory results, but also because of the prospect of benchmarking and performance comparisons. For this reason, the results of both algorithms (XGBoost and ANN) were obtained for each cell chemistry in terms of MAPE and plotted as a bar graph in Figure 3 for clear inspection. Also, this graph has been supported with the corresponding ampere-hour (Ah) values to the MAPE values with Table 1. The first noticeable point from Figure 3 and Table 1 is that ANN is significantly underperforming for the prediction of capacity degradations due to the calendar ageing compared to the performance of XGBoost. Another point from this graph is the prediction performance of algorithms for cobalt included chemistries such as NMC, NCA and LCO, seems significantly better compared to their performances on the cobalt-free chemistries.

Figure 3. Bar graph of mean absolute percentage errors (MAPE) of Lithium Cobalt Oxide (LCO), Lithium Iron Phosphate (LFP), Lithium Manganese Oxide (LMO), Lithium Titanium Oxide (LTO), Nickle Cobalt Aluminum Oxide (NCA) and Nickle Manganese Cobalt Oxide (NMC) cell chemistries resulting from the artificial neural network (ANN) and Extreme Gradient Boosting (XGBoost) algorithms.

Figure 3.

Table 1. Mean absolute error (MAE) values of Lithium Cobalt Oxide (LCO), Lithium Iron Phosphate (LFP), Lithium Manganese Oxide (LMO), Lithium Titanium Oxide (LTO), Nickle Cobalt Aluminum Oxide (NCA) and Nickle Manganese Cobalt Oxide (NMC) cell chemistries resulting from the artificial neural network (ANN) and Extreme Gradient Boosting (XGBoost) algorithms (in terms of ampere-hour).

Algorithms LCO LFP LMO LTO NCA NMC Total
ANN 0.067567 0.076977 0.139979 0.076037 0.059225 0.092450 0.085372
XGB 0.018649 0.020621 0.028154 0.03472 0.008248 0.013664 0.020676

The superior performance of XGBoost on the chemistries that are dominating the automotive industry like NCA, NMC and LFP is another result that is highly convenient. NCA, NMC and LFP are comprising the most significant portion of the electric vehicle battery development projects 25 , and this prediction performance brings forth the XGBoost’s feasibility for the automotive batteries’ calendar ageing prediction applications, combined with these algorithms’ prospects of deployment on embedded processors as mentioned in the methods.

In Figure 4, more detailed prediction results of ANN have been presented. The graph demonstrates ANN’s performance at different SOC and temperature combinations. Also, as done with the only-chemistry-focused results, Figure 4 has been supported with the corresponding Ah values to the MAPE values with Table 2. The same kind of visualization and details for XGBoost can be seen in Figure 5 and Table 3.

Figure 4. Bar graph of artificial neural network (ANN) mean absolute percentage errors (MAPE) of Lithium Cobalt Oxide (LCO), Lithium Iron Phosphate (LFP), Lithium Manganese Oxide (LMO), Lithium Titanium Oxide (LTO), Nickle Cobalt Aluminum Oxide (NCA) and Nickle Manganese Cobalt Oxide (NMC) cell chemistries at different state of charge (SOC) and temperature combination.

Figure 4.

Figure 5. Bar graph of extreme gradient boosting (XGBoost) mean absolute percentage errors (MAPE) of Lithium Cobalt Oxide (LCO), Lithium Iron Phosphate (LFP), Lithium Manganese Oxide (LMO), Lithium Titanium Oxide (LTO), Nickle Cobalt Aluminum Oxide (NCA) and Nickle Manganese Cobalt Oxide (NMC) cell chemistries at different state of charge (SOC) and temperature combination.

Figure 5.

Table 2. Artificial neural network (ANN) mean absolute error values (MAE) of Lithium Cobalt Oxide (LCO), Lithium Iron Phosphate (LFP), Lithium Manganese Oxide (LMO), Lithium Titanium Oxide (LTO), Nickle Cobalt Aluminum Oxide (NCA) and Nickle Manganese Cobalt Oxide (NMC) cell chemistries at different state of charge (SOC) and temperature combinations (in terms of ampere-hour).

SOC% & T LCO LFP LMO LTO NCA NMC
SOC100% & 60°C 0.081378 0.101446 0.145732 0.06983 0.033591 0.106814
SOC50% & 60°C 0.057263 0.079261 0.201832 0.128171 0.083061 0.100742
SOC50% & 50°C - - 0.112657 0.115104 - -
SOC20% & 60°C 0.045575 0.043559 0.072332 0.054119 0.061023 0.063603
SOC20% & 50°C - - 0.131381 0.042448 - -

Table 3. Extreme Gradient Boosting (XGB) mean absolute error (MAE) values of Lithium Cobalt Oxide (LCO), Lithium Iron Phosphate (LFP), Lithium Manganese Oxide (LMO), Lithium Titanium Oxide (LTO), Nickle Cobalt Aluminum Oxide (NCA) and Nickle Manganese Cobalt Oxide (NMC) cell chemistries at different state of charge (SOC) and temperature combinations (in terms of ampere-hour).

SOC% & T LCO LFP LMO LTO NCA NMC
SOC100% & 60°C 0.031725 0.025438 0.029895 0.025872 0.014391 0.024037
SOC50% & 60°C 0.004605 0.014662 0.021757 0.042077 0.006978 0.008930
SOC50% & 50°C - - 0.015376 0.002688 - -
SOC20% & 60°C 0.007829 0.019535 0.016149 0.042144 0.003373 0.007844
SOC20% & 50°C - - 0.055854 0.04091 - -

From Figure 4 and Figure 5 it can be seen that, generally, the error increases when algorithms attempt to predict the capacity degradation at 100% SOC or 50°C temperature. The 50°C temperature behaviour may be explained as the majority of the data points are from 60°C conditioning, thus the algorithms expect more drastic capacity drops from their training as occurs in 60°C. However, the reason behind the relatively high errors at 100% SOC is most likely related to the calendar ageing phenomenon’s behaviour itself. The deliberate decision of training the algorithms with all SOC regions available in the data led to the expectation of mild capacity drops under all conditions in them. ANN has its exceptions to this behaviour with its results on LMO, LTO and NCA while XGBoost only shows this exception for LTO.

Another exception to the general behaviour is LTO chemistry results, where the largest error can be seen at 50% SOC and 60°C temperature. In the experiments conducted by THI, this behaviour of the algorithms was explained 7 . From Figure 4 in 7, it can be seen that LTO is the only chemistry that is subject to a larger capacity degradation at mid-SOC compared to its degradation at the high-SOC level. So ANN’s and XGBoost’s outputs on this are most likely to be more related to chemistry’s calendar ageing behaviour and less dependent on the data sample points or training-test splits.

In Figure 6, the continuous time plots of the measured and predicted capacity of cells at 100% SOC and 60°C temperature in terms of passed days plotted and the performances of algorithms in terms of prediction accuracy, can be observed. XGBoost mostly yields outputs that fit the measured data perfectly. Meanwhile, ANN always predicts smaller capacity outputs compared to the cell beginning-of-life (as it should due to the calendar ageing) as the days pass. However, for some chemistries, it may end up with an offset for estimating the degradation, like the results on LMO. Nevertheless, the predicted rate of degradation is the same with the measurement. For LCO and NMC, a pattern of under-estimation for the capacity degradation can be clearly seen. The results fit better at the later points for LFP, LTO and NCA chemistries.

Figure 6. Exemplary continuous time plots of Lithium Cobalt Oxide (LCO), Lithium Iron Phosphate (LFP), Lithium Manganese Oxide (LMO), Lithium Titanium Oxide (LTO), Nickle Cobalt Aluminum Oxide (NCA) and Nickle Manganese Cobalt Oxide (NMC) cell chemistries at 100% state of charge (SOC) and 60°C.

Figure 6.

While the accuracy of algorithms is an important aspect for an application, the performances in terms of elapsed time for predictions play a huge role as well. Table 4 represents time required for each algorithm to produce the results. From Table 4 it can be seen that ANN can make predictions several times faster than the XGBoost for the whole validation set. However, when an application or the calendar ageing sampling practices are considered, this difference may not have an impact. If the application would be a calendar ageing prediction of an automotive battery pack, executing a function that updates the value of the capacity in 0.002 seconds, 0.0229 seconds (times required to predict the whole validation set on a PC processor) or any other value would not be significant. The reason is probably that this kind of function would have a call time interval in terms of days to write a meaningful capacity change due to the behavior of the phenomenon and when it does get called, how much time is required to update the value and accuracy/computing time trade-off won’t be as significant as is in a SOC estimation application.

Table 4. Prediction performances of artificial neural network (ANN) and Extreme Gradient Boosting (XGBoost) algorithms in terms of elapsed time in seconds (whole validation set, PC processor).

Algorithms Training Prediction
ANN 0.259736 0.002041
XGB 0.563075 0.022953

The periodic update of the SOC may be required for determining accurate current limits and various other tasks, but periodic updating of the capacity drop due to the calendar ageing won’t be desired during the vehicle operation where embedded software tasks usually run in milli-second intervals. The generated overhead won’t be rewarded with meaningful changes in the capacity. In this work, the elapsed time for predictions is not known for an embedded processor and it is known that the computational efficiency challenges for machine learning applications on embedded systems stand as an obstacle for a wider range adoption of these applications. However, these challenges may be overcome with function calls during the long vehicle park time durations, like nighttime period for an automotive battery pack calendar ageing prediction application, when all application-specific points are considered.

Conclusion

In this study, the calendar ageing data of six different cell chemistries have been used to train and validate two different machine learning algorithms. One of the expected challenges was XGBoost’s possible overfitting due to the datasets’ length limitation. However, the results demonstrated that the XGBoost algorithm can be used to effectively predict the calendar ageing of most chemistries with significantly minimal mean absolute error. Meanwhile, ANN produces relatively better results only for LFP, LCO, NCA and NMC than other chemistries when analysed in the context of its own results.

The ability to deploy these models on control systems combined with XGBoost’s satisfying performance on chemistries preferred heavily for the automotive industry (NCA, NMC and LFP), shows that the XGBoost algorithm can be incorporated into the electric vehicle battery application softwares to successfully predict the calendar ageing effects and provide better operation life to electric vehicle batteries. While the same satisfactory use case applies to ANN as well, the prediction performance of XGBoost is far more superior and thus it can establish itself more easily as the first choice. The challenges that stand ahead for these kinds of applications could be the reliance on a large amount of data and computing times 26 , but machine learning based on big data and cloud computing can be the solution to this particular challenge 26 without sacrificing from application specifications mentioned in the previous section.

The models developed in this study are used to predict the capacity degradation resulting from calendar ageing. However, this is only one of the indicators of the effect of calendar ageing as mentioned earlier in this paper. As future work, the other indicator of the calendar ageing in batteries, the resistance increase, can be modeled to be the output of the algorithms. This further work can provide another standpoint to the issue where it can be seen whether or not machine learning algorithms can estimate the calendar ageing via resistance increase with satisfactory performance.

The further work suggestions can be multiplied with some issues mentioned in the results and discussion section, such as the improved and stable correlation between inputs and outputs of the machine learning algorithms for the cobalt-included cell chemistries. This topic should be investigated more since the results on why the usage of cobalt included chemistries is more feasible for calendar ageing prediction via machine learning, will probably be highly interesting and enlightening for many applications. Also, results on the different SOC and temperature combinations discussed based on Figure 4 and Figure 5 can be further investigated. Specifically LTO cells can be researched with a dataset comprising more data points. Current claims are based on educated guesses and quick analyses which revolve around the ideas of data limitations or chemistry specific behaviours based on previous work conducted by 7 and available outputs from the scripts. Further investigations can validate these claims or uncover different root causes.

Lastly, it is worth noting that the results for ANN would be much more promising with denser data, while XGBoost seems to be not affected much by this limitation.

Funding Statement

This research was financially supported by the European Union’s Horizon 2020 research and innovation programme under the grant agreement No H2020-EU.3.4, 875568 (CObalt-free Batteries for FutuRe Automotive Applications [COBRA]) and by EFRE-supported EU research network project SENSE BAY (grant number EU-1802-0005). This report reflects only the author’s view. The European Commission and the Innovation and Networks Executive Agency (INEA) are not responsible for any use that may be made of the information it contains.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 2; peer review: 2 approved]

Data availability

Underlying data

Zenodo: Lithium-ion battery calendar ageing data - discharge and charge capacity, days passed, temperature and SOC. https://doi.org/10.5281/zenodo.6546753 17

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

References

  • 1. Asef P, Milan M, Lapthorn A, et al. : Future trends and aging analysis of battery energy storage systems for electric vehicles. Sustainability. 2021;13(24):13779. 10.3390/su132413779 [DOI] [Google Scholar]
  • 2. Kupper C, Weißhar B, Rißmann S, et al. : End-of-life prediction of a lithium-ion battery cell based on mechanistic aging models of the graphite electrode. J Electrochem Soc. 2018;165(14):A3468–A3480. 10.1149/2.0941814jes [DOI] [Google Scholar]
  • 3. Liu K, Li Y, Hu X, et al. : Gaussian process regression with automatic relevance determination kernel for calendar aging prediction of lithium-ion batteries. IEEE Trans Industr Inform. 2020;16(6):3767–3777. 10.1109/TII.2019.2941747 [DOI] [Google Scholar]
  • 4. Gantenbein S, Schönleber M, Weiss M, et al. : Capacity fade in lithium-ion batteries and cyclic aging over various state-of-charge ranges. Sustainability. 2019;11(23):6697. 10.3390/su11236697 [DOI] [Google Scholar]
  • 5. Azkue M, Lucu M, Martinez-Laserna E, et al. : Calendar ageing model for li-ion batteries using transfer learning methods. World Electr Veh J. 2021;12(3):145. 10.3390/wevj12030145 [DOI] [Google Scholar]
  • 6. Hosen MS, Jaguemont J, Van Mierlo J, et al. : Battery lifetime prediction and performance assessment of different modeling approaches. iScience. 2021;24(2):102060. 10.1016/j.isci.2021.102060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Geisbauer C, Wöhrl K, Koch D, et al. : Comparative study on the calendar aging behavior of six different lithium-ion cell chemistries in terms of parameter variation. Energies. 2021;14(11):3358. 10.3390/en14113358 [DOI] [Google Scholar]
  • 8. Liang Y, Emadi A, Gross O, et al. : A comparative study between physics, electrical and data driven lithium-ion battery voltage modeling approaches. SAE Technical Paper, 1:0700. 2022. 10.4271/2022-01-0700 [DOI] [Google Scholar]
  • 9. Petit M, Prada E, Sauvant-Moynot V: Development of an empirical aging model for Li-ion batteries and application to assess the impact of Vehicle-to-Grid strategies on battery lifetime. Appl Energy. 2016;172:398–407. 10.1016/j.apenergy.2016.03.119 [DOI] [Google Scholar]
  • 10. Schofer K, Laufer F, Stadler J, et al. : Machine Learning-Based Lifetime Prediction of Lithium-Ion Cells. Adv Sci (Weinh). 2022;9(29):2200630. 10.1002/advs.202200630 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Gan L, Yang F, Shi YF, et al. : Lithium-ion battery state of function estimation based on fuzzy logic algorithm with associated variables. IOP Conf Ser Earth Environ Sci. 2017;94:012133. 10.1088/1755-1315/94/1/012133 [DOI] [Google Scholar]
  • 12. Barcellona S, Piegari L: Effect of current on cycle aging of lithium ion batteries. J Energy Storage. 2020;29:101310. 10.1016/j.est.2020.101310 [DOI] [Google Scholar]
  • 13. Geisbauer C, Wöhrl K, Mittmann C, et al. : Review of safety aspects of calendar aged lithium ion batteries. J Electrochem Soc. 2020;167(9):090523. 10.1149/1945-7111/ab89bf [DOI] [Google Scholar]
  • 14. Grossi E, Buscema M: Introduction to artificial neural networks. Eur J Gastroenterol Hepatol. 2007;19(12):1046–54. 10.1097/MEG.0b013e3282f198a0 [DOI] [PubMed] [Google Scholar]
  • 15. Zhu J, Huang Y, Knapp M, et al. : Data-driven lithium-ion battery capacity estimation from voltage relaxation.2021. 10.21203/rs.3.rs-770709/v1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. NaitMalek Y, Najib M, Bakhouya M, et al. : On the use of machine learning for state-of-charge forecasting in electric vehicles. 2019;408–413. 10.1109/ISC246665.2019.9071705 [DOI] [Google Scholar]
  • 17. Geisbauer C, Schweiger HG, Kotak Y: Experimental calendar ageing data for lithium-ion battery chemistries. [data set]. Zenodo. 2022. 10.5281/zenodo.6546753 [DOI] [PMC free article] [PubMed]
  • 18. Chen T, Guestrin C: Xgboost. Proceed-ings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2016;785–794. 10.1145/2939672.2939785 [DOI] [Google Scholar]
  • 19. Hagenlocher P: Decision tree learning. Reference Source [Google Scholar]
  • 20. Natekin A, Knoll A: Gradient boosting machines, a tutorial. Front Neurorobot. 2013;7:21. 10.3389/fnbot.2013.00021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Alcolea A, Resano J: Fpga accelerator for gradient boosting decision trees. Electronics. 2021;10(3):314. 10.3390/electronics10030314 [DOI] [Google Scholar]
  • 22. Farizawani AG, Puteh M, Marina Y, et al. : A review of artificial neural network learning rule based on multiple variant of conjugate gradient approaches. J Phys Conf Ser. 2020;1529:022040. 10.1088/1742-6596/1529/2/022040 [DOI] [Google Scholar]
  • 23. Celen B, Oniz Y: Trajectory tracking of a quadcopter using fuzzy logic and neural network controllers. 2018 6th International Conference on Control Engineering & Information Technology (CEIT). 2018;1–6. 10.1109/CEIT.2018.8751810 [DOI] [Google Scholar]
  • 24. Jierula A, Wang S, OH TM, et al. : Study on accuracy metrics for evaluating the predictions of damage locations in deep piles using artificial neural networks with acoustic emission data. Appl Sci. 2021;11(5):2314. 10.3390/app11052314 [DOI] [Google Scholar]
  • 25. Xu C, Dai Q, Gaines L, et al. : Future material demand for automotive lithium-based batteries. Commun Mater. 2020;1(1):99. 10.1038/s43246-020-00095-x [DOI] [Google Scholar]
  • 26. Wang S, Jin S, Deng D, et al. : A critical review of online battery remaining useful lifetime prediction methods. Front Mech Eng. 2021;7:719718. 10.3389/fmech.2021.719718 [DOI] [Google Scholar]
Open Res Eur. 2023 Mar 2. doi: 10.21956/openreseurope.16982.r30873

Reviewer response for version 2

Min Ye 1

The revision was perfect.

Is the study design appropriate and does the work have academic merit?

Yes

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Partly

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Partly

Reviewer Expertise:

energy management system

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Open Res Eur. 2022 Dec 2. doi: 10.21956/openreseurope.15927.r30383

Reviewer response for version 1

Min Ye 1

The authors used the machine learning algorithms, Extreme Gradient Boosting (XGBoost) and artificial neural network (ANN) to predict calendar aging data for six lithium-ion batteries. The prediction results show that XGBoost error rate and fitting performance are more suitable for EV battery calendar aging prediction applications. Hence, the paper may be value of being accepted after carefully revisions.

Some specific comments are as follows:

  1. In terms of literature, this year's references on XGBoost and battery aging models should be appropriately cited.

  2. Please make sure that all the variables are in italics and should be defined in the annotations accordingly.

  3. The diagram of the full paper should be clear, with detailed abscissa ordinate notes and units. For example, the abscissa in Figure 7 should be added to the units.

  4. In the introduction, the organization and the main contributions of the paper are missing, please add relevant content accordingly.

  5. Figure 1 is too simple. Some information about the paper topic should be added. Or else delete it.

  6. In figure 2, % should be added into the Soc value, for example soc < 75%.

  7. It is better to express the number in percentage, such as MAE and MAPE and other results instead of decimal value. The format will be more clear.

Is the study design appropriate and does the work have academic merit?

Yes

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Partly

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Partly

Reviewer Expertise:

energy management system

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Open Res Eur. 2022 Dec 12.
Furkan Metin Turgut 1

Hello Prof. Ye, I have some questions about your comments. I would appreciate if we could eloborate some of them.

- You mentioned annotations, do you think there should be a nomenclature section where we explain the variables ?

- You wanted an organization and main contributions part in the paper. However, initially even i added that ORE colleagues wanted me to scratch it. Instead, those information has been added to the ORE webpage of the paper

- Prof. Ye, we spent a lot of time on data availability under the guidance of ORE colleagues. As a result, we added really explanatory and rich data to Zenodo. It is the data we use to produce the results. Can you explain why you answered data availability question as "partly" and what can we do for you to change this ?

- Regarding the other "partly" answer, Prof. Ye, we are a R&D company but usually providing services as an automotive industry partner. We cannot exactly share and give away the codes and other tools we used to produce the results because they have been stemmed from customer projects of AVL. Also, investments have been made to companies and employees to accumulate the know-how. You answered the question about the detail of the methodology as "partly" but what can we do for you to reconsider this? Can you eloborate? Other than these, I am already finished with your other remarks,

- You mentioned detailed abscissa for Figure 7, I added the data sample points to the plot (I am assuming this is what you mean by abscissa and is there any other figure needing this other than Figure 7. I only did it for Figure 7) - I deleted the Figure 1 - Converted each MAPE result to their % equivalent - Added % to the SOC values in Fig. 2.

Open Res Eur. 2022 Aug 23. doi: 10.21956/openreseurope.15927.r29904

Reviewer response for version 1

Suleyman Tosun 1

The paper presents a calendar aging modeling method that uses two different machine learning algorithms; namely,  Extreme Gradient Boosting (XGBoost) and artificial neural network (ANN).

In the introduction, the authors explain why the calendar aging modeling is essential for batteries of different types. Therefore, they show how this work fulfills the necessity of modeling calendar aging. 

As a methodology, they use XGBoost and ANN machine learning algorithms. XGBoost is a decision tree-based machine learning algorithm. The authors give the decision tree of the algorithm using 10 different parameters. The algorithm also gives decisions even if some of the data are missing when tracking the branches of the tree. The second method is ANN, which is the most commonly used machine learning algorithm. The selection of these algorithms is suitable for the problem since both algorithms have been proven to have good prediction capabilities. 

The authors compare these two methods in terms of mean absolute error (MAE) and percentile variation mean absolute percentage error (MAPE). The analysis of the results shows that the XGBoost has a better prediction than ANN for all types of batteries in terms of the determined MAE and MAPE results. The results seem accurate and the method can be applied to commercial tools.

Overall, the paper gives the necessary information for a reader who would like to implement and experience the algorithms presented in the article. 

My only suggestion is that the paper needs some grammatical and punctuation corrections. For example, I recommend not to use informal "don't" or "doesn't". Instead, the words "do not" or does not" should be used. Additionally, there are some sentences that need to be checked again for correct punctuation. You can use commercial grammar tools for that.

Is the study design appropriate and does the work have academic merit?

Yes

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Computer engineering and computer science

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Geisbauer C, Schweiger HG, Kotak Y: Experimental calendar ageing data for lithium-ion battery chemistries. [data set]. Zenodo. 2022. 10.5281/zenodo.6546753 [DOI] [PMC free article] [PubMed]

    Data Availability Statement

    Underlying data

    Zenodo: Lithium-ion battery calendar ageing data - discharge and charge capacity, days passed, temperature and SOC. https://doi.org/10.5281/zenodo.6546753 17

    Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).


    Articles from Open Research Europe are provided here courtesy of European Commission, Directorate General for Research and Innovation

    RESOURCES