Abstract
Cancer is a worldwide illness that causes significant morbidity and death and imposes an immense cost on global public health. Modelling such a phenomenon is complex because of the non-stationarity and complexity of cancer waves. Apply modern novel statistical methods directly to raw clinical data. To estimate extreme cancer death rate likelihood at any period in any location of interest. Traditional statistical methodologies that deal with temporal observations of multi-regional processes cannot adequately deal with substantial regional dimensionality and cross-correlation of various regional variables. Setting: multicenter, population-based, medical survey data-based biostatistical approach. Due to the non-stationarity and complicated nature of cancer, it is challenging to model such a phenomenon. This paper offers a unique bio-system dependability technique suited for multi-regional environmental and health systems. When monitored over a significant period, it yields a reliable long-term projection of the chance of an exceptional cancer mortality rate. Traditional statistical approaches dealing with temporal observations of multi-regional processes cannot effectively deal with large regional dimensionality and cross-correlation between multiple regional data. The provided approach may be employed in numerous public health applications, depending on their clinical survey data.
Subject terms: Cancer, Diseases, Risk factors, Mathematics and computing
Introduction
The National Cancer Institute defines cancer as a group of disorders in which aberrant cells may proliferate and invade neighbouring tissue. Cancer may develop in most regions of the body, resulting in various cancer forms, as indicated below, and can sometimes spread via the blood and lymph systems.
Cancer's statistical characteristics received much attention from the current scientific community 1–8. Using current theoretical statistical methods 9–15, it is often rather challenging to compute realistic biological system dependability factors and outbreak probability under actual cancer settings. Typically, this results from many degrees of system freedom and random variables driving vastly dispersed dynamic biological systems. In theory, the dependability of a complex biological system may be precisely evaluated using sufficient observations or direct Monte Carlo simulations. Beginning in 1990, however, a portion of the available cancer observation numbers are limited16–21. Motivated by the latter point, the authors have developed a unique dependability technique for biological and health systems to forecast and control cancer epidemics more precisely. The whole globe was selected because of the enormous internet health observations and associated research1.
In health and engineering fields, statistical modelling of lifetime data and extreme value theory (EVT) are widespread. For example, Gumbel utilised EVT to predict the demography of distinct communities in20–23. Recent papers arguing for and against the upper bounds distribution of life expectancy were done by24. Often, papers in these fields presume a parametric bivariate lifetime distribution obtained from the exponential distribution to get statistically relevant data24. In25, the author proposes a new approach that uses Power Variance Function copulas (e.g., Clayton, Gumbel and Inverse Gaussian copulas), conditional sampling, and numerical approximation used in survival analysis. While in a paper by26, the authors explain that EVT has been used to predict mutation in evolutionary genetics and further develop a likelihood framework from EVT that was used to determine the fitness effects of the mutation.
Similarly, in27, The author applies a Beta-Burr distribution to this EVT hypothesis to calculate the fitness impact. While in28, the author presents a bivariate logistic regression model, which was afterwards used to access multiple MS fatalities with walking difficulties and in a cognitive experiment for visual identification. Finally3, is a relevant work utilising EVT to evaluate the chance of a global cancer breakout. In22,23, similarly, researchers employed EVT to predict and identify cancer abnormalities.
In this research, a cancer outbreak is seen as an unanticipated occurrence that may occur in any location of a nation at any moment; hence, the spatial spread is considered. Moreover, a specific non-dimensional factor is introduced to forecast the cancer risk at any given time and location. Environmental impacts on biological systems are ergodic. The second possibility is to see the process as reliant on specific external characteristics whose time-dependent change may be modelled as an ergodic process on its own. The incidence data of cancer in one hundred ninety-five world countries during the years 1990–2019 were retrieved from the public website1, considered a multi-degree-of-freedom (MDOF) spatio-temporal dynamic bio-system with highly inter-correlated regional components/dimensions.
This research tries to reduce the danger of future cancer outbreaks by forecasting them. However, it focuses simply on the yearly number of documented patient deaths and not on the symptoms themselves. Figure 1 presents the map of the world's countries.
Further research should incorporate one of the common complexity measures, such as fractal, attractor/embedding dimension, and entropy.
Methods
Consider an MDOF (multi-degree of freedom) structure subjected to random ergodic environmental factors (stationary in time). The second possibility is to see the process as reliant on certain external characteristics whose time-dependent change may be modelled as an ergodic process on its own. The MDOF biomedical response vector process is measured and/or simulated over a sufficiently long time interval . Unidimensional global maxima over the duration of time are denoted as , , .By sufficiently long time one primarily means a large value of with respect to the dynamic system auto-correlation time33–40.
Let be consequent in time local maxima of the process at monotonously increasing discrete time instants in . The analogous definition follows for other MDOF response components with and so on. For simplicity, all components, and therefore its maxima are assumed to be non-negative. The aim is to estimate the system failure probability
1 |
with
2 |
being the probability of non-exceedance for response components , , ,… critical values; denotes logical unity operation; and being joint probability density of the global maxima over the entire time span .
In practice, it is not possible to accurately estimate the latter joint probability distribution due to its high dimensionality and available data set limitations. In other words, the time instant when either exceeds , or exceeds , or exceeds , and so on, the system being regarded as immediately failed. Fixed failure levels , , ,…are of course individual for each unidimensional response component of . , ,, and so on.
Next, the local maxima time instants in monotonously non-decreasing order are sorted into one single merged time vector .Note that , . In this case represents local maxima of one of MDOF bio-system response components either or , or and so on. That means that having time record, one just has to continually and concurrently screen for local maximums of unidimensional response components and record their exceeding the MDOF limit vector in any of its components . The maxima of local unidimensional response components are blended into a non-decreasing temporal vector in accordance with the merged time vector . That is to say, each local maxima is the actual encountered local maxima corresponding to either or , or and so on. Finally, the unified limit vector is introduced with each component is either , or and so on, depending on which of or , or etc., corresponding to the current local maxima with the running index .
Next, a scaling parameter is implemented to artificially lower limit values for all response components concurrently, namely the new MDOF limit vector with , , , … is introduced. The unified limit vector is introduced with each component is either , or and so on. The latter automatically defines probability as a function of , note that from Eq. (1). Non-exceedance probability can be now estimated as follows
3 |
In practice, a dependency between neighbouring is not always negligible; thus, the following one-step (called here conditioning level ) memory approximation is introduced
4 |
for (called here conditioning level ). The approximation introduced by Eq. (4) can be further expressed as
5 |
where (will be called conditioning level ), and so on. The goal is to monitor each isolated failure that occurs locally first in time, thereby preventing cascade local inter-correlated exceedances.
Equation (5) presents subsequent refinements of the statistical independence assumption. The latter type of approximation enables capturing the statistical dependence effect between neighbouring maxima with increased accuracy. Since the original MDOF bio-process was assumed ergodic and therefore stationary, the probability for will be independent of but only dependent on conditioning level . Thus non-exceedance probability can be approximated as in the Naess-Gaidai method29,30, where
6 |
Note that Eq. (6) follows from Eq. (1) by neglecting , as the design failure probability is usually very small. Further, it is assumed . Note that Eq. (5) is similar to the well-known mean up-crossing rate equation for the probability of exceedance32. There is obvious convergence with respect to the conditioning parameter
7 |
Note that Eq. (6) for turns into the quite well-known non-exceedance probability relationship with the mean up-crossing rate function
8 |
where is the mean up-crossing rate of the response level for the above assembled non-dimensional vector assembled from scaled MDOF bio-system response . Note that constructed -vector has no data loss at all; see Fig. 2.
In the preceding, the assumption of stationarity has been employed. The proposed methodology can also treat the non-stationary case. An illustration of how the methodology can be used to treat non-stationary cases is provided. Consider a scattered diagram of environmental states, each short-term bio-environmental state having a probability , so that . The corresponding long-term equation is then
9 |
with being the same function as in Eq. (7) but corresponding to a specific short-term environmental state with the number . The above introduced as functions are often regular in the tail, specifically for values of approaching and exceeding . More precisely, for , the distribution tail behaves similarly to with being suitably fitted constants for suitable tail cut-on value. Therefore, one can write
10 |
Next, by plotting versus , often nearly perfectly linear tail behaviour is observed. Optimal values of the parameters may also be determined using a sequential quadratic programming (SQP) method incorporated in the NAG Numerical Library31.
For levels of approaching , the approximate limits of a p-% confidence interval (CI) of can be given as follows41–46
11 |
with being estimated from the inverse normal distribution, for example, , . with being the total number of local maxima assembled in the analysed vector .
Results
Predictions of cancer-related mortality have been the focus of epidemiology and mathematical biology for a long time. It is common knowledge that the dynamics of public health are a highly non-linear, multidimensional, spatially cross-correlated dynamic system that is always difficult to analyse. Previous studies have used a variety of approaches to model cancer cases. This section presents the application of the above-described methodology to the real-life cancer data sets, presented as a new annual recorded time series for all world countries. The statistical information presented in this section was obtained from the official World website1. The website provides cancer death rates per country from 1990 to 2019. Patient death numbers from one hundred ninety-five different world countries were chosen as components , thus constituting an example of a one hundred ninety-five dimensional (195D) dynamic biological system. To unify all 195 measured time series the following scaling was performed
12 |
making all 195 responses non-dimensional and having the same failure limit equal to 1. Failure limits , or in other words, cancer thresholds, are not an obvious choice. The most straightforward choice would be for different countries to set failure limits equal to the corresponding country population in per cent to local population, basically making equal to the annual death rate per country. Next, all local maxima from 195 measured time series were merged into one single time series by keeping them in time non-decreasing order: with the whole vector being sorted according to non-decreasing times of occurrence of these local maxima.
Figure 3 presents the number of new annual recorded deaths as a 195D vector , consisting of assembled regional new annual death rate for each corresponding country. Greenland, Mongolia, Monaco and Hungary data were excluded from analysis, since were regarded as outliers. Note that vector is assembled of different regional components with different cancer backgrounds. Index is just a running index of local maxima encountered in a non-decreasing time sequence.
Figure 4 presents the annual death rate (percentage of deaths from cancer to the population of a given country) prediction, 100 years return level extrapolation according to Eq. (10) towards cancer outbreak with a 100-year return period, indicated by the horizontal dotted line. Somewhat beyond, % cut-on value was used, percentage of the local population on the horizontal axis. The dotted lines indicate extrapolated 95% confidence interval according to Eq. (11). According to Eq. (5) is directly related to the target failure probability from Eq. (1). Therefore, in agreement with Eq. (5), system failure probability can be estimated. Note that in Eq. (6), corresponds to the total number of local maxima in the unified response vector . Conditioning parameter was found to be sufficient due to occurrence of convergence with respect to , see Eq. (6). Figure 4 exhibits reasonably narrow 95% CI. The latter is an advantage of the proposed method.
The predicted cancer death rate in any world country in any year to come for the next 100 years was found to be about 0.24%.
Note that, although being unique, the above-described technique has the distinct benefit of using existing measured data sets very effectively owing to its capacity to deal with the multidimensionality of the health system and to execute correct extrapolation using relatively small data sets.Note that the predicted non-dimensional level, indicated by the star in Fig. 4, represents the probability of cancer outbreak in any world country in the years to come.
In order to validate the suggested methodology, a twice smaller data set was used to obtain predictions for the same probability levels of interest as in Fig. 4. The twice smaller data set was obtained from the original data set by sampling every second consecutive data point. Predicted , based on reduced data set, was found within 95% CI based on the entire data set, indicated in Fig. 4.
The second-order difference plot (SODP) originated from the Poincare plot. SODP provides observing the statistical situation of consecutive differences in time series data.
Figure 5 presents SODP along with a third-order difference plot TODP and a fourth-order difference plot FODP. These kinds of plots can be used for data pattern recognition and comparison with other data sets, for example, for the entropy artificial intelligence (AI) recognition approach32. Note that EVT is asymptotic and 1DOF, while this study introduces MDOF and sub-asymptotic approaches. To summarise, the predicted non-dimensional λ level, indicated by the star in Fig. 4, represents the probability of world cancer deaths in the years to come. The methodology's limitation lies in its assumption of the underlying bio-environmental process quasi-stationarity.
Discussion
Traditional health systems reliability methods dealing with observed time series do not have the advantage of dealing efficiently with systems possessing high dimensionality and cross-correlation between different system responses. The essential advantage of the introduced methodology is its ability to study the reliability of high dimensional non-linear dynamic systems.
Despite the simplicity, the present study successfully offers a novel multidimensional modelling strategy and a methodological avenue to implement forecasting of the cancer death rate. Proper setting of health system alarm limits (failure limits) per country has been discussed.
This paper studied recorded cancer death rates from all world countries, constituting an example of a one hundred ninety-five dimensional (195D) observed from 1990 to 2019. In real-time, the novel reliability method was applied to cancer annual death rate numbers as a multidimensional system. The theoretical reasoning behind the proposed method is given in detail. Note that the use of direct either measurement or Monte Carlo simulation for dynamic biological system reliability analysis is attractive; however, dynamic system complexity and its high dimensionality require the development of novel robust and accurate techniques that can deal with a limited data set at hand, utilising available data as efficient as possible.
The main conclusion is that the public health system under local environmental and epidemiologic conditions is well managed. This study predicted an annual death rate 100-year return period risk level equal to about 0.24%. Therefore, under current national health management conditions, cancer still represents a future threat to world health.
This study further aimed to develop a general-purpose, robust, and straightforward multidimensional reliability method. The method introduced in this paper has been previously validated by application to a wide range of simulation models, but for only one-dimensional system responses and, in general, very accurate predictions were obtained. Both measured and numerically simulated time series responses can be analysed. It is shown that the proposed method produced a reasonable confidence interval. Thus, the suggested methodology may become appropriate for various non-linear dynamic biological systems reliability studies. Finally, the suggested methodology can be used in many public health applications. The presented cancer example does not limit areas of new method applicability (Supplementary file).
The suggested method can work well with non-stationary data sets (for example, seasonal variations) as soon as they represent the proof of interest. If, however, there is an underlying trend in the process of interest or the data was manipulated, those effects have to be identified. In that case, trend analysis should be performed, a topic for future studies. In any case, authors assume that within 3 years, horizon quasi-stationarity may be assumed. Therefore, the limitation of this study lies within the assumption of bio-system quasi-stationarity, which is, of course, not valid for many years to come.
Supplementary Information
Acknowledgements
The authors declare no conflicts of interest. No funding was received. All authors contributed equally. Authors declare their research conformity with journal ethical standards.
Author contributions
O.G.—theory, P.Y.—data analysis, Y.X.—corresponding author.
Data availability
The datasets analysed during the current study are available online1 https://ourworldindata.org/causes-of-death. The authors confirm that all methods were performed following the relevant guidelines and regulations according to the Declarations of Helsinki.
Code availability
For software used to extrapolate probability tails in this study, see https://github.com/cran/acer.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-023-27547-x.
References
- 1.Ritchie, H., Spooner, F. & Roser, M. Causes of death. In Our World in DataOur World in Data, https://ourworldindata.org/causes-of-death.
- 2.Siegel R, Miller K, Fuchs H, Jemal A. Cancer statistics. CA Cancer J. Clin. 2022 doi: 10.3322/caac.21708. [DOI] [PubMed] [Google Scholar]
- 3.Yabroff KR, Wu XC, Negoita S, Stevens J, Coyle L, Zhao J, Mumphrey BJ, Jemal A, Ward KC. Association of the COVID-19 pandemic with patterns of statewide cancer services. J. Natl. Cancer Inst. 2021;2021:28. doi: 10.1093/jnci/djab122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: Incidence- SEER 9 Registries Research Data with Delay- Adjustment, Malignant Only, November 2020 Submission (1975- 2018) <Katrina/Rita Population Adjustment>- Linked to County Attributes- Total US, 1969- 2018 Counties. National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program, Surveillance Systems Branch (2021).
- 5.Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: Incidence- SEER 18 Registries Research Data + Hurricane Katrina Impacted Louisiana Cases, November 2020 Submission (2000- 2018) <Katrina/Rita Population Adjustment>- Linked to County Attributes- Total US, 1969–2018 Counties. National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program, Surveillance Systems Branch (2021).
- 6.Surveillance Research Program. SEER*Explorer: an interactive website for SEER cancer statistics. National Cancer Institute 2021 (Accessed15 Apr 2021); seer.cancer.gov/explorer/.
- 7.Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: Incidence- SEER Research Limited- Field Data With Delay- Adjustment, 21 Registries, Malignant Only, November 2020 Submission (2000- 2018)- Linked To County Attributes- Time Dependent (1990- 2018) Income/Rurality, 1969- 2019 Counties. National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program (2021).
- 8.Surveillance Research Program, Statistic Methodology and Applications. DevCan: Probability of Developing or Dying of Cancer Software. Version 6.7.9. National Cancer Institute (2021).
- 9.Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: North American Association of Central Cancer Registries (NAACCR) Incidence Data- Cancer in North America Analytic File, 1995- 2018, With Race/Ethnicity, Custom File With County, American Cancer Society Facts and Figures Projection Project (which includes data from the Center for Disease Control and Prevention's National Program of Cancer Registries, the Canadian Council of Cancer Registries' Provincial and Territorial Registries, and the National Cancer Institute's SEER Registries, certified by the NAACCR as meeting high- quality incidence data standards for the specified time periods). National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program (2021).
- 10.Sherman, R., Firth, R. & Charlton, M. et al. Cancer in North America: 2014- 2018. Volume One: Combined Cancer Incidence for the United States, Canada and North America. North American Association of Central Cancer Registries, Inc (2021).
- 11.Sherman, R., Firth, R. & Charlton, M. et al. Cancer in North America: 2014- 2018. Volume Two: Registry- Specific Cancer Incidence in the United States and Canada. North American Association of Central Cancer Registries, Inc (2021).
- 12.Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: Mortality- All Causes of Death, Total US (1969- 2019) <Katrina/Rita Population Adjustment>- Linked To County Attributes- Total US, 1969- 2019 Counties (underlying mortality data provided by the National Center for Health Statistics). National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program (2021).
- 13.Wingo PA, Cardinez CJ, Landis SH, et al. Long- term trends in cancer mortality in the United States, 1930–1998. Cancer. 2003;97(12 suppl):3133–3275. doi: 10.1002/cncr.11380. [DOI] [PubMed] [Google Scholar]
- 14.Murphy, S. L., Kochanek, K. D., Xu, J. & Heron, M. Deaths: Final Data for 2012. National Vital Statistics Reports. Vol 63, No. 9. National Center for Health Statistics (2015). [PubMed]
- 15.Steliarova-Foucher E, Stiller C, Lacour B, Kaatsch P. International classification of childhood cancer. Cancer. 2005;103:1457–1467. doi: 10.1002/cncr.20910. [DOI] [PubMed] [Google Scholar]
- 16.Fritz A, Percy C, Jack A, et al. International Classification of Diseases for Oncology. World Health Organization; 2000. [Google Scholar]
- 17.World Health Organization (WHO). In International Statistical Classification of Diseases and Related Health Problems, 10th revision. Vol I- III. WHO (2011).
- 18.Surveillance Research Program. In SEER*Stat software, version 8.3.8. National Cancer Institute (2020).
- 19.Surveillance Research Program. In Joinpoint Regression Program version 4.9.0.1. National Cancer Institute, Statistical Research and Applications Branch (2021).
- 20.Mariotto AB, Zou Z, Johnson CJ, Scoppa S, Weir HK, Huang B. Geographical, racial and socio- economic variation in life expectancy in the US and their impact on cancer relative survival. PLoS ONE. 2018;13:e0201034. doi: 10.1371/journal.pone.0201034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Clegg LX, Fever EJ, Mistune DN, Fay MP, Hankey BF. Impact of reporting delay and reporting error on cancer incidence rates and trends. J. Natl. Cancer Inst. 2002;94:1537–1545. doi: 10.1093/jnci/94.20.1537. [DOI] [PubMed] [Google Scholar]
- 22.Gumbel E. Statistics of Extremes. Columbia University Press; 1958. [Google Scholar]
- 23.Sarkar SK. A continuous bivariate exponential distribution. J. Am. Stat. Assoc. 1987;82(398):667–675. doi: 10.1080/01621459.1987.10478481. [DOI] [Google Scholar]
- 24.Gupta RD, Kundu D. Theory & methods: Generalised exponential distributions. Aust. N. Z. J. Stat. 1999;41(2):173–188. doi: 10.1111/1467-842X.00072. [DOI] [Google Scholar]
- 25.Romeo JS, Meyer R, Gallardo DI. Bayesian bivariate survival analysis using the power variance function copula. Lifetime Data Anal. 2018;24:355–383. doi: 10.1007/s10985-017-9396-1. [DOI] [PubMed] [Google Scholar]
- 26.Beisel CJ, Rokyta DR, Wichman HA, Joyce P. Testing the extreme value domain of attraction for distributions of beneficial fitness effects. Genetics. 2007;176(4):2441–2449. doi: 10.1534/genetics.106.068585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Joyce P, Abdo Z. Determining the distribution of fitness effects using a generalised Beta-Burr distribution. Theor. Popul. Biol. 2018;122:88–96. doi: 10.1016/j.tpb.2017.07.001. [DOI] [PubMed] [Google Scholar]
- 28.Kristensen SB, Bibby BM. A bivariate logistic regression model based on latent variables. Stat. Med. 2020;39(22):2962–2979. doi: 10.1002/sim.8587. [DOI] [PubMed] [Google Scholar]
- 29.Naess A, Gaidai O. Estimation of extreme values from sampled time series. Struct. Saf. 2009;31(4):325–334. doi: 10.1016/j.strusafe.2008.06.021. [DOI] [Google Scholar]
- 30.Naess A, Moan T. Stochastic Dynamics of Marine Structures. Cambridge University Press; 2013. [Google Scholar]
- 31.Numerical Algorithms Group . NAG Toolbox for Matlab. World NAG Ltd; 2010. [Google Scholar]
- 32.Rice SO. Mathematical analysis of random noise. Bell Syst. Tech. J. 1944;23:282–332. doi: 10.1002/j.1538-7305.1944.tb00874.x. [DOI] [Google Scholar]
- 33.Xing Y, Gaidai O, Ma Y, Naess A, Wang F. A novel design approach for estimation of extreme responses of a subsea shuttle tanker hovering in ocean current considering aft thruster failure. Appl. Ocean Res. 2022;2022:123. doi: 10.1016/j.apor.2022.103179. [DOI] [Google Scholar]
- 34.Gaidai O, Wang F, Wu Y, Xing Y, Medina A, Wang J. Offshore renewable energy site correlated wind-wave statistics. Probab. Eng. Mech. 2022;2022:68. doi: 10.1016/j.probengmech.2022.103207. [DOI] [Google Scholar]
- 35.Sun J, Gaidai O, Wang F, Naess A, Wu Y, Xing Y, van Loon E, Medina A, Wang J. Extreme riser experimental loads caused by sea currents in the Gulf of Eilat. Probab. Eng. Mech. 2022;2022:68. doi: 10.1016/j.probengmech.2022.103243. [DOI] [Google Scholar]
- 36.Xu X, Wang F, Gaidai O, Naess A, Xing Y, Wang J. Bivariate statistics of floating offshore wind turbine dynamic response under operational conditions. Ocean Eng. 2022;2022:257. doi: 10.1016/j.oceaneng.2022.111657. [DOI] [Google Scholar]
- 37.Gaidai O, Xing Y, Wang F, Wang S, Yan P, Naess A. Improving extreme anchor tension prediction of a 10-MW floating semi-submersible type wind turbine, using highly correlated surge motion record. Front. Mech. Eng. 2022;2022:51. doi: 10.3389/fmech.2022.888497. [DOI] [Google Scholar]
- 38.Gaidai O, Xing Y, Xu X. COVID-19 epidemic forecast in USA East coast by novel reliability approach. Res. Sq. 2022 doi: 10.21203/rs.3.rs-1573862/v1. [DOI] [Google Scholar]
- 39.Xu X, Xing Y, Gaidai O, Wang K, Sandipkumar Patel K, Dou P, Zhang Z. A novel multidimensional reliability approach for floating wind turbines under power production conditions. Front. Mar. Sci. 2022 doi: 10.3389/fmars.2022.970081. [DOI] [Google Scholar]
- 40.Gaidai O, Xing Y, Balakrishna R. Improving extreme response prediction of a subsea shuttle tanker hovering in ocean current using an alternative highly correlated response signal. Results Eng. 2022 doi: 10.1016/j.rineng.2022.100593. [DOI] [Google Scholar]
- 41.Cheng, Y., Gaidai, O., Yurchenko, D., Xu, X., Gao, S. Study on the dynamics of a payload influence in the polar ship. In The 32nd International Ocean and Polar Engineering Conference, Paper Number: ISOPE-I-22-342 (2022).
- 42.Gaidai, O. et al. On-board trend analysis for cargo vessel hull monitoring systems. In The 32nd International Ocean and Polar Engineering Conference, Paper Number: ISOPE-I-22-541 (2022).
- 43.Gaidai O, Xu X, Naess A, Cheng Y, Ye R, Wang J. Bivariate statistics of wind farm support vessel motions while docking. Ships Offshore Struct. 2020;16(2):135–143. doi: 10.1080/17445302.2019.1710936. [DOI] [Google Scholar]
- 44.Gaidai O, Yan P, Xing Y, Xu J, Wu Y. A novel statistical method for long-term coronavirus modelling. F1000 Res. 2022;11:1282. doi: 10.12688/f1000research.125924.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gaidai O, Xu J, Yan P, Xing Y, Zhang F, Wu Y. Novel methods for wind speeds prediction across multiple locations. Sci. Rep. 2022;12:19614. doi: 10.1038/s41598-022-24061-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Gaidai O, Xing Y. Novel reliability method validation for offshore structural dynamic response. Ocean Eng. 2022;266:5. doi: 10.1016/j.oceaneng.2022.113016. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets analysed during the current study are available online1 https://ourworldindata.org/causes-of-death. The authors confirm that all methods were performed following the relevant guidelines and regulations according to the Declarations of Helsinki.
For software used to extrapolate probability tails in this study, see https://github.com/cran/acer.