Abstract
Assessment of monthly maximum wind speed (MMWS) is essential in various environmental disciplines, including architectural risk assessment, climatology, renewable energy sources, building design, and agricultural activities. The distribution of wind speed is an important module of its development. Presently, several mixing distributions have been suggested for fitting the theoretical maximum wind speed distributions. However, the drawback of various distribution elements is the outcome of numerous fitting presentations, and it is challenging to choose the best form of single distribution to employ when building a hybrid model. In order to solve those difficulties, utilizing the mixtures of Sine-skewed-von Mises distributions (MSS-VM) with various prior parameter distributions is extended in the present study. In this research, the assessment of MMWS employing the maximum-likelihood estimation (MLE) method is taken into consideration for the wind speed data from the Pakistan province of Baluchistan Gwadar station. The results of the Anderson Darling (AD), Chi-square, BIC, and AIC tests, as well as the Kolmogorov Smirnov (KS) test, indicated that MSS-VM was the best distribution for the Gwadar station. The suggested models automatically find the optimum number of elements, in addition to having more appropriate performance. The outcomes demonstrate that MSS-VM performance is more suitable than other heterogeneous and single-distribution models. The design estimations derived using the Bayesian mixture distribution provide a sense of the highest wind speed that will occur across a particular area. It is therefore crucial for designers and policymakers in the planning, designing, and building of various structures.
Keywords: Comparison wind speed distribution, Mixture of sine-skewed-von mises distributions, MLE, Heterogeneous distribution, Gwadar
1. Introduction
The wind represents a highly influential, naturally occurring phenomenon of the environment that affects a wide range of other variables. For instance, the topography that winds blow over affects both wind speed and temperature, which consequently causes other harmful climatic events like dust storms, typhoons, and hurricanes [1]. Currently, the single distribution most widely used and recognized parametric approach to describing wind speed is the distribution of Weibull [[2], [3], [4]]. Additional widely used distributions, like the Gamma [5], and Lognormal [6], are also taken into account when fitting distributions of the wind speed. The authors confirmed that the generalized Gamma distributions can fit the wind speed distributions with reliability [7]. According to Ref. [8], most wind speed data in the UAE can be more accurately fitted by the Log Pearson III, Generalized-Gamma, and Kappa distributions, according to the study on the L-moment ratio diagram. A five-parameter Wakeby probability distribution was found to be an appropriate fit for the information on wind speeds in Germany [9]. The characteristics of additionally typical distributions, such as inverse Burr, Marshall-Olkin Power Lomax [10], generalized extreme value (GEV) [11], and inverse Gamma [12] distributions, were also investigated in several areas.
1.1. Heterogeneous mixture distributions
In [13] using normal, extreme-value type 1, Weibull, and Gamma distributions, twelve heterogeneous and homogeneous mixed distributions are generated, and they are utilized for fitting the average data on wind speeds in the United Arab Emirates. Results showed that out of all heterogeneous and homogeneous mixture distributions, as well as single models, the Weibull Extreme Value type-one mixture distribution achieved better than others. A singularly truncated normal Weibull mixed probability (TN-W) was proposed in Ref. [14] to study wind speed features of Elazig Keban, Elazig, and Elazig Madden sites. TNW demonstrated superior distribution fit abilities when contrasted with two-component Weibull mixture distribution (WW). Based on the results presented in Ref. [15], the mixture Ga-W distribution is superior to the N-W, Weibull-Weibull, and normal mixture distributions in terms of estimating wind speed distribution. An algorithm built using Burr-GE, Wakeby, and Kappa suggested a method for estimating wind distributions worldwide [16]. The authors of [17] demonstrated that, out of ten mixing distributions, W-LN worked particularly well in Aljouf, whereas Gamma-Rayleigh performed best in Sharurah. In Ref. [18], some mixed heterogeneous distributions were suggested to generalize extreme values (Lognormal, Weibull-Lognormal, and Weibull-GEV). The related results showed that GEV-LN performed well in fitting the popular bell-shaped unidirectional distribution, but W-GEV was able to fit both bi-modal and unimodal wind distributions effectively. The fit performance of 23 mixtures and 20 single models for the wind distribution was investigated in Ref. [19]. The findings demonstrated that mixture distributions outperformed single distributions in most cases, including Dagum-GEV, Burr-GEV, GEV-W, and Dagum-Weibull.
As the influence of single distributions is limited many other researchers are creating mixed distributions. The results reported in the latest literature indicate that mixture models perform better than single models, especially in complex wind regimes with several modalities. Bimodality is a key focus of this study. Therefore, in this research, we also emphasize the use of mixture distributions to model wind speeds in various wind regimes. Nevertheless, there are constraints in achieving improved fitting performance through mixture modeling. The most challenging of the various distribution models to take into consideration while building the mixture distribution is that, according to recent literature, users make a subjective decision about the number; the number is chosen to be two in most heterogeneous mixture models [13,19,20]. Ouarda et al. [8], Fawad et al. [21], and Goel et al. [22] emphasized the three basic assumptions of stationarity, independence, and homogeneity, which are thought to be important when performing wind speed analysis. If such assumptions are violated, structural design estimations may be misrepresented, which may limit their application.
The Bayesian technique for symmetric directions especially, Bayesian analysis utilizing the symmetric von Mises-Fisher and von Mises distributions, has been widely studied in the literature [[23], [24], [25], [26]]. The mixture model von Mises Fisher is employed by Roge et al. [27] and Taghia et al. [28] and Mulder et al. [29] emphasize non-informative priors while providing a Bayesian model of inference for von-Mises mixture distributions utilizing the reverse run Monte Carlo Markov chain (MCMC) generator. It is evident from the previous that there exists a gap in the scientific literature, which motivated us to suggest a novel application of Bayesian estimation for wind data. Skew Wrapped Cauchy mixing model and skew von Mises-Fisher distribution have been exposed to Bayesian estimation [30,31]. The research gap related to the inadequate comprehension and examination of wind speed distributions, specifically with the application of mixture distributions such as the sine-skewed-von Mises (SSVM) distribution model in wind energy evaluations. Although much research has concentrated on single distribution models, like the Rayleigh, Weibull distributions and several others [[32], [33], [34], [35]]. There aren't many thorough examinations of how effectively mixture distribution models describe wind speed data under different circumstances and at different elevations.
The study proposes a novel mixture of the sine-skewed von Mises (MSS-VM) distribution, which enhances the modeling of wind speed data and multimodal behavior that are often observed in wind speed distributions. The research utilizes Bayesian estimation methods to derive a posterior distribution for estimating monthly maximum wind speeds. This approach not only improves the accuracy of the parameter estimates but also provides a robust framework for quantifying uncertainty in wind speed predictions. The effectiveness of the proposed Bayesian method, the study conducts a simulation analysis. This empirical evaluation demonstrates how well the MSS-VM model performs in estimating wind speed distributions. The study includes a thorough comparison of the proposed MSS-VM model with several other mixture distributions and established single distributions.
The remaining portion of this work is structured as follows: Study area is discussed in Section 2, material and methodology are discussed in section 3, and results are presented in section 4, and Section 5 presents a conclusion and recommendation.
2. Study area and data
A nation's economic growth and development are significantly influenced by its energy sector. Renewable sources of energy such as solar and wind power are accessible and affordable. Pakistan has not adequately investigated the possibilities of using alternative technology. Pakistan's energy consumption has increased dramatically over the last few decades and is predicted to continue rising. The primary energy resources of Pakistan include nuclear, hydro, natural gas, and oil. Pakistan has experienced a serious crisis of electricity in recent decades, which has resulted in poor economic growth because of insufficient energy strategy [36,37].
The Meteorological Department of Pakistan provided data on monthly maximum wind speeds from 1981 to 2019. Table 1 shows the statistical indicators, including mean, standard deviation, coefficient of variation, minimum, maximum, skewness, and kurtosis. The Gwadar district of Baluchistan meteorological station is situated at an elevation of 29.86 m, 25° 08′ latitude, and 62° 20′ longitude. At 10-min intervals, maximum wind data is recorded, which includes wind speed in meters per second at heights of ten- and 50-m. Typical wind speeds of higher than 9.2 miles/h, the windier portion of the year covers 6.1 months, from 3 March – 6 September. In Gwadar, May is the windiest month of the entire year, with an ordinary hourly wind speed of 10.4 mph [38,39]. Fig. 1 shows the geographical positions and displays in the Gwadar station wind speed directions. Fig. 1 makes it very clear that the station's wind velocity direction is looking southwest.
Table 1.
Presents the descriptive statistics of Gwadar station.
| Gwadar | Mean | S D | CV | Min | Max | Skewness | kurtosis | latitude | longitude | Elevation |
|---|---|---|---|---|---|---|---|---|---|---|
| 10m | 6.368 | 0.715 | 11.23 | 4.03 | 8.63 | −0.15 | 0.20 | 25° 08′ | 62° 20′ | 29.86 |
| 50m | 7.505 | 0.721 | 9.62 | 5.49 | 10.08 | 0.37 | 0.42 | 25° 08′ | 62° 20′ | 29.86 |
Fig. 1.
Study area and Present direction of wind speed in district Gwadar.
Gwadar has an arid environment that is hot and dry, and it is situated between 0 and 984 ft above sea level. The effect of the ocean keeps the temperature higher in the winter and lower in the summer. June is the hottest month with an average temperature that stays between 31 °C and 32 °C. January's coldest month with an average temperature ranging from 18 °C (64 °F) to 19 °C (66 °F). One distinctive feature of the Balochistan coastline region is its consistent temperature [40]. The winters are comfortable, except occasional cold spells brought on by winds sweeping down the Balochistan desert. Winters in Gwadar are shorter than summers [41]. The climate in Gwadar is the same as that of the Middle East because the majority of the rain falls between December and January. On June 6, 2010, the most rainfall previously recorded 227 mm or 8.9 inches in 24 h was recorded.
Gwadar station was selected for this study due to its unique coastal location along the Arabian Sea. This makes it ideal for capturing wind patterns influenced by both land-sea interactions and regional climatic phenomena, such as the monsoon. Additionally, Gwadar's proximity to important maritime routes and its development as a port city further highlight the significance of understanding local wind dynamics, which can impact shipping, infrastructure, and renewable energy projects, particularly wind energy potential.
3. Materials and methods
3.1. Candidate probability distributions (PDs)
In the present study, Wind speed analysis was calculated using many distributions. These distributions, which are frequently employed in the Analysis of exceptional occurrences, included five single distributions, and multiparameter mixture distributions. These are Gamma (GAM), Generalized Gamma (GG), Dagam3, Fatigue Life (3), Inverse Gaussian (IGA), Weibel (3), Beta, (NAK), Jonson Su (JON), Normal (N), Kumaraswami, Log gamma (LG), Kumaraswamy-normal, Nakagami-Gumbel, Dagum-Beta, Log normal-log Pearson, Log logistics-inverse gaussian, Normal-Gamma distributions. The majority of such distributions were proposed for on-site wind speed and analysis across multiple countries [[42], [43], [44], [45], [46], [47], [48], [49], [50]].
3.2. Estimation parameters of PDs by the maximum likelihood estimation
The MLE approach was utilized in this study to estimate the characteristics of PDs. It was previously applied in the frequency estimation of wind speed [[51], [52], [53], [54]]. The MLE approach is typically employed when estimating a population parameter from sample data on wind speed. RA Fisher established and promoted this technique between 1912 and 1922 [55]. Estimation of parameters is used in the assessment of the method of a linear method to produce the maximum. Probability that an observation will be made. The probabilistic function can be defined as a collection of values for the parameters, for distribution with a probability density function (pdf) provided by f(x) and PDF from the Union of Observing Condition [56].
3.3. The sine-skewed von mises distribution
The probability distribution is called the sine-skewed von-Mises distribution. This model is an extension of the classical von-Mises distribution, which is widely used for modeling circular data (data that represents angles or directions) [57]. The von-Mises distribution is often referred to as the circular analog of the normal distribution due to its symmetrical, unimodal nature. However, the classical von-Mises distribution is limited in its ability to capture asymmetry or skewness. To address this limitation, we introduce skewness into the von-Mises distribution using a sine function, resulting in the sine-skewed von-Mises distribution [58]. This modification allows the distribution to model both symmetrical and asymmetrical circular data. Nature, making it more flexible in capturing real-world phenomena that exhibit directional bias or asymmetry [59].
Numerous distributions on a single circle possess a common feature: they are symmetrical regarding their position μ ∈ [−π, π]. But, since the symmetry assumption in data is frequently rejected [60], introduces the k sine skewed von Mises distribution and density function as follows:
| [1] |
Here is the initial type of level 0, a location factor, μ ∈ [−π, π), τ > 0 is the parameter of concentration, the measure of skewness is , and a positive integer is k. The distributions are skewed left when λ > 0, and the distributions are skewed right when λ < 0. The expression for a combination of SSVM distributions by elements are
| [2] |
Where the parameter vectors are and ) and , Weights that represent the relative proportions of every element in the entire sample which is represented by the vector , . Algorithm1 was implied to create a sample based on the SSVM distribution equation (1). The proof of Sine-Skewed von Mises Distribution is discussed in Appendix B.
Algorithm 1
processes to create a sample based on the SSVM.
-
1
Create base on Von-Mises distribution given parameters k and .
-
2
Calculate U using U (0.1).
3When , as a result .
3.4. Estimation parameters of PDs by the maximum likelihood estimation
The MLE approach was utilized in this study to estimate the characteristics of PDs. It was previously applied in the frequency estimation of wind speed [[51], [52], [53], [54]]. The MLE approach is typically employed when estimating a population parameter from sample data on wind speed. RA Fisher established and promoted this technique between 1912 and 1922 [55]. Estimation of parameters are used in the assessment of the method of linear method to produce the maximum. Probability that an observation will be made. The probabilistic function can be defined as a collection of values for the parameters for distribution with a probability density function (pdf) provided by f(x) and PDF from the Union of Observing Condition [56].
The mixture of sine skewed Von-Mises in equation (2) can be represented by the function of log-likelihood as follows:
| [3] |
The partial derivations of equation (3) about ), the MLE parameters ) could be acquired. Because there are no closed-form equations available, estimations should be obtained numerically. The MLE is calculated employing the Differential Evolution (DE) technique [61], which uses the DEoptim package in the R programming language [62]. DE is a heuristic evolution approach for universal optimization that is being extensively investigated for its substantial efficiency as a worldwide optimization method for continuously minimizing numerical problems. It is effective in many scientific and technology-related applications.
3.4.1. Prior selection
The first stage in Bayesian analysis is to create a prior distribution, which captures the researcher's assumptions or understanding of the parameters before data observation. The choice of prior can significantly influence the results, especially when the sample size is small. Priors can be informative, reflecting strong prior beliefs based on previous studies or expert knowledge, or non-informative, aiming to exert minimal influence on the posterior distribution. In this study, the prior distribution for the parameters of the sine-skewed von Mises mixture model should be selected based on the predictable performance of wind speeds.
3.4.2. MCMC settings
The posterior distribution is computed using MCMC techniques like Gibb's sampling and the Metropolis-Hastings algorithm, particularly in cases when the posterior distribution cannot be computed analytically. These procedures create samples from the posterior distribution by creating a Markov chain that has the desired posterior distribution as its symmetry distribution.
Some important MCMC settings are:
-
•
A lot of beginning samples are eliminated in order to let the chain settle into a stationary distribution.
-
•
Decreasing autocorrelation by keeping only the k kth sample in the chain
-
•
The entire number of samples drawn, which must be sufficient to guarantee that the posterior distribution of the sample is well approximated.
-
•
Techniques like trace graphs and the Gelman-Rubin statistic to determine if the MCMC method has converged.
3.5. Goodness of fit tests
The process of assessing probability distribution (PD), the technique used for parameter estimation, and the accessibility of information on wind speeds are some of the variables that affect the final selection of PD. In the current study, the Akaike information criteria (AIC), Bayesian information criteria (BIC), Anderson Darling (AD) test, and Kolmogorov-Smirnov (KS) criterion were employed to assess the sufficiency of various PDs. To analyze the wind speed, the previous graphic tests and goodness-of-fit tests were performed [13,[63], [64], [65], [66], [67], [68]]. The previously mentioned goodness of fit tests can be applied with confidence to select the most suitable distribution in environmental statistics.
3.5.1. Kolmogorov-Smirnov (KS) test
It can be determined using the KS test; it depends on an empirical distribution function if the data set is taken according to a suggested distribution [65]. Assuming that we have a sample selected at random from the distributions, the EDF can be expressed as follows:
| [4] |
The highest vertical separation between the theoretical PD and EDF determines the KS statistic (D). The KS test statistic (D) is.
| [5] |
F represents the theoretically cumulative distribution's function, xi represents the ith level of the data, and n represents the sample size in equation (5).
3.5.2. Anderson-Darling (AD) test
The fit between an actual distribution model and the estimated distribution model is evaluated using the AD test. The downward trend of the distribution, which becomes a crucial feature in simulating extreme occurrences, is given more weight by the AD test [69]. The formula for the AD statistic A2 is as follows:
| [6] |
Where X represents the parameter being studied, represents the distribution's performance, and n represents the size of the sample. A2 represents the test outcome in this particular case.
3.5.3. Chi-Squared ( test
The goodness of fit test in chi-square is employed to evaluate the relationship between the two sets of data. This equation can be used to calculate it:
| [7] |
where the predicted value represents and the actual value represents . A significant level of relationship between the two sample distributions can be seen in the small test value.
3.5.4. Akaike and Bayesian information criteria
The alternative approach, which includes both BIC and AIC, depends upon the relative measurement loss of material when using the model to characterize the observations [70]. Both of these methods, though, represent one of the most widely used techniques. The usual approach to AIC and BIC as
| [8] |
| [9] |
where L represents the log-likelihood function, K represents the parameter, and n represents the sample size. The model with the smallest values of AIC and BIC for the wind speed information has been selected as having the most optimal fit distribution.
4. Results
It is important to confirm the primary principle, namely the homogeneity, stationarity, and independence of the maximum monthly wind speed (MMWS), before doing deep learning analysis. Mann-Whitney test statistics are utilized to assess homogeneity, while L-Jung-Box Q (LBQ) test statistics are employed to assess stationarity and independence [71]. A non-parametric test was recommended by Mann and Whitney [72] to verify the homogeneity. This test recommends first dividing the initial sample into two samples of relatively similar length so that . The complete sample should now be arranged in ascending order, with each sample value being noted as belonging to either the first or second sample. The fundamental premise of this test is that if the ranking orders of the components in the first sample are consistently higher or lower than the ranking orders of another sample, the entire sample will not be homogeneous. The smallest value of “V" between and is what this test depends on, as seen below.
| [10] |
where R1 displays the first sample's overall ranking order as a sum. Under the null hypothesis, which states that the data set at present is homogeneous, Mann-Whitney claimed that the variance as well as the mean for both samples n1 and n2 are almost normally distributed with the mean and variance as follows:
| [11] |
Also, the standard normal test statistic as
| [12] |
The benefit of utilizing the LBQ test is that it is a composite test that examines randomness “overall” as opposed to testing randomness at each lag [73,74]. The lag duration should not be greater than , in the LBQ test, where is the record length of the ith site, and the hypothesis to be set is that all K up to that lag is zero. In this test, H0: all the observed values are independent vs. Ha: all the observed values are non-independent. The test statistic is
| [13] |
In equation (13), m is the lag length, and is the sample autocorrelation at lag k. At the selected level of significance, the null hypothesis is rejected if . It is the Box-Pierce Q statistic in its modified form. Both statistics have a chi-square distribution with m degrees of freedom when n is large. Comparing LBQ statistics to Box-Pierce Q statistics, the former is superior for both small and large sample sizes. Table 2 reveals that the LBQ statistic for each lag of the wind speed data series is insignificant (individually 10m and 50m), which is adequate proof that the fundamental series are stationary and independent. For the M − W test, every group has been split into two portions. The even number of observations series has been split into two equal portions, while the other series are separated into around equal portions. All sequences display insignificance outcomes since 10m and 50m of Gwadar site p values are higher than 0.05, and it is therefore determined that the two groups of MMWS series have the same distribution. All testing revealed that every series was suitable for further analysis Table 2. In the case that certain MWMS sequence assumptions are not met, then more complicated statistical techniques must be used to account for the relationship between the data as it is observed and how it develops over time.
Table 2.
Outcomes of fundamental assumptions for the Gwadar wind farms.
| Stations | LJ-BOX | MWU-test | ||
|---|---|---|---|---|
| Gwadar | Statistics -value | P - Value | Statistics -value | P - Value |
| 10m | 2.346 | 0.531 | 1.68119 | .09296 |
| 50m | 0.632 | 0.654 | 0.09181 | .92,828 |
4.1. Probability Distributions (PDs) selection
The statistical analysis of the wind speed information is a critical instrument to use for wind farms, wind generators, and other purposes like irrigation. To assess the potential for wind energy in a particular area, it is important to accurately determine the probability distribution of the wind. In the beginning, the study took into consideration twenty PDs, i.e., Gamma (GAM), Generalized Gamma (GG), Dagam3, Fl (3), Inverse Gaussian (IGA), Weibel (3), Beta, Nakagami (NAK), Jonson Su (JON), Normal (N), Kumaraswami, Log gamma (LG), KUM-N, NAK-GUM, JON-DAG, DAG-B, LN-LP, LL-IGA, N-GAM, and Mixture SSVM. The goodness of fit methods were employed to evaluate the PDs, including the chi-square test, Anderson Darling, Kolmogorov Smirnov, AIC, and BIC. The assessments based on R2, AIC, BIC, AD, and KS criteria for the most accurate fit distributions are listed in Table 3. According to the smallest goodness of fit tests amongst the twenty PDs taken into consideration during the research, the best-fitted distribution was selected for Gwadar stations at 10 m and 50 m. The most significant outcomes for every PD are shown in Table 3. Each of the models under consideration will be applied and used to estimate the maximum wind speed distributions on wind speed databases, taking into account the specifications of the various models. Five goodness of fit criteria are calculated for each of the models. Table 3 shows the results of multiple models at Gwadar station 10m. Among the twenty models, the mixture SSVM distribution model performed best concerning all goodness of fit indices except KS, AD, chi-square, R2, and BIC, and KUM-N is also the most effective wind speed distribution model for Gwadar 10m. Interestingly, the predicted distributions of wind speed are produced by a mixture of SSVM. Regarding the four goodness-of-fit measurements, Kumaraswamy outperforms each of the other approaches out of the twelve single-distribution models. The mixture of SSVM is superior to KUM-N when these two are compared. Generally, mixture SSVM distributions outperform singular distributions and heterogeneous mixture distributions in this set of data. In Gwadar 50m, as shown in Table 4, in terms of KS, AD, and BIC, MSSVM is the best model, while NAK-GUM performs the best in terms of AD, KS, BIC, AIC, and all heterogeneous mixture distributions (MSSVM) and outperforms the other models under comparison concerning the distribution fitting of wind speeds. Among single and heterogeneous mixture distributions, the optimal distribution models for Gwadar 50m are Log gamma, NAK-GUM, KUM-N, DAG-B, and MSSVM, respectively.
Table 3.
Statistical assessment of different distribution functions for 10m Gwadar station.
| Gwadar 10m | Models | KS | AD | χ2 | AIC | BIC |
|---|---|---|---|---|---|---|
| Single | GA3 | 0.055 | 1.348 | 16.039 | 4750.93 | 4781.83 |
| GG | 0.052 | 1.028 | 16.548 | 4751.04 | 4782.15 | |
| DAG | 0.070 | 2.458 | 20.013 | 4753.39 | 4784.21 | |
| FL (3) | 0.044 | 0.820 | 12.808 | 4740.81 | 4771.51 | |
| IGA (3) | 0.045 | 0.849 | 11.341 | 4738.07 | 4769.05 | |
| WE (3) | 0.037 | 0.707 | 7.911 | 4733.67 | 4764.56 | |
| B | 0.040 | 0.602 | 11.702 | 4739.03 | 4769.04 | |
| NAK | 0.052 | 1.163 | 16.947 | 4750.53 | 4782.45 | |
| J SU | 0.062 | 5.398 | 14.261 | 4745.91 | 4776.92 | |
| N | 0.046 | 0.806 | 13.181 | 4743.68 | 4774.73 | |
| KUM | 0.038 | 0.702 | 12.618 | 4734.70 | 4765.71 | |
| L. GA | 0.582 | 0.821 | 17.675 | 4744.21 | 4775.31 | |
| Heterogeneous | KUM-N | 0.027 | 0.315 | 4.817 | 4727.03 | 4758.04 |
| NAK-GUM | 0.031 | 0.359 | 7.998 | 4732.54 | 4763.43 | |
| LP-GEV | 0.038 | 0.948 | 6.637 | 4735.99 | 4766.92 | |
| DAG-B | 0.031 | 0.347 | 5.716 | 4729.99 | 4758.85 | |
| LN-LP | 0.035 | 0.592 | 11.40 | 4737.67 | 4769.57 | |
| LL-IGA | 0.036 | 0.464 | 9.443 | 4736.81 | 4767.71 | |
| N-GAM | 0.034 | 0.671 | 5.739 | 4734.90 | 4765.84 | |
| Mixture | MSS-VM | 0.020 | 0.232 | 4.182 | 4724.67 | 4743.39 |
Table 4.
Statistical assessment of different distribution functions for 50m Gwadar station.
| Gwadar 50m | Models | KS | AD | χ2 | AIC | BIC |
|---|---|---|---|---|---|---|
| Single | GA3 | 0.025 | 0.389 | 3.788 | 3753.93 | 3818.83 |
| GG | 0.025 | 0.379 | 7.942 | 3760.04 | 3821.15 | |
| DAG | 0.024 | 0.377 | 4.262 | 3763.39 | 3812.21 | |
| FL (3) | 0.025 | 0.400 | 4.512 | 3761.81 | 3825.51 | |
| IGA (3) | 0.026 | 0.363 | 3.671 | 3758.07 | 3815.05 | |
| WE (3) | 0.025 | 0.395 | 3.925 | 3753.67 | 3822.56 | |
| B | 0.024 | 0.352 | 3.667 | 3750.03 | 3811.04 | |
| NAK | 0.026 | 0.421 | 8.076 | 3751.53 | 3827.45 | |
| J SU | 0.023 | 0.352 | 6.193 | 3745.91 | 3819.92 | |
| N | 0.028 | 0.424 | 5.971 | 3743.68 | 3814.73 | |
| KUM | 0.025 | 0.384 | 4.262 | 3734.70 | 3810.71 | |
| L. GA | 0.023 | 0.304 | 3.390 | 3754.21 | 3814.31 | |
| Heterogeneous | KUM-N | 0.023 | 0.315 | 2.830 | 3755.03 | 3808.04 |
| NAK-GUM | 0.020 | 0.266 | 4.794 | 3748.54 | 3805.43 | |
| LP-GEV | 0.023 | 0.316 | 6.629 | 3751.99 | 3813.92 | |
| DAG-B | 0.022 | 0.300 | 3.737 | 3759.99 | 3811.85 | |
| LN-LP | 0.024 | 0.351 | 3.668 | 3757.67 | 3808.57 | |
| LL-IGA | 0.023 | 0.331 | 3.813 | 3756.81 | 3807.71 | |
| N-GAM | 0.023 | 0.338 | 3.810 | 3754.90 | 3806.84 | |
| Mixture | MSS-VM | 0.022 | 0.243 | 5.330 | 3743.67 | 3801.39 |
Fig. 2, Fig. .3 indicate the trace chain has been changed to a stationary distribution during long burns. The most noticeable aspect of stationary distributions from trace plots is a usually constant mean and variance. These figures similarly display a perfect trace plot. Because the chain's core looks to be near constant mean values with very minor changes. This suggests that the chain will reach the desired (right, stationary) distribution. Kernel density charts show that the Bayesian point estimate is the average posterior mean or median, and the range of 0.77th to 95.5th percentile is the 95 % Bayesian confidence interval, also known as the credible interval. The numerical outcomes of the variables produce a graphical representation with similar results, the MCMC output (trace, ACF, and density plots), produced by STATA software. As a result, the posterior kernel density is meant to be stable and convergent across all parameters. Auto-correlation function plots for each parameter chain also reveal the dimensions of the posterior distribution that are rapidly mixing. Rapid mixing is frequently linked with a low posterior correlation among metrics. The graphs show that all parameters mix well, with autocorrelation vanishing before five delays in each case.
Fig. 2.
The trace, ACF, and density plots for Gwadar station 10m.
Fig. .3.
The trace, ACF, and density plots for Gwadar station 50m.
Fig. 4 shows the average rankings for all models of distribution that have been computed. The conclusions are as follows:
-
•
Single distributions like Beta and Log-Gamma will provide greater flexibility and ranking and outperform each of the single distributions using fewer parameters for modeling in terms of fitting.
-
•
Average (mean) rankings of mixture heterogeneous distributions are more significant. Compared to the majority of single distributions, the former cases might benefit from various distributions' fitting capabilities when there are fewer elements within mixture distribution modeling.
-
•
MSS-VM outperforms all other models. It also performs more suitably than other heterogeneous mixtures and single distribution models as well.
Fig. 4.
Twenty wind speed distribution models' average ranks on Gwadar databases.
The estimation of the wind velocity distribution of the best five simulation models at the Gwadar site is displayed in Fig. 5. The two models that have been suggested, based on Fig. 5, have the capability of modeling empirically measured wind speed distribution in both unimodal and multimodal wind regimes. According to the probability difference and quantile plots of the top five models at 10m, they are M-SSVM, KUM-N, DAG-B, Kumaraswamy, and Weibel, and at 50m, they are MSS-VM, NAK-GAM, KUM-N, L.GA, and B respectively. The maximum wind speed distributions assessed for each of these models are displayed in Fig. 6. The wind velocity distribution for the best five simulation performance models at the Gwadar site is depicted in Fig. 7. Based on the findings from this figure, two models have been recommended for their ability to effectively model both unimodal and multimodal wind regimes using empirically measured wind speed data. At a height of 10 m, the top five models identified from the quantile plots include M-SSVM, KUM-N, DAG-B, Kumaraswamy, and Weibel. These models have demonstrated robust performance in capturing the distributional characteristics of wind speeds observed at this altitude. Similarly, at a height of 50 m, the recommended models based on their performance in modeling wind speed distributions are MSS-VM, NAK-GAM, KUM-N, L.GA, and B. These models are noted for their ability to handle the variability and complex patterns exhibited by wind speeds at higher altitudes in the Gwadar region. Overall, the analysis presented in Fig. 7 underscores the suitability of these selected models for accurately representing wind velocity distributions across different heights, providing valuable insights for wind energy assessments and related applications in the region.
Fig. 5.
The Five most precise models identified wind speed probability density function distribution at Gwadar station.
Fig. 6.
Probability difference of maximum wind speed for the five most accurate models at Gwadar station.
Fig. 7.
The quantile plots of maximum wind speed for the five most effective models at Gwadar station.
The MCMC method discards 2500 first iterations. The burn-in period allows the algorithm to reach a stationary distribution, guaranteeing that the samples are pulled from the genuine posterior distribution rather than the initial values. The burn-in duration is chosen based on the specific problem and the MCMC chain's behavior. It is frequently determined by inspecting trace plots and autocorrelation plots to check that the chain has sufficiently converged before collecting samples for inference. Thinning is commonly done after the burn-in stage. For example, if the burn-in period is 2500 iterations the total number of iterations is 10,000, and you utilize 3 thinning, you would discard the first 2500 samples while keeping every third sample from the next 7500 iterations. The gamma distribution is used as a prior distribution for the estimating of parameters of the model.
The choice of Gwadar as a case study location is significant for several reasons, despite relying on data from a single measurement station. Gwadar, being a coastal city, is positioned at a key location where the interplay of land-sea interactions influences the wind patterns. Coastal regions are known for their distinct wind characteristics, which can vary from inland regions due to the proximity to large water bodies and the resulting temperature differentials. Gwadar experiences unique wind regimes, driven by both the local topography and regional climatic conditions. Its location at the junction of the Arabian Sea and the surrounding hilly terrain creates specific wind speed distributions, which may differ significantly from neighboring regions. A detailed case study of Gwadar can provide insights into the coastal wind characteristics, especially for areas with similar geographic features. Additionally, the selection of Gwadar aligns with the study's broader objectives of understanding wind energy potential in coastal regions, making this a representative choice. While data from a single station could be seen as a limitation, it is important to note that wind speed patterns tend to exhibit spatial coherence, especially in coastal regions where large-scale meteorological drivers dominate. Therefore, the station at Gwadar can serve as a reasonable representation of the local wind climate, allowing for meaningful conclusions about wind speed distributions. Future studies may include additional stations to further validate the findings.
Sensitivity analyses improve the comprehension of the reliability and durability of the MSS-VM model for predicting wind speed distributions. By examining parameter sensitivity, and input data variability, and conducting comparative assessments with other models, researchers can ensure that the MSS-VM model remains a dependable tool for wind energy assessments. This comprehensive approach not only strengthens the validity of the model's predictions but also provides stakeholders with greater confidence in making informed decisions regarding wind energy projects.
5. Discussion
The accurate characterization of wind speed distributions is superlative for optimizing the efficiency and economic viability of wind energy projects [75,76]. This study undertakes a comprehensive evaluation of different probability distributions (PDs) to identify models that best fit wind speed data from the Gwadar region. The selection process involves rigorous statistical analysis and comparison using established goodness of fit criteria, aiming to enhance the reliability of wind resource assessments. The initial step in this study involved the consideration of 20 distinct PDs, ranging from commonly used distributions such as Gamma and Normal to more specialized models like Mixture SSVM. Each distribution was evaluated based on its ability to capture the empirical characteristics of wind speed data collected at heights of 10m and 50m in Gwadar. Based on the results from these tests, clear distinctions emerged among the PDs. At 10m height, the Mixture SSVM distribution demonstrated superior performance across multiple criteria, including AIC, BIC, AD, KS, and R2. This suggests its ability to effectively model the variability and distributional characteristics of wind speeds observed at lower altitudes in Gwadar. Conversely, at 50m height, the NAK-GUM distribution emerged as the most suitable model, displaying robustness in terms of AD, KS, AIC, BIC, and overall fit indices.
The complex patterns seen in wind speed data are well captured by the MSS-VM model, especially in areas like Gwadar where the wind regime can be more versatile. In contrast with traditional single distribution models, which could find it difficult to accurately depict multimodal or skewed data, by combining different distributions, the MSS-VM mixture framework enables it to provide a more complex depiction of the underlying wind speed statistics. Such flexibility is necessary to accurately simulate wind speed and variability and is necessary for accurate estimations of energy production.
The results of this research show that the MSS-VM model performs better than other distributions at several heights, especially at 10m. Its consistent performance at different altitudes indicates that it can adjust to the dynamics of wind profiles that change as height increases. This flexibility is essential for wind energy projects because the unique wind conditions at various elevations may have significant effects on turbine performance.
5.1. Computational efficiency of Bayesian models vs. Traditional approaches
5.1.1. Bayesian models
Bayesian approaches, for instance, the Mixture SSVM (MSS-VM), offer significant versatility in simulating challenging distributions due to their ability to include prior information and modify opinions using fresh data. The Bayesian method makes use of linear models and mixtures that are better able to identify the fundamental framework of data about wind speeds. Compared to more traditional approaches. It is extremely computationally difficult to perform Bayesian inference, especially when using Markov Chain Monte Carlo (MCMC) techniques. The processor load is increased by the requirement to do computations for posterior sampling and convergence, as well as performing numerous iterations (frequently thousands). For quantifying uncertainty, Bayesian approaches offer a complete probabilistic representation of parameter uncertainty, as well as reliable interval and posterior distributions. This is especially useful for wind speed modeling, where knowing the range of potential parameter values and the uncertainty associated with their essential.
5.1.2. Traditional approaches
Traditional techniques, including maximum likelihood estimation (MLE) in the context of single distribution models, may offer limited flexibility. They frequently assume a certain distribution form (e.g., Weibull, Gamma) and are unable to dynamically adjust to the complexity of the data. These techniques may have trouble processing complicated data patterns, particularly when the data does not easily fit into established distributions while being simple and requiring minimal computing resources. Traditional approaches like MLE are typically faster due to their direct optimization of likelihood functions rather than iterative sampling. These methods can be more computationally effective for simpler data structures and models with fewer parameters. In general, computing power requirements for assessing the goodness of fit for single distributions are lower than those for performing MCMC for Bayesian models, as demonstrated by Table 3, Table 4 Traditional approaches usually yield confidence intervals and point estimates, if relevant. Although helpful, this does not provide the same level of quantified uncertainty as Bayesian techniques.
5.2. Statistical validation
Several researchers have conducted wind speed analysis employing numerous methods [54,[77], [78], [79], [80], [81]], particularly the regional and at-site wind speed frequency analyses, that have drawn a lot of interest. The choice of an appropriate probability distribution (PD) and the estimation of its parameters are two crucial components in the assessment of severe occurrences [[82], [83], [84]]. According to studies, Weibull (WEI) distribution is a relatively commonly utilized PD for wind speed modeling [[85], [86], [87], [88]]. Additionally, the Weibel (WEI) distribution, Several PDs have been used to suit the EWS data. For instance, the Generalized Pareto (GPA) distribution was suggested to fit WS in the UK and Australia [89,90]. Correspondingly, Pandey and Sutherland [91] suggested utilizing the L-moment to estimate wind speed employing the WEI and generalized extreme value (GEV) distribution. Several PDs, including the generalized logistic (GLO), GEV, GPA, lognormal 3 (LN3), exponential (EXP), log-Pearson Type III (LP3), Pearson type 3 (P3), and WEI, for estimating wind speed [92,93]. GEV, Gamma (GAM), LP3, Gumbel (GUM), Lognormal (LN2), and WEI distributions were used in numerous studies of WSFA investigations to estimate wind speed measurements in different countries [80,[94], [95], [96]].
5.3. Practical implications for wind energy applications
Wind turbines produce power Without releasing greenhouse gases or other pollutants linked to standard energy sources like natural gas or coal [97]. Wind farms can damage local wildlife and ecological habitats, however, good planning and location can limit these effects [98]. Since wind farms don't need water for cooling, they use less water than thermal power plants, which eases the strain on the region's water supplies [99]. For wind energy applications in Gwadar, precisely simulating wind speed distributions is critical for projecting energy yields and optimizing turbine performance. The observed distributions offer helpful insights into the wind speed variability and peaks in the area in addition to providing a good statistical fit for the data. In practical terms, selecting an appropriate PD has significant implications for wind energy assessments. Accurate modeling of wind speed distributions informs decisions on turbine placement, operational strategies, and energy yield predictions. The findings from this study provide actionable insights into optimizing wind farm performance and enhancing energy production efficiency. Moreover, the Bayesian framework employed in MSS-VM provides a robust means of estimating design parameters critical for infrastructure planning and development.
Although the study presents insightful information, it's important to recognize any potential drawbacks, such as the assumptions that underlie each probability distribution and the distinctive characteristics of the Gwadar wind data. The outcome of this study has important ramifications for wind energy evaluation approaches. By determining the most suitable PDs for a certain height in Gwadar, Stakeholders may choose the best sites, design their turbines, and implement operational plans to optimize energy production and profitability. Technological improvements in wind energy have real-world consequences on incorporation and efficacy. Turbine design advancements that have decreased operating costs and increased efficiency include the use of larger rotor diameters and taller towers [100]. The unpredictable nature of wind power requires advancements in grid technology and energy collection instruments [101].
5.4. Strengths and limitations
The research uses a mixture distribution model, which fits complicated wind speed data better than single distributions. Such flexibility is essential for effectively representing the features of wind speed under various conditions. The reliability of the results is increased by using various goodness-of-fit tests (KS test, AD test, chi-square, AIC, and BIC). These tests offer a comprehensive assessment of the model's performance against multiple distribution types, guaranteeing that the best models are found for various heights (e.g., 50m and 100m). The results have important implications for the generation of wind energy, especially in terms of helping choose the right wind turbines depending on local wind conditions. This may result in more effectively produced electricity and improved financial results for wind farms.
The study limited information from Gwadar, Pakistan, which might not be typical of other areas with various geographic and meteorological characteristics. It is still necessary to test the feasibility of the model in various scenarios. Mixture models can provide difficulties in interpretation and application even while they offer superior fitting for complicated distributions. The performance of the model may be negatively impacted if the selected components do not accurately match the underlying wind speed data. Using complicated models with lots of parameters carries the danger of overfitting, which can result in inadequate generalization for fresh sets of data. Thorough validation is required to guarantee that the model is robust under various circumstances. Additionally, this study is based on secondary data collected from the Pakistan metrological department, and as such, we do not have access to detailed information regarding the measurement equipment, sampling rate, or quality assurance procedures used during data collection. This lack of specific information may limit our ability to fully assess the data quality and could introduce some uncertainty in the interpretation of the findings. We recommend that future studies collect primary data or obtain more detailed documentation to address these gaps.
6. Conclusion
Appropriate distribution of maximum wind speeds assessment can better understand the wind's features and possibilities for wind energy production in an observable region. It helps in the appropriate wind turbine selection as well. In the present study, the appropriate efficacy of mixture distribution is significantly influenced by the kind and number of distribution components. A mixture of SSVM distribution is suggested and utilized to estimate wind speed distribution. Based on the performance of the suggested models at Pakistan's Gwadar station, several conclusions will be drawn.
The MLE technique was employed to estimate the PDs' parameters. Among the twenty prospective PDs, the most appropriate distributions have been determined using the KS test, AD test, chi-square, AIC, BIC test, probability density function plots, probability difference plots, and Q-Q plots. The goodness-of-fit findings show that in 100m M-SSVM, KUM-N, DAG-B, Kumaraswamy, and Weibel, and at 50 m, they are M-SSVM, NAK-GAM, KUM-N, L.GA, and B were found to be the appropriate distributions for frequency assessment.
Single distribution Several model parameters increase the model's flexibility, making it better for fitting difficult wind speed distributions. When compared to other single distributions, in 10 m Kumaraswamy, and Weibel as well as at 50m log gamma, beta distribution performed better. In general, the mixture model performs better than heterogeneous mixtures and single distributions, which, using several heterogeneous mixtures, frequently perform better than single distributions.
Compared to the mixture and other single distributions, the suggested model MSSVM fit the wind speed distributions under various conditions more accurately with the smallest value of KS (0.020), AD-based indicator (0.232 m/s), Chi square-based indicator (4.182 m/s), AIC value (4724.67) and BIC-based criterion (4743.39). The suggested model will be used in the future to calculate the distribution of maximum wind speeds at unknown sites. Furthermore, the presentations of several mixture distributions, such as Kumaraswamy, beta, Weibull, log gamma, etc., will be assessed to explain the distributions of wind speeds. The MMWS estimate derived from these distributions may have significant policy implications concerning the codification of wind loads for various structural designs. By providing a more accurate representation of wind speed variability, these estimates can inform building codes and engineering standards, ensuring that structures are designed to withstand extreme wind events. This is particularly crucial for infrastructure located in areas prone to high wind speeds, as it can help mitigate potential losses due to structural failures. Policymakers can utilize this information to update existing regulations, promoting safer design practices that enhance resilience against extreme weather conditions. Furthermore, integrating these estimates into planning frameworks can facilitate the development of more robust infrastructure, ultimately reducing economic losses and enhancing public safety in the face of climate change and increasing frequency of severe weather events.
Key massage
The study shows that the MSSVM mixture distribution model outperforms other models in fitting wind speed distributions, as indicated by superior goodness-of-fit metrics. This model accuracy at Gwadar station suggests it can effectively estimate wind speeds at new sites. Its use could enhance wind turbine selection and inform wind load regulations to better manage extreme wind conditions.
CRediT authorship contribution statement
Tasir Khan: Writing – original draft, Methodology, Conceptualization. Yejuan Wang: Writing – review & editing, Supervision, Conceptualization.
Data availability statement
All the authors of this manuscript have confirmed that the data supporting the study's findings are included in the article. The necessary data is both available and readily accessible.
Consent for publication
All the authors agree to publish this paper.
Ethical statement
All experimental procedures were authorized by the Animal Welfare and Ethics Committee at Lanzhou University (approval number LZU-201805-224).
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
We would like to express our gratitude to our esteemed reviewers, particularly Yejuan Wang, for their insightful comments and suggestions, which significantly contributed to the improvement of this paper.
Contributor Information
Tasir Khan, Email: tasirkhan.state@gmail.com.
Yejuan Wang, Email: wangyj@lzu.edu.cn.
Appendix A.
| Distributions | Abbreviations | CDF | |
|---|---|---|---|
| Gamma | GAM | ||
| Generalized Gamma |
GG | ||
| Dagum | DAG | ||
| Fatigue Life | FL | ||
| Inv. Gaussian | IGA (3P) | ||
| Weibull | WE (3p) | ||
| Beta | B | ||
| Nakagami | NAK | ||
| Johnson SU | J.SU | ||
| Kumaraswamy | KUM |
Where |
|
| Log-Gamma | LG | ||
| Kumaraswamy-normal | KUM-N | ||
| Nakagami-Gumbel | NAK-GUM | ||
| Log Pearson-generalize extreme value | LP-GEV | ||
| Dagum-Beta | DAG-B | ||
| Log normal-log Pearson | LN-LP | ||
| Log logistics- inverse gaussian | LL-IG | ||
| Normal-Gamma | N-GAM |
Appendix B.
Proof of Sine- Skewed von Mises Distribution
| (1) |
If the random variable has the pdf as given in (1), then
We define the circular moment of random variable with pdf , with as
Characterizations
We shall now derive the two characterizations of the skew von Mises distribution. We will need the following two lemmas for the characterization of the skew von Mises distribution by truncated moment.
Assumption A
Let be a continuous random variable with cdf and pdf . Assume that exists and is differentiable. We assume further that and .
Lemma 1
.
If , where is a continuous differentiable function in then
Where is determined by the condition .
proof
.
We know from elementary theory that
Differentiate to both sides
Intergrade both sides
where is determined by the condition .
Lemma 1
Under the Assumption A, if, whereis a
where is determined by the condition .
Proof
we know that
Hence, equating the RHS of both sides, we have,
Differentiate with respect to both sides
Re-arrange
integrating both sides,
This completes the proof.
Theorem 1
.
Suppose that the random variable satisfies the conditions given in Assumption A, with pdf , cdf and . Then ,
and
If and only if has the skew von Mises distribution with pdf as
and k is any real number.
Proof
.
Then
Thus,
Suppose that
Then
Hence
By Lemma 1
Integrating the above equation with respect to .
Using the condition we obtain
Theorem 2
.
Suppose that the random variable satisfies the conditions given in Assumption A, with and . Then
and
Proof
.
If the pdf of the random variable is then
Thus
Suppose that
then
Thus, by Lemma 2
On integrating the above equation with respect to , we obtain .
Using the condition we obtain
References
- 1.Du S., et al. A review of multi-scale modelling, assessment, and improvement methods of the urban thermal and wind environment. Build. Environ. 2022;213 [Google Scholar]
- 2.Miao S., et al. Determining suitable region wind speed probability distribution using optimal score-radar map. Energy conversion managment. 2019;183:590–603. [Google Scholar]
- 3.Bahrami A., et al. Technical and economic analysis of wind energy potential in Uzbekistan. J. Clean. Prod. 2019;223:801–814. [Google Scholar]
- 4.Deep S., et al. Estimation of the wind energy potential for coastal locations in India using the Weibull model. Renew. Energy. 2020;161:319–339. [Google Scholar]
- 5.Guedes K.S., et al. Performance analysis of metaheuristic optimization algorithms in estimating the parameters of several wind speed distributions. Appl. Energy. 2020;268 [Google Scholar]
- 6.Kiss P., Jánosi I.M.J.E.C. Management. Comprehensive empirical analysis of ERA-40 surface wind speed distribution over Europe. Energy Conversion and Management. 2008;49(8):2142–2151. [Google Scholar]
- 7.Campisi-Pinto S., Gianchandani K., Ashkenazy Y.J.R.E. Statistical tests for the distribution of surface wind and current speeds across the globe. Renew. Energy. 2020;149:861–876. [Google Scholar]
- 8.Ouarda T.B., et al. Review of criteria for the selection of probability distributions for wind speed data and introduction of the moment and L-moment ratio diagram methods, with a case study. Energy Convers. Manag. 2016;124:247–265. [Google Scholar]
- 9.Jung C.J.E. High spatial resolution simulation of annual wind energy yield using near-surface wind speed time series. Energies. 2016;9(5):344. [Google Scholar]
- 10.Ul Haq M.A., et al. Marshall–Olkin power Lomax distribution for modeling of wind speed data. Energy Rep. 2020;6:1118–1123. [Google Scholar]
- 11.Baran S., Szokol P., Szabó M.J.E. Truncated generalized extreme value distribution‐based ensemble model output statistics model for calibration of wind speed ensemble forecasts. Environmetrics. 2021;32(6):e2678. [Google Scholar]
- 12.Masseran N.J.E. Evaluating wind power density models and their statistical properties. Energy. 2015;84:533–541. [Google Scholar]
- 13.Shin J.-Y., Ouarda T.B., Lee T.J.R.E. Heterogeneous mixture distributions for modeling wind speed, application to the UAE. Renew. Energy. 2016;91:40–52. [Google Scholar]
- 14.Akpinar S., Akpinar E.K.J.E.C. Management. Estimation of wind energy potential using finite mixture distribution models. Energy Conversion and Management. 2009;50(4):877–884. [Google Scholar]
- 15.Chang T.P.J.A.E. Estimation of wind energy potential using different probability density functions. Appl. Energy. 2011;88(5):1848–1856. [Google Scholar]
- 16.Gugliani G., et al. Location wise comparison of mixture distributions for assessment of wind power potential: a parametric study. Int. J. Green Energy. 2017;14(9):737–753. [Google Scholar]
- 17.Alrashidi M., Rahman S., Pipattanasomporn M.J.R.E. Metaheuristic optimization algorithms to estimate statistical distribution parameters for characterizing wind speeds. Renew. Energy. 2020;149:664–681. [Google Scholar]
- 18.Kollu R., et al. Mixture probability distribution functions to model wind speed distributions. nternational Journal of energy and environmental engineering. 2012;3:1–10. [Google Scholar]
- 19.Jung C., Schindler D.J.E.C. And management. Global comparison of the goodness-of-fit of wind speed distributions. Energy Conversion and Management. 2017;133:216–234. [Google Scholar]
- 20.Ouarda T.B., Charron C.J.E.C. Management. On the mixture of wind speed distribution in a Nordic region. Energy Conversion and Management. 2018;174:33–44. [Google Scholar]
- 21.Fawad M., et al. Estimation of wind speed using regional frequency analysis based on linear‐moments. J. Climatol. 2018;38(12):4431–4444. [Google Scholar]
- 22.Goel N.K., et al. Wind quantile estimation using a pooled frequency analysis approach. 2004;92(6):509–528. [Google Scholar]
- 23.Bangert M., Hennig P., Oelfke U. 2010 Ninth International Conference on Machine Learning and Applications. IEEE; 2010. Using an infinite von Mises-Fisher mixture model to cluster treatment beam directions in external radiation therapy. [Google Scholar]
- 24.Damien P., Walker S. A full Bayesian analysis of circular data using the von Mises distribution. The Canadian Journal of Statistics/La Revue Canadienne de Statistique. 1999:291–298. [Google Scholar]
- 25.Hornik K., Grün B. On conjugate families and Jeffreys priors for von Mises–Fisher distributions. J. Stat. Plann. Inference. 2013;143(5):992–999. doi: 10.1016/j.jspi.2012.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Nunez-Antonio G., Gutiérrez-Pena E. A Bayesian analysis of directional data using the von Mises–Fisher distribution. Commun. Stat. Simulat. Comput. 2005;34(4):989–999. [Google Scholar]
- 27.Røge R.E., et al. Infinite von Mises–Fisher mixture modeling of whole brain fMRI data. Neural Comput. 2017;29(10):2712–2741. doi: 10.1162/neco_a_01000. [DOI] [PubMed] [Google Scholar]
- 28.Taghia J., Ma Z., Leijon A. Bayesian estimation of the von-Mises Fisher mixture model with variational inference. IEEE Trans. Pattern Anal. Mach. Intell. 2014;36(9):1701–1715. doi: 10.1109/TPAMI.2014.2306426. [DOI] [PubMed] [Google Scholar]
- 29.Mulder K., Jongsma P., Klugkist I. Bayesian inference for mixtures of von Mises distributions using reversible jump MCMC sampler. J. Stat. Comput. Simulat. 2020;90(9):1539–1556. [Google Scholar]
- 30.Rad N.N., Bekker A., Arashi M. 2021 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI) IEEE; 2021. Bayesian inference for skew-wrapped Cauchy mixture model using a modified Gibbs sampler. [Google Scholar]
- 31.Nakhaei Rad N., et al. Coming together of Bayesian inference and skew spherical data. Frontiers in big Data. 2022;4 doi: 10.3389/fdata.2021.769726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ozay C., Celiktas M.S. Statistical analysis of wind speed using two-parameter Weibull distribution in Alaçatı region. Energy Convers. Manag. 2016;121:49–54. [Google Scholar]
- 33.Parajuli A. A statistical analysis of wind speed and power density based on Weibull and Rayleigh models of Jumla, Nepal. Energy Power Eng. 2016;8(7):271–282. [Google Scholar]
- 34.Zambak M.F., et al. Evaluation and analysis of wind speed with the Weibull and Rayleigh distribution models for energy potential using three models. Int. J. Energy Econ. Pol. 2023;13(2):427–432. [Google Scholar]
- 35.Ding Y., et al. Probabilistic method for wind speed prediction and statistics distribution inference based on SHM data-driven. Probabilist. Eng. Mech. 2023;73 [Google Scholar]
- 36.Rehman A., Deyuan Z. Pakistan's energy scenario: a forecast of commercial energy consumption and supply from different sources through 2030. Energy, sustainability and society. 2018;8:1–5. [Google Scholar]
- 37.Chen L., et al. Streamflow forecast uncertainty evolution and its effect on real-time reservoir operation. J. Hydrol. 2016;540:712–726. [Google Scholar]
- 38.Harijan K., et al. Forecasting the diffusion of wind power in Pakistan. Energy. 2011;36(10):6068–6073. [Google Scholar]
- 39.Masseran N. Evaluating wind power density models and their statistical properties. Energy. 2015;84:533–541. [Google Scholar]
- 40.Sheikh M.M., et al. Climate profile and past climate changes in Pakistan. Global Change Impact Studies Center (GCISC)-RR-01. 2009:1–189. [Google Scholar]
- 41.Khan B., Sadeghi S. Sustainable Urban Fabric for the Emerging City of Gwadar. 2020. Vernacular modernity in balochistan. [Google Scholar]
- 42.Sohoni V., Gupta S., Nema R. A comparative analysis of wind speed probability distributions for wind power assessment of four sites. Turk. J. Electr. Eng. Comput. Sci. 2016;24(6):4724–4735. [Google Scholar]
- 43.Lins D.R., et al. Comparison of the performance of different wind speed distribution models applied to onshore and offshore wind speed data in the Northeast Brazil. Energy. 2023;278 [Google Scholar]
- 44.Alavi O., Mohammadi K., Mostafaeipour A. Evaluating the suitability of wind speed probability distribution models: a case of study of east and southeast parts of Iran. Energy Convers. Manag. 2016;119:101–108. [Google Scholar]
- 45.Jung C., Schindler D. Global comparison of the goodness-of-fit of wind speed distributions. Energy Convers. Manag. 2017;133:216–234. [Google Scholar]
- 46.Natarajan N., Vasudevan M., Rehman S. Evaluation of suitability of wind speed probability distribution models: a case study from Tamil Nadu, India. Environ. Sci. Pollut. Control Ser. 2022;29(57):85855–85868. doi: 10.1007/s11356-021-14315-5. [DOI] [PubMed] [Google Scholar]
- 47.Filom S., et al. Exploring wind energy potential as a driver of sustainable development in the southern coasts of Iran: the importance of wind speed statistical distribution model. Sustainability. 2021;13(14):7702. [Google Scholar]
- 48.Bagci K., Arslan T., Celik H.E. Inverted Kumarswamy distribution for modeling the wind speed data: lake Van, Turkey. Renew. Sustain. Energy Rev. 2021;135 [Google Scholar]
- 49.Shin J.-Y., Ouarda T.B., Lee T. Heterogeneous mixture distributions for modeling wind speed, application to the UAE. Renew. Energy. 2016;91:40–52. [Google Scholar]
- 50.Özkan R., Sen F., Balli S. Evaluation of wind loads and the potential of Turkey's south west region by using log-normal and gamma distributions. Wind Struct. 2020;31(4):299–309. [Google Scholar]
- 51.Aljeddani S.M., Mohammed M. An extensive mathematical approach for wind speed evaluation using inverse Weibull distribution. Alex. Eng. J. 2023;76:775–786. [Google Scholar]
- 52.Gugliani G., et al. New methods to assess wind resources in terms of wind speed,load,power and direction. Renew. Energy. 2018;129:168–182. [Google Scholar]
- 53.Yılmaz V., Çelik H.E. A statistical approach to estimate the wind speed distribution: the case of Gelibolu region. Doğuş Üniversitesi Dergisi. 2008;9(1):122–132. [Google Scholar]
- 54.Khan T., et al. Comparison approach for wind resource assessment to determine the most precise approach. Energy Environ. 2024;35(3):1315–1338. [Google Scholar]
- 55.Efron B. RA Fisher in the 21st century. Stat. Sci. 1998:95–114. [Google Scholar]
- 56.Rocha P.A.C., et al. Comparison of seven numerical methods for determining Weibull parameters for wind energy generation in the northeast region of Brazil. Appl. Energy. 2012;89(1):395–400. [Google Scholar]
- 57.Rad N.N., Bekker A., Arashi M. Enhancing wind direction prediction of South Africa wind energy hotspots with Bayesian mixture modeling. Sci. Rep. 2022;12(1) doi: 10.1038/s41598-022-14383-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Hassanzadeh F. A smoothing spline model for multimodal and skewed circular responses: applications in meteorology and oceanography. Environmetrics. 2021;32(2):e2655. [Google Scholar]
- 59.Hernández-Sánchez E., Scarpa B. A wrapped flexible generalized skew-normal model for a bimodal circular distribution of wind directions. Chilean Journal of Statistics (ChJS) 2012;3(2) [Google Scholar]
- 60.Abe T., Pewsey A.J.S.P. Sine-skewed circular distributions. Stat. Pap. 2011;52:683–707. [Google Scholar]
- 61.Storn R., Price K. Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. J. Global Optim. 1997;11:341–359. [Google Scholar]
- 62.Mullen K., et al. DEoptim: an R package for global optimization by differential evolution. J. Stat. Software. 2011;40(6):1–26. [Google Scholar]
- 63.Rajabi M., Modarres R.J. Extreme value frequency analysis of wind data from Isfahan, Iran. J. Wind Eng. Ind. Aerod. 2008;96(1):78–82. [Google Scholar]
- 64.Ouarda T.B., et al. Review of criteria for the selection of probability distributions for wind speed data and introduction of the moment and L-moment ratio diagram methods, with a case study. Energy Conversion and managmwnt. 2016;124:247–265. [Google Scholar]
- 65.Ouarda T.B., et al. Probability distributions of wind speed in the UAE. Energy Convers. Manag. 2015;93:414–434. [Google Scholar]
- 66.Dhunny A.Z., et al. Statistical modelling of wind speed data for Mauritius. Int. J. Renew. Energy Resour. 2014;4(4):1056–1064. [Google Scholar]
- 67.Lawan S., et al. Statistical modelling of long-term wind speed data. American Journal of Computer and Information Technology. 2015;3(1):1–6. [Google Scholar]
- 68.Jung C., Schindler D. Wind speed distribution selection–A review of recent development and progress. Renew. Sustain. Energy Rev. 2019;114 [Google Scholar]
- 69.Huang K., et al. vol. 565. 2018. pp. 341–353. (Flood Hydrograph Coincidence Analysis for Mainstream and its Tributaries). [Google Scholar]
- 70.Rad N.N., Bekker A., Arashi M.J.S.R. Enhancing wind direction prediction of South Africa wind energy hotspots with Bayesian mixture modeling. Sci. Rep. 2022;12(1) doi: 10.1038/s41598-022-14383-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Ljung G.M., Box G.E.J.B. On a measure of lack of fit in time series models. Biometrika. 1978;65(2):297–303. [Google Scholar]
- 72.Mann H.B., Whitney D.R. On a test of whether one of two random variables is stochastically larger than the other. Ann. Math. Stat. 1947:50–60. [Google Scholar]
- 73.Box G.E., Pierce D.A. Distribution of residual autocorrelations in autoregressive-integrated moving average time series models. J. Am. Stat. Assoc. 1970;65(332):1509–1526. [Google Scholar]
- 74.Safi S. A.A.J.S.J.o.A.M. Al-Reqep, and Statistics. comparative study of portmanteau tests for the residuals autocorrelation in ARMA models. Science Journal of Applied Mathematics and Statistics. 2014;2(1):1–13. [Google Scholar]
- 75.Saeed M.A., Ahmed Z., Zhang W. Wind energy potential and economic analysis with a comparison of different methods for determining the optimal distribution parameters. Renew. Energy. 2020;161:1092–1109. [Google Scholar]
- 76.Guedes K.S., et al. Performance analysis of metaheuristic optimization algorithms in estimating the parameters of several wind speed distributions. Appl. Energy. 2020;268 [Google Scholar]
- 77.Rajabi M., Modarres R. Extreme value frequency analysis of wind data from Isfahan, Iran. J. Wind Eng. Ind. Aerod. 2008;96(1):78–82. [Google Scholar]
- 78.Lobeto H., Menendez M., Losada I.J. Future behavior of wind wave extremes due to climate change. Sci. Rep. 2021;11(1):7869. doi: 10.1038/s41598-021-86524-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Chiodo E., et al. A review on wind speed extreme values modeling and bayes estimation for wind power plant design and construction. Energies. 2023;16(14):5456. [Google Scholar]
- 80.Yu I., Kim J., Jeong S. Development of probability wind speed map based on frequency analysis. Spatial Information Research. 2016;24:577–587. [Google Scholar]
- 81.Fawad M., et al. Estimation of wind speed using regional frequency analysis based on linear-moments. Int. J. Climatol. 2018;38(12):4431–4444. [Google Scholar]
- 82.Kang S., et al. Wind resource assessment and potential development of wind farms along the entire coast of South Korea using public data from the Korea meteorological administration. J. Clean. Prod. 2023;430 [Google Scholar]
- 83.Xianga C., et al. Research on the probability model of basic wind speed estimation in China. Wind Struct. 2021;32(6):587–596. [Google Scholar]
- 84.Chen L., Singh V.P., Guo S. Measure of correlation between river flows using the copula-entropy method. J. Hydrol. Eng. 2013;18(12):1591–1606. [Google Scholar]
- 85.Stevens D.E. Vorticity, momentum and divergence budgets of synoptic-scale wave disturbances in the tropical eastern Atlantic. Mon. Weather Rev. 1979;107(5):535–550. [Google Scholar]
- 86.Rehman S., Halawani T., Husain T. Weibull parameters for wind speed distribution in Saudi Arabia. Sol. Energy. 1994;53(6):473–479. [Google Scholar]
- 87.Celik A.N. Energy output estimation for small-scale wind power generators using Weibull-representative wind data. J. Wind Eng. Ind. Aerod. 2003;91(5):693–707. [Google Scholar]
- 88.Toure S. Investigations on the Eigen‐coordinates method for the 2‐parameter weibull distribution of wind speed. Renew. Energy. 2005;30(4):511–521. [Google Scholar]
- 89.Holmes J., Moriarty W. Application of the generalized Pareto distribution to extreme value analysis in wind engineering. J. Wind Eng. Ind. Aerod. 1999;83(1–3):1–10. [Google Scholar]
- 90.Harris I. Generalised Pareto methods for wind extremes. Useful tool or mathematical mirage? J. Wind Eng. Ind. Aerod. 2005;93(5):341–360. [Google Scholar]
- 91.Pandey M., Sutherland H. Probabilistic analysis of list data for the estimation of extreme design loads for wind turbine components. J. Sol. Energy Eng. 2003;125(4):531–540. [Google Scholar]
- 92.Fawad M., et al. Multiparameter probability distributions for at-site frequency analysis of annual maximum wind speed with L-moments for parameter estimation. Energy. 2019;181:724–737. [Google Scholar]
- 93.Alcantara A.L., Ahn K.-H. Probability distribution and characterization of daily precipitation related to tropical cyclones over the Korean Peninsula. Water. 2020;12(4):1214. [Google Scholar]
- 94.Hong H., Ye W. Estimating extreme wind speed based on regional frequency analysis. Struct. Saf. 2014;47:67–77. [Google Scholar]
- 95.Zaharim A., et al. Fitting of statistical distributions to wind speed data in Malaysia. Eur. J. Sci. Res. 2009;26(1):6–12. [Google Scholar]
- 96.Chen L., et al. An objective method for partitioning the entire flood season into multiple sub-seasons. J. Hydrol. 2015;528:621–630. [Google Scholar]
- 97.Rahman A., Farrok O., Haque M.M. Environmental impact of renewable energy source based electrical power plants: solar, wind, hydroelectric, biomass, geothermal, tidal, ocean, and osmotic. Renew. Sustain. Energy Rev. 2022;161 [Google Scholar]
- 98.Danovaro R., et al. Making eco-sustainable floating offshore wind farms: siting, mitigations, and compensations. Renew. Sustain. Energy Rev. 2024;197 [Google Scholar]
- 99.Karim S., Faissal A. Handbook of Algal Biofuels. Elsevier; 2022. National renewable energy laboratory; pp. 599–613. [Google Scholar]
- 100.Elia A., et al. Wind turbine cost reduction: a detailed bottom-up analysis of innovation drivers. Energy Pol. 2020;147 [Google Scholar]
- 101.Worku M.Y. Recent advances in energy storage systems for renewable source grid integration: a comprehensive review. Sustainability. 2022;14(10):5985. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
All the authors of this manuscript have confirmed that the data supporting the study's findings are included in the article. The necessary data is both available and readily accessible.







