Abstract
We carry out a time series analysis on the yearly crop yield data in six east African countries (Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda) using the autoregressive integrated moving average (ARIMA) model. We describe the upper tail of the yearly crop yield data in those countries using the power law, lognormal, Fréchet and stretched exponential distributions. The forecast of the fitted ARIMA models suggests that the majority of the crops in different countries will experience neither an increase nor a decrease in yield from 2019 to 2028. A few exceptional cases correspond to significant increase in the yield of sorghum and coffee in Burundi and Rwanda, respectively, and significant decrease in the yield of beans in Burundi, Kenya and Rwanda. Based on Vuong’s similarity test p–value, we find that the power law distribution captured the upper tails of yield distribution better than other distributions with just one exceptional case in Uganda, suggesting that these crops have the tendency for producing high yield. We find that only sugar cane in Somalia and sweet potato in Tanzania have the potential of producing extremely high yield. We describe the yield behaviour of these two crops as black swan, where the “rich getting richer” or the “preferential attachment” could be the underlying generating process. Other crops in Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda can only produce high but not extremely high yields. Various climate adaptation/smart strategies (use of short-duration pigeon pea varieties, use of cassava mosaic disease resistant cassava varieties, use of improved maize varieties, intensive manuring with a combination of green and poultry manure, early planting, etc) that could be adapted to increase yields in east Africa are suggested. The paper could be useful for future agricultural planning and rates calibration in crop risk insurance.
1 Introduction
Africa is the poorest continent. It is struggling to feed its people. Hence, enhancement of crop production is important.
Furthermore, farmers are more interested in investing in crops that are capable of producing high yields not crops that can produce extremely low yield. They want to maximize the profit on their investment. Crops that have the potential for high yield are likely to attract low premium in crop yield insurance.
There have been several papers on high crop yield in African countries. While discussing nutrients in the west African Sudano-Sahelian zone, [1] noted that “shrubs and trees with their alternating periods of nutrient storing and recycling in leaves and wood, micro-depressions, termite mounts and ant nests become localised points of nutrient concentration and high crop productivity”. While investigating the importance of liming acid soils, [2] demonstrated that “severely acidified soils of the western highlands of Cameroon should be limed at moderate rates to sustain crop productivity”. While examining the seed supply system for maize production in southwestern Nigeria, [3] observed that “about 39% of farmers used improved varieties for high crop yields, 24% for disease resistance and 22% for market preferences, whereas local varieties were cultivated by 37% of farmers because of market preferences and availability, 16% because of low cost and 12% because of disease resistance”. [4] demonstrated that continuous-flow drip irrigation in Bauchi state of Nigeria delivers “high crop yields especially if the crops are grown under appropriate agronomic practices that enable protraction of the growth season”. [5] demonstrated that high maize yields on sandy soils in Zimbabwe can be achieved by using mineral fertilizers. According to [6], among many oilseed crops (for example, sunflower, soybeans, rapeseed/mustard, sesame, groundnuts, etc) grown in Kenya, oilseed rape is preferred because of its high yields (1.5 tons—4.0 tons / hectare) with high oil content of 42–46%. While comparing three fertigation strategies of grapes in the Berg River Valley region of South Africa, [7] found that “less berry crack contributed to a higher yield and higher export percentage of grapes”. While analysing the benefits of soil conservation in the Kondoa eroded area of Tanzania by conducting a household survey of 240 households, [8] observed that 56% of the respondents gained high crop yields. [9] investigated limited nitrogen content, a major challenge to sustainable and high crop production, for agricultural soils of lower eastern Kenya. While evaluating small holder farmers’ preferences for climate smart agricultural practices in Tehuledere district, northeastern Ethiopia, [10] found that “high and moderate climate resilience and high crop yield agricultural practices had a positive utility”. [11] demonstrated that phosphorus treatment for rice fields in lowlands in the central highlands of Madagascar significantly and consistently accelerated initial production with high crop growth rate and shortened days to heading. According to [12], “rain fed agriculture has a high crop yield potential if rainfall and soil nutrient input resources are utilized effectively”.
But none of these papers discuss the distribution of crop yield or forecasts. The distributions of crop yields is very useful in agribusiness. These distributions can help to tackle food shortages and insecurity by understanding how natural resources and farmers attitude towards crops selection and cultivation can control agricultural productivity, in agricultural policy assessment and to calibrate rates and premiums in crop insurance. Similarly, understanding the trend of crop yield and the insights gained from crop yield predictions can go a long way in helping to address the current global issue of increase in food prices and demand as well as to understand the associated risk of food production by helping farmers to make informed decisions especially on what and where to grow.
We are also not aware of any previous research that has focused on predicting crop yield in east Africa let alone doing so in such an almost holistic manner as we have done in this paper; so, to bridge this research gap, we follow [13] to provide some crop yield forecast in some east African countries. We believe that the results herein will be of extreme importance to east African regional farmers.
The aim of this paper is two folded. First, to forecast the crop yield and secondly to identify cash crops that are capable of producing extremely high yield in some east African countries by modelling the tail region of crop yield data. The remainder of this paper contains data in Section 2, methods in Section 3, results and discussion in Section 4 and conclusions in Section 5.
We use two methods for analyzing the data: time series analysis and fit of heavy tailed distributions. Time series analysis and forecasting is a branch of statistics. Time series forecasting uses models to predict future outcomes based on past observations. With time series visualizations, trends and seasonal patterns could be identified. We could then seek to gain deeper insight as regards to the reason behind these trends. Several time series models have been developed, studied and widely applied in many fields. Box-Jenkins’ auto-regressive integrated moving average (ARIMA) model [14] arguably stands out among others as the most widely used perhaps due to its simplistic application appeal and high precision in modelling. For instance, [15] used the ARIMA model to forecast rice production, consumption, importation, exportation and self-sufficiency in the Benin Republic. [16] used the ARIMA model to forecast the consumption of some livestock products such as eggs, milk, chicken and cow meat to see if the forecast of consumption was on the increase. [17] highlighted that the past century has witnessed significant rise and fall of cocoa production in Nigeria due to diverse institutional and climate changes. They used the ARIMA model to predict cocoa production in Nigeria between 2018 and 2025. Their forecast showed a decreasing trend where cocoa production is expected to fall by more than 20% in 2025 against the 2017 value. [18] used the ARIMA model to forecast maize production in India from 2018 to 2022. The model predicted about 13.76% increase in maize production in India. [19] used the ARIMA model to forecast soybean yield in Zambia. The forecast suggested 23430.3 hectogram / hectare yield increase in 2020 compared to the 2016 figure of 19624 hectogram / hectare. [20] used the ARIMA model to forecast Kharif rice production in West Bengal, India which contributes about 15% of the total paddy in India. [21] used the ARIMA model to forecast sorghum production in South Africa from 2017 to 2020. Their forecast depicted an increasing trend. [22] used the ARIMA model to forecast sugar cane production in Pakistan from 2019 to 2030. Their forecast indicated a significant increase.
Quantifying the tail of the crop yield distribution is vital for managing agricultural production risk and rating crop insurance [23]. The simplest and the most widely used distribution for modelling rare outcomes occurring in the tail region is the power law distribution. Many processes follow the power law over large magnitude of values. Recent examples are the distribution of stock returns [24], income [25, 26], wealth of world billionaires [27], persisters-antibiotic-tolerant cells [28], duration size of unhealthy air pollution events [29], tourism recommendations [30], cumulative coal production [31], agricultural land size [32], rates of wetland loss [18], union size [33], strike size [34] and growth rate of CO2 [35]. Popular alternatives to the power law distribution are the lognormal, stretched exponential, and Fréchet distributions.
2 Data
Yearly data from 1961 to 2018 on the yield of cash crops like banana, plantain, beans, cassava, coffee, sorghum, potato, sweet potato, maize, rice, sugar cane, wheat, millet and cotton seed from six countries in east Africa (namely, Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda) were obtained from Food and Agriculture Organization of United Nations-FAO, see http://www.fao.org/faostat/en/#home. The data obtained were yields aggregated at national levels.
The time plots of the crops in different countries are shown in Figs 1 and 2. Some sudden changes, particularly big drops and falls could be seen at different times indicating periods of high and low yields. These changes could be as a result of the global economic outlook, environmental/climate changes or even changes in farming practices.
Fig 1. Time series plots for crop yield in different countries.
Fig 2. Time series plots for crop yield in different countries.
Some descriptive statistics of the data for crops are presented in Table 1. The statistics include the mean, median, standard deviation, minimum, maximum, skewness and kurtosis. The discrepancy between the mean and the median values appears not to be large for almost all the crops across the countries. The mean is larger than the standard deviation for all the crops across the countries. This suggests that the data are underdispersed. Note that underdispersion could be as a result of serial correlation which is typical of time series data. We can remove serial correlation by random variable transformation. But, this may lead to (a) loss of data information and (b) limits us to specific class of models to use. The data exhibit varying degrees of skewness and kurtosis across crops and countries. The lowest (highest) positive skewness of 0.0261 (3.1150) corresponds to maize (sweet potato) in Kenya (Burundi). The lowest (highest) negative skewness of -0.0934 (-2.0220) corresponds to rice (cassava) in Kenya (Burundi). The lowest (highest) positive kurtosis of 0.0410 (13.0916) corresponds to sorghum (banana) in Rwanda (Burundi). The lowest (highest) negative kurtosis of -0.0043 (-1.0478) corresponds to coffee (maize) in Uganda (Kenya). Crop yield skewness has been used to characterize crop yield tendencies. [36] reported that crop yield is positively skewed in the presence of independent, identical and uniform resource availability distribution. Crop yield is negatively skewed whenever the distributions are Gaussian, i.e. skewness depends on asymmetries in resource availabilities, meaning that a negatively skewed yield occurs whenever production is tightly controlled so that the left tails of some resources availabilities distributions are thin [36]. However, in addition to the observable similarities between the mean and the median crop yield values, we notice that for majority of the cases, the skewness and kurtosis values are close to zero, suggesting possible symmetry and mesokurtosis.
Table 1. Descriptive measures for the crop yield data sets.
Country | Crop | Min. | 1st qu. | Median | Mean | 3rd qu. | Max. | Std. dev. | Skewness | Kurtosis |
---|---|---|---|---|---|---|---|---|---|---|
Burundi | Banana | 46915 | 54433 | 56947 | 60700 | 62984 | 127352 | 12204.970 | 3.0704 | 13.0916 |
Beans | 6044 | 9067 | 10036 | 9599 | 10380 | 13184 | 1375.755 | -0.8491 | 0.9814 | |
Cassava | 41867 | 85880 | 89890 | 85078 | 90894 | 112378 | 12870.850 | -2.0220 | 4.2142 | |
Coffee | 2687 | 6961 | 8251 | 8100 | 9379 | 11598 | 1769.862 | -0.5861 | 0.1126 | |
Sorghum | 5890 | 9837 | 10000 | 10426 | 11881 | 14042 | 1886.384 | -0.3327 | -0.1633 | |
Sweet potato | 59048 | 63302 | 63837 | 68215 | 65848 | 133015 | 13063.090 | 3.1150 | 10.0345 | |
Kenya | Beans | 3127 | 4783 | 5556 | 5413 | 6122 | 8382 | 1034.896 | 0.0585 | 0.3111 |
Coffee | 9212 | 14918 | 16839 | 18293 | 21078 | 31991 | 5003.894 | 0.7453 | 0.2387 | |
Maize | 10713 | 12957 | 15813 | 15507 | 17266 | 20712 | 2605.723 | 0.0261 | -1.0478 | |
Rice | 13076 | 34802 | 39621 | 39988 | 46510 | 61813 | 10166.130 | -0.0934 | -0.0593 | |
Sugar Cane | 297552 | 689809 | 808548 | 774305 | 885805 | 1211845 | 217545.200 | -0.4537 | -0.0124 | |
Wheat | 9212 | 14918 | 16839 | 18293 | 21078 | 31991 | 5003.894 | 0.7453 | 0.2387 | |
Somalia | Banana | 88430 | 169641 | 170374 | 185025 | 198616 | 317500 | 46006.370 | 0.7969 | 0.7965 |
Maize | 4149 | 8173 | 9758 | 10065 | 11765 | 17901 | 3029.413 | 0.5619 | -0.1600 | |
Sorghum | 2040 | 3320 | 3522 | 3992 | 4280 | 9824 | 1306.699 | 2.2982 | 6.9370 | |
Sugar Cane | 272727 | 350000 | 407143 | 604201 | 887500 | 1455975 | 346839.200 | 1.0554 | -0.2674 | |
Tanzania | Maize | 4808 | 9170 | 12722 | 12734 | 14414 | 31359 | 4893.517 | 1.3077 | 2.9604 |
Millet | 4522 | 7010 | 8308 | 8697 | 9982 | 19507 | 2401.514 | 1.5969 | 5.1457 | |
Rice | 7143 | 12826 | 16286 | 16396 | 19172 | 27382 | 4933.184 | 0.2870 | -0.7743 | |
Cotton Seed | 2328 | 4356 | 5136 | 5117 | 5783 | 7936 | 1245.132 | 0.2257 | -0.4752 | |
Sorghum | 4423 | 6554 | 9151 | 8832 | 10100 | 17963 | 2717.268 | 0.4293 | 0.5903 | |
Sweet potato | 10448 | 18029 | 29252 | 34621 | 49412 | 72759 | 19284.670 | 0.5541 | -1.0485 | |
Uganda | Banana | 23298 | 39412 | 42070 | 40891 | 44927 | 48333 | 5636.359 | -1.3614 | 1.613 |
Cassava | 32973 | 44771 | 66988 | 71711 | 89971 | 144083 | 32354.340 | 0.7664 | -0.5467 | |
Coffee | 3839 | 5443 | 6131 | 6402 | 7150 | 10283 | 1408.237 | 0.6347 | -0.0043 | |
Millet | 8092 | 11486 | 14017 | 13341 | 15986 | 16751 | 2637.066 | -0.6120 | -0.945 | |
Plantain | 42971 | 52141 | 56585 | 59014 | 60867 | 84235 | 11223.650 | 0.8573 | -0.0373 | |
Sweet potato | 24009 | 35504 | 41558 | 40660 | 44017 | 62075 | 6398.938 | 0.3221 | 1.5048 | |
Rwanda | Beans | 5606 | 6980 | 8020 | 7858 | 8522 | 10258 | 1037.677 | -0.0566 | -0.8005 |
Cassava | 11778 | 55212 | 91644 | 82820 | 116873 | 164000 | 37331.23 | -0.2683 | -1.0465 | |
Coffee | 2678 | 5267 | 5994 | 6051 | 6776 | 11019 | 1473.642 | 0.6369 | 1.6200 | |
Potato | 22821 | 64313 | 68656 | 71994 | 82871 | 130600 | 20435.600 | 0.2417 | 0.7926 | |
Sorghum | 6850 | 10014 | 11000 | 11109 | 12086 | 15084 | 1754.313 | 0.2655 | 0.0410 | |
Sweet potato | 34388 | 53682 | 62550 | 63978 | 75039 | 96163 | 13269.530 | 0.0977 | -0.4860 |
Figs 3 and 4 show boxplots to support the descriptive statistics in Table 1 and to compare the yield performance of some of the crops that are produced in more than one east African country. We see that Somalia recorded the highest banana and sugar cane yields. Burundi recorded the highest beans, coffee and sweet potato yields. Rwanda recorded the highest cassava yield. Kenya recorded the highest rice yield. Tanzania recorded the highest sorghum, maize and millet yields. Also, evident enough in Figs 3 and 4 are the presence of extreme (high and low) yields for some of the crops which are indicated by observations lying outside of the whiskers in the box plots. The power law distribution discussed later is especially useful for modelling unusually high yields.
Fig 3. Box plots for crop yield in different countries.
Fig 4. Box plots for crop yield in different countries.
We tested heavytailedness of the each data set using [37]’s test based on Kolmogorov-Smirnov statistic corrected for correlation [38]. The p–values of this test for banana, beans, cassava, coffee, sorghum and sweet potato in Burundi were 0.182, 0.0664, 0.151, 0.102, 0.156 and 0.115, respectively. The p–values for the crops in Kenya were 0.162, 0.171, 0.059, 0.120, 0.166 and 0.145. The p–values for the crops in Somalia were 0.167, 0.157, 0.098 and 0.115. The p–values for the crops in Tanzania were 0.168, 0.112, 0.095, 0.068, 0.096 and 0.125. The p–values for the crops in Uganda were 0.177, 0.114, 0.087, 0.171, 0.105 and 0.077. The p–values for the crops in Rwanda were 0.075, 0.169, 0.068, 0.098, 0.061 and 0.158. The p–values reported show that there is no significant evidence against the fact that each data has a heavy tail. Hence, unusually high yields can be modeled by heavy tailed distributions as done in Section 4.
3 Methods
3.1 Time series analysis of crop yields
One possible technique for time series analysis is to assume that the overall mean is either constantly increasing or constantly decreasing with respect to time. In this case, the fit of a sloping line might be appropriate for the time series. This type of line is typically referred to as a linear trend model or a trend-line model and it is a special case of a simple linear regression model with time index t as the only predictor variable, i.e. t = 1, 2, 3, …. The estimated trend line is the line that minimizes the sum of the squared vertical deviations from the data. Trend lines serve as important visual aids. However, they often perform poorly in forecasting beyond the historical data. In practice, majority of the time series data that arise in different areas cannot be described by some straight lines because their trends often undergo evolution. Given the past observations, the trend-line model attempts to find the intercept and slope that give the best average fit to the data. Unfortunately, the deviation of the linear trend model from the data is usually greatest at the end of the time series where the forecasting starts. Therefore, in time series analysis and forecasting, the important question ‘what is the appropriate model?’ can first be addressed by visually inspecting the time series data for any constantly changing trend or randomly changing trend. Based on Figs 1 and 2, we see that assuming a steady upward or downward linear trend for any of the crop yield data is apparently illogical and out of place because a randomly changing trend is overwhelmingly evident for all the time series data. To model the nonlinear trend in all the time series, we may need to regress the time series on second or higher order terms of t and this may require some trial and errors which may possibly lead to some overestimated or underestimated models. To circumvent the issue of model selection, we consider the most reliable models for nonlinear trends in time series and they are referred to as stochastic time-series models. Examples of such models are the one proposed by [14] which involve straightforward laid down iterative procedures for model fitting unlike the nonlinear regression method mentioned earlier.
In this section, we carry out a time series analysis to study the yield pattern of crops over a specified period of time. We need to isolate first the impact of trends (the overall pattern in the series) and second the impact of random disturbances (the vigorous wiggles in the series). The impact of trends could be due to planting strategies and techniques, advanced mechanized farming, farm management, irrigation, the use of fertilizers and genetically improved seedlings/crops. The impact of random disturbances could be due to pandemics, crop disease outbreaks, wars, recessions, environmental degradations (for example, erosion) and extreme weather conditions such as droughts and floods.
Let xt denote the observed yield of a crop at time t. Suppose we denote all the observed information up to time t by . We are interested in forecasting xt. We can specify the forecast as or more specifically as . The forecast of xt+h given all previous observations up to time t (x1, x2, …, xt) is known as the h–step forecast. The h–step forecasting method can be easily implemented through the famous Box-Jenkins autoregressive integrated moving average (ARIMA) modelling framework. ARIMA models are used for trend analysis and forecasting. The ARIMA (p, d, q) model is defined by
where ϕ’s are the autoregressive (AR) parts of the model, θ’s are the moving average (MA) parts of the model, d is the order of difference, B is known as the backshift operator, c is a constant which is equal to μ(1 − ϕ1 − ⋯ − ϕp), μ is the mean of the dth differenced series (1 − B)dxt and ξt is white noise. ξt are generally assumed to be independent, identically distributed variables sampled from a normal distribution with zero mean. In ARIMA modelling, we make the following assumptions about the time series: there are no seasonality or cyclical trends, there are no outliers, and that the variation about the mean is consistent. After fitting the ARIMA model, we can check the model adequacy viz-a-viz a popular portmanteau test called Ljung-Box test by simply testing whether the residuals from the fitted model are white noise. For Ljung–Box test, we test the hypothesis H0: ρk = 0 versus H1: ρk ≠ 0. The test statistic of Ljung-Box test is
where n is the sample size, is the sample autocorrelation at lag k, and h is the number of lags being tested. Under H0, the statistic Q⋆ is asymptotically chi-square distributed with h degrees of freedom. At α significance level, the critical region for rejecting the hypothesis of randomness is , where denotes the (1 − α)th quantile of the chi-squared distribution with h degrees of freedom.
A detailed discussion of Box-Jenkins ARIMA (p, d, q) model could be read from [39] and [40]. In Figs 1 and 2, we find some evidence of changing variance in some of the series. Each series appears clearly non-stationary as the series wanders up and down. Before proceeding with the data analysis, we ensured that the variance for each series is stabilized by the Box-Cox transformation [41].
The Box Cox transformation involves an exponent, λ ∈ [−5, 5]. In this paper, all values of λ are considered but the optimal value for each data is applied. The optimal value of λ is the one that gives the best approximation of the Gaussian distribution. The transformation of xt has the form:
(1) |
The formula in (1) is not as simple as it appears because testing for all possible values one by one is unnecessarily time consuming. However, most software packages include an option for a Box-Cox transformation. In this paper, we used the function in the package in the ( Core Team, 2022) software to fit the ARIMA (p, d, q) models. Setting the argument to allows a transformation to be automatically selected and implemented using the Box-Cox method. The routinely transformed data are then coerced into stationarity by implementing first or second order differences whenever there is any need to do so before estimating the appropriate model.
Each coerced series was tested for stationarity using [42]’s test. The null hypothesis was that the series is stationary. The p–values for banana, beans, cassava, coffee, sorghum and sweet potato in Burundi were 0.085, 0.085, 0.089, 0.095, 0.075 add 0.083, respectively. The p–values for the crops in Kenya were 0.057, 0.081, 0.051, 0.069, 0.086 and 0.056. The p–values for the crops in Somalia were 0.098, 0.078, 0.089 and 0.083. The p–values for the crops in Tanzania were 0.067, 0.078, 0.082, 0.083, 0.081 and 0.052. The p–values for the crops in Uganda were 0.094, 0.086, 0.095, 0.051, 0.092 and 0.090. The p–values for the crops in Rwanda were 0.098, 0.099, 0.053, 0.073, 0.090 and 0.080.
3.2 Analysis of the maximum crop yields
Suppose we denote the crop yield random variable by X with realizations xi, i = 1, 2, …, n, where n represents the number of observations. For the convenience of fitting distributions to the available data, we assume that the xi are random. The assumption of independence is not technically correct as the data are actually serially correlated. But ignoring dependence in a data set and treating the data as being independent has no effect on parameter estimates, it only affects standard errors (see, for example, [43]). Hence, the results presented later on the fit of heavy tailed distributions are correct as accuracy of estimation is not taken into account.
The probability density functions (PDFs) of the fitted heavy tailed distributions are
- The power law distribution also known as Pareto distribution of type I [44] specified by the PDF
for x ≥ xmin > 0, where xmin is the lower bound and α > 0 is the exponent. At or above xmin, the distribution exhibits properties of a power law distribution. - The lognormal distribution specified by the PDF
for x > 0, where −∞ < a < ∞ and b > 0 are the location and scale parameters, respectively. - The stretched exponential distribution specified by the PDF
for x > 0, where a > 0 is the scale parameter and b > 0 is the shape parameter. - Fréchet distribution [45] specified by the PDF
for x > 0, where a > 0 is the scale parameter and b > 0 is the shape parameter.
We estimated the parameters of all the distributions by the method of maximum likelihood through the optim routine in R [46]. We estimated xmin in the power law distribution by following the method in [47]. That is, we chose xmin that minimized
where Fn(x) and denote, respectively, the empirical and fitted power law distribution functions for x ≥ xmin.
We have used the method of maximum likelihood because of its popularity. There are other methods for estimation; in particular, for estimating α of the power law distribution. Some of these estimators include the rank estimator due to [48], [49]’s estimator and the median estimator due to [50].
Note that each of the four distributions has two free parameters. So, no one distribution is more flexible than the others in terms of the number of parameters. Unlike the power law distribution, the lognormal, Fréchet and stretched exponential distributions model the entire data. We can compare their fits by the following goodness-of-fit measures:
- Bayes information criterion (BIC) due to [51] defined by
- Akaike information criterion with a correction (AICc) due to [52] defined by
where and k denote, respectively, the maximized log likelihood value and the number of unknown parameters.
We can also compare all of the fitted distributions through the Kolmogorov–Smirnov test. Its statistic is given by
which was corrected as in [38] to account for correlation in the data. The larger the value of the corresponding KS p–value the better the fitted distribution. We require the p–value of the Kolmogorov–Smirnov test to be greater than 0.05 to conclude that the distribution is a reasonable model for the data. A p–value less than 0.05 suggests an absolute rejection of the distribution as a candidate for the data. However, one major drawback of the Kolmogorov–Smirnov p–value is that it depends on fixed parameters, hence it does not reflect sampling variability. We can calculate more conservative p–values by a bootstrapping method in [47]. We implemented this method by using 5000 bootstrap replications to obtain the final p–value for the Kolmogorov–Smirnov test. In this paper, we shall use the non-bootstrapped KS p–value to verify the plausibility of each distribution as a candidate model for data. We use the bootstrapped KS p–value to discriminate among competing distributions and to generalize our findings.
Vuong test [53] can be used to discriminate between two non-nested models by testing the null hypothesis that the models provide indistinguishable fits for the same data. Suppose we denote the probabilities for models 1 and 2 by and , respectively, where and denote the parameter estimates for models 1 and 2, respectively. Let . The test statistic for Voung’s test is , where and sd denote the mean and standard deviation of d, respectively. A large, positive test statistic value provides evidence that model 1 is superior to model 2. A large, negative test statistic value gives evidence that model 2 is superior to model 1. Under the null hypothesis that the models are inseparable, the test statistic Λ is asymptotically standard normal distributed. Two finite sample corrections of Vuong’s test are sometimes considered based on the AIC and BIC penalty terms, depending on the complexity of the two models. However, these corrections sometimes generate conflicting conclusions.
4 Results and discussion
Ljung–Box p–values in Table 2 are > 0.05 suggesting that the residuals of the fitted ARIMA models are not statistically significant from white noise at 0.05 significance level for all the crops except for plantain in Uganda which is not statistically significant from white noise at 0.01 significance level. All of the fitted models are suitable for prediction based on the residual analysis. From the 10 years (2019–2028) point forecast (solid blue lines) of the fitted ARIMA models in Figs 5 to 10, we observe the following for Burundi: an initial sharp drop in 2019 followed by an upward swing of yield for banana; a sharp increase in 2019 followed by increasing oscillations of yield for sweet potato; the yield for sorghum shows a quick increase from 2019 to 2028; the yield for beans shows an immediate decline from 2019 to 2028; neither cassava nor coffee indicate any increasing or decreasing pattern from 2019 to 2028. In Kenya, we observe the following: the yield for beans shows a continuous decline from 2019 to 2028; neither upward nor downward yield trend is evident for coffee, rice, wheat and sugar cane from 2019 to 2028; the yield of maize shows a sharp drop in 2019 followed by an increase and then a stable trend. In Somalia, we observe the following: the yield for maize or sugar cane does not indicate any pattern; the yield for banana shows an initial moderate increase in 2019 followed by a period of no trend up to 2028; the yield for sorghum first experienced a sharp drop in 2019 followed by a stable period of no trend up to 2028. In Tanzania, we observe the following: no significant trend could be identified for maize, rice, sweet potato and cotton seed for the entire forecast period; millet is characterized by a slight yield decrease in 2019 followed by a period of no significant trend up to 2028. In Uganda, we observe the following: the forecast for banana, cassava, millet, plantain and sweet potato did not show any significant trend from 2019 to 2028; the yield for coffee shows a slight increase in 2019 followed by a period of neither increase nor decrease. In Rwanda, we observe the following: the yield for beans shows a persistent decline from 2019 to 2028; the yield for sweet potato shows initial jump followed by a slow decline; coffee indicated an upward trend tendency from 2019 to 2028; cassava, potato and sorghum did not indicate any significant trend.
Table 2. Ljung–Box test statistic (Q⋆), its degree of freedom and its p–value for the fitted ARIMA models at lag 10 (i.e. h = 10).
Country | Crop | Fitted ARIMA (p, d, q) a | Q ⋆ | df | p–value |
---|---|---|---|---|---|
Burundi | Banana | ARIMA(2,1,2) | 1.5746 | 6 | 0.9544 |
Beans | ARIMA(0,1,1) with drift | 5.9108 | 8 | 0.6572 | |
Cassava | ARIMA(0,1,0) | 12.0820 | 10 | 0.2796 | |
Coffee | ARIMA(0,1,1) | 8.3159 | 9 | 0.5027 | |
Sorghum | ARIMA(1,0,0) with non-zero mean | 8.9597 | 8 | 0.3457 | |
Sweet potato | ARIMA(2,1,1) | 7.8161 | 7 | 0.3491 | |
Kenya | Beans | ARIMA(1,0,0) | 12.0780 | 8 | 0.1478 |
Coffee | ARIMA(0,1,1) | 13.2300 | 9 | 0.1525 | |
Maize | ARIMA(0,1,3) | 3.4473 | 7 | 0.8408 | |
Rice | ARIMA(0,1,0) | 10.4230 | 10 | 0.4042 | |
Sugar Cane | ARIMA(0,1,0) | 17.9800 | 10 | 0.0553 | |
Wheat | ARIMA(0,1,1) | 13.9030 | 9 | 0.1258 | |
Somalia | Banana | ARIMA(0,0,4) with non-zero mean | 4.1658 | 5 | 0.5258 |
Maize | ARIMA(0,1,0) | 15.1300 | 10 | 0.1274 | |
Sorghum | ARIMA(1,1,1) | 3.8376 | 8 | 0.8715 | |
Sugar Cane | ARIMA(0,1,0) | 13.8200 | 10 | 0.1814 | |
Tanzania | Maize | ARIMA(1,1,1) | 11.8720 | 8 | 0.1570 |
Millet | ARIMA(1,0,0) with non-zero mean | 6.2974 | 8 | 0.6140 | |
Rice | ARIMA(0,1,1) | 7.7560 | 9 | 0.5589 | |
Cotton Seed | ARIMA(0,1,1) | 18.8250 | 9 | 0.0267 | |
Sorghum | ARIMA(2,1,0) | 4.2521 | 8 | 0.8337 | |
Sweet potato | ARIMA(0,1,1) | 12.8150 | 9 | 0.1711 | |
Uganda | Banana | ARIMA(0,1,2) | 3.8909 | 8 | 0.8668 |
Cassava | ARIMA(0,1,0) | 12.1600 | 10 | 0.2745 | |
Coffee | ARIMA(1,0,0) with non-zero mean | 15.3660 | 8 | 0.0524 | |
Millet | ARIMA(0,1,1) | 6.1006 | 9 | 0.7298 | |
Plantain | ARIMA(0,1,0) | 19.0490 | 10 | 0.0397 | |
Sweet potato | ARIMA(0,1,0) | 17.7340 | 10 | 0.0596 | |
Rwanda | Beans | ARIMA(1,0,0) with non-zero mean | 7.9098 | 8 | 0.4423 |
Cassava | ARIMA(1,1,0) | 2.3680 | 9 | 0.9842 | |
Coffee | ARIMA(1,0,0) with non-zero mean | 3.5770 | 8 | 0.8931 | |
Potato | ARIMA(0,1,2) | 4.4913 | 8 | 0.8103 | |
Sorghum | ARIMA(1,0,0) with non-zero mean | 4.4304 | 8 | 0.8164 | |
Sweet potato | ARIMA(2,0,2) with non-zero mean | 2.4531 | 5 | 0.7835 |
aDue to space constraints, we omit the coefficients of the fitted ARIMA models; interested readers can obtain them from the authors upon reasonable request.
Fig 5. Time series plots and 10 years yield forecast with fitted ARIMA models showing 80% and 95% confidence bands for crops in Burundi.
Fig 10. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Rwanda.
Fig 6. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Kenya.
Fig 7. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Somalia.
Fig 8. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Tanzania.
Fig 9. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Uganda.
The changes observed in Figs 5 to 10 are consistent with findings in the literature. [54] established that both intra- and interseasonal changes in temperature and precipitation influence cereal yields in Tanzania. [55] reported that climate change will reduce mean yields in Africa by 17% for wheat, 5% for maize, 15% for sorghum and 10% for millet. No mean change in yield for rice was detected. Using data from the northern Tanzanian highlands, [56] demonstrated that increasing night time temperature is the most significant climatic variable responsible for diminishing coffea arabica yields between 1961 and 2012. According to [57], annual food crops in the Kilimanjaro region of Tanzania were particularly sensitive to the drought and maize and beans yields were lower than perennial crops during the years of drought. Through a simulation study, [58] predicted climate change in east Africa and found its negative impact on crop production in that region. They projected that the crop output decrease will lie between 1.2% and 4.5%. [59] identified soil erosion by water as one of the major causes of land degradation and dwindling agricultural produce in Africa resulting in an estimated yearly crop yield loss of about 280 million tons. [60] provided evidence to suggest that climate change severely impacted rice production in Rwanda. [61] produced evidence to suggest that temperature increases lead to decline in maize and cassava crops for Tanzania, Malawi, Zambia and South Africa. [62] observed that the yields for maize, sorghum or millet fluctuated at a decreasing trend in the Kongwa district of Tanzania. According to [63], increased temperatures in Kenya due to climate change have a general tendency to reduce rice yields. [64] showed that the impacts of projected changes in climate on maize production areas are the reduction in the suitability of the crop, especially around central and western Tanzania, mid-northern and western Uganda, and parts of western Kenya by 20–40%, and patches of east Africa will experience a reduction as high as 40–60%, especially in northern Uganda, and western Kenya. According to [65], maize production in southern highlands of Tanzania has decreased during the past two decades, since the year 2000. According to [66], climate change has induced a devastating effect on agricultural production in Somalia leading to crop yield to decline including sorghum.
Tables 3 and 4 give the BIC, AICc and the KS p–values of the fitted distributions. The BIC and AICc values for the power law distribution are smaller than those for the remaining distributions. The KS p–value > 0.05 in all the cases except for Millet in Uganda indicating that the power law is not a plausible distribution in this case. We cannot compare the values of the goodness-of-fit measures of the power law distribution with those of the other distributions because the power law distribution fits only the tails whereas the lognormal, Fréchet, and the stretched exponential distributions fit the entire data. Thus, we can only compare the BIC and AICc values of the lognormal, Fréchet, and the stretched exponential distributions. Based on the KS p–value, we can observe that the lognormal distribution could be a plausible distribution for banana and coffee in Burundi, all the crops in Kenya except for sugar cane, maize and sorghum in Somalia, all the crops in Tanzania, all but banana in Uganda and all but cassava in Rwanda. Fréchet distribution appears to be a plausible distribution for banana in Burundi, maize and wheat in Kenya, maize and sorghum in Somalia, all except for maize and sorghum in Tanzania and cassava, coffee and plantain in Uganda and all except for cassava and potato in Rwanda. The stretched exponential distribution appears to be a plausible distribution for beans and coffee in Burundi, all the crops in Kenya, maize in Somalia, all the crops in Tanzania, all except for plantain in Uganda and all the crops in Rwanda.
Table 3. AIC, BIC, AICc and KS p–values (as defined in Section 3) for the fitted distributions.
Country—Crop | Power law | Lognormal | Fréchet | Stretched exponential | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
BIC | AICc | p–value | BIC | AICc | p–value | BIC | AICc | p–value | BIC | AICc | p–value | |
Burundi—Banana | 723.717 | 719.814 | 0.954 | 1239.032 | 1235.129 | 0.051 | 1214.761 | 1210.859 | 0.629 | 1282.202 | 1278.299 | 0.000 |
Beans | 519.823 | 515.920 | 0.532 | 1019.932 | 1016.029 | 0.004 | 1048.091 | 1044.188 | 0.001 | 1005.110 | 1001.208 | 0.092 |
Cassava | 585.320 | 581.417 | 0.132 | 1293.669 | 1289.766 | 0.000 | 1337.395 | 1333.492 | 0.000 | 1250.241 | 1246.338 | 0.000 |
Coffee | 307.680 | 303.777 | 0.931 | 1054.181 | 1050.278 | 0.112 | 1091.141 | 1087.238 | 0.001 | 1036.031 | 1032.129 | 0.645 |
Sorghum | 78.290 | 74.387 | 0.930 | 1053.391 | 1049.489 | 0.001 | 1076.397 | 1072.495 | 0.000 | 1045.201 | 1041.298 | 0.000 |
Sweet potato | 134.892 | 130.989 | 0.901 | 1243.862 | 1239.959 | 0.000 | 1184.906 | 1181.003 | 0.000 | 1290.785 | 1286.882 | 0.000 |
Kenya—Beans | 493.325 | 489.422 | 0.504 | 980.215 | 976.313 | 0.223 | 999.376 | 995.474 | 0.010 | 979.803 | 975.900 | 0.432 |
Coffee | 308.382 | 304.479 | 0.721 | 1043.274 | 1039.371 | 0.109 | 1390.577 | 1386.674 | 0.000 | 1038.009 | 1034.106 | 0.323 |
Maize | 352.530 | 348.627 | 0.756 | 1084.795 | 1080.893 | 0.332 | 1093.394 | 1089.491 | 0.112 | 1085.083 | 1081.180 | 0.532 |
Rice | 785.164 | 781.262 | 0.303 | 1251.779 | 1247.876 | 0.121 | 1282.416 | 1278.513 | 0.012 | 1242.221 | 1238.318 | 0.231 |
Sugar Cane | 771.239 | 767.336 | 0.944 | 1613.395 | 1609.492 | 0.007 | 1639.809 | 1635.906 | 0.000 | 1595.751 | 1591.848 | 0.321 |
Wheat | 418.665 | 414.762 | 0.835 | 1153.186 | 1149.283 | 0.821 | 1160.712 | 1156.810 | 0.712 | 1162.411 | 1158.508 | 0.402 |
Somalia—Banana | 306.400 | 302.498 | 0.889 | 1412.801 | 1408.899 | 0.000 | 1430.688 | 1426.786 | 0.000 | 1421.742 | 1417.839 | 0.000 |
Maize | 577.876 | 573.973 | 0.811 | 1097.639 | 1093.737 | 0.943 | 1109.941 | 1106.038 | 0.423 | 1102.289 | 1098.386 | 0.434 |
Sorghum | 299.456 | 295.554 | 0.954 | 978.390 | 974.487 | 0.131 | 976.162 | 972.259 | 0.203 | 1006.621 | 1002.718 | 0.015 |
Sugar Cane | 1561.743 | 1557.841 | 0.252 | 1621.627 | 1617.724 | 0.009 | 1608.998 | 1605.095 | 0.019 | 1636.254 | 1632.351 | 0.003 |
Tanzania—Maize | 602.620 | 598.718 | 0.632 | 1146.010 | 1142.107 | 0.100 | 1157.454 | 1153.551 | 0.005 | 1155.097 | 1151.194 | 0.103 |
Millet | 357.070 | 353.168 | 0.922 | 1061.506 | 1057.604 | 0.966 | 1066.259 | 1062.356 | 0.533 | 1080.704 | 1076.801 | 0.121 |
Rice | 645.884 | 641.981 | 0.532 | 1157.416 | 1153.513 | 0.802 | 1169.649 | 1165.746 | 0.242 | 1157.795 | 1153.893 | 0.499 |
Cotton Seed | 480.611 | 476.708 | 0.332 | 999.445 | 995.543 | 0.854 | 1016.260 | 1012.358 | 0.166 | 999.811 | 995.908 | 0.664 |
Sorghum | 264.040 | 260.138 | 0.834 | 1088.711 | 1084.809 | 0.211 | 1100.409 | 1096.506 | 0.023 | 1089.489 | 1085.586 | 0.512 |
Sweet potato | 823.410 | 819.507 | 0.200 | 1303.455 | 1299.553 | 0.661 | 1307.969 | 1304.066 | 0.183 | 1305.698 | 1301.795 | 0.402 |
Uganda—Banana | 269.292 | 265.389 | 0.909 | 1187.916 | 1184.013 | 0.005 | 1224.341 | 1220.438 | 0.000 | 1158.594 | 1154.691 | 0.443 |
Cassava | 688.846 | 684.944 | 0.518 | 1361.801 | 1357.898 | 0.535 | 1361.721 | 1357.818 | 0.515 | 1369.707 | 1365.804 | 0.323 |
Coffee | 614.601 | 610.698 | 0.833 | 1007.740 | 1003.837 | 0.907 | 1013.23 | 1009.328 | 0.516 | 1017.461 | 1013.558 | 0.420 |
Millet | 700.174 | 696.271 | 0.003 | 1094.174 | 1090.271 | 0.074 | 1113.196 | 1109.294 | 0.008 | 1079.048 | 1075.145 | 0.187 |
Plantain | 884.580 | 880.678 | 0.667 | 1245.936 | 1242.033 | 0.138 | 1242.476 | 1238.573 | 0.384 | 1261.018 | 1257.115 | 0.004 |
Sweet potato | 618.508 | 614.605 | 0.654 | 1189.226 | 1185.324 | 0.087 | 1209.564 | 1205.662 | 0.012 | 1196.161 | 1192.258 | 0.065 |
Table 4. Continuation of Table 3.
Country—Crop | Power law | Lognormal | Fréchet | Stretched exponential | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
BIC | AICc | p–value | BIC | AICc | p–value | BIC | AICc | p–value | BIC | AICc | p–value | |
Rwanda—Beans | 171.782 | 167.879 | 0.935 | 978.580 | 974.677 | 0.524 | 989.914 | 986.011 | 0.165 | 978.688 | 974.785 | 0.642 |
Cassava | 152.243 | 148.340 | 0.909 | 1410.743 | 1406.840 | 0.019 | 1433.857 | 1429.954 | 0.004 | 1392.647 | 1388.744 | 0.176 |
Coffee | 578.925 | 575.022 | 0.903 | 1017.965 | 1014.062 | 0.332 | 1040.145 | 1036.243 | 0.054 | 1022.827 | 1018.925 | 0.105 |
Potato | 945.602 | 941.699 | 0.810 | 1332.154 | 1328.251 | 0.051 | 1363.027 | 1359.124 | 0.001 | 1324.835 | 1320.932 | 0.142 |
Sorghum | 617.552 | 613.650 | 0.732 | 1038.192 | 1034.289 | 0.543 | 1054.371 | 1050.468 | 0.223 | 1043.405 | 1039.502 | 0.091 |
Sweet potato | 278.219 | 274.317 | 0.983 | 1274.881 | 1270.979 | 0.675 | 1292.205 | 1288.303 | 0.263 | 1274.463 | 1270.560 | 0.243 |
Based on the AICc and BIC values in Tables 3 and 4, we can see that none of the three distributions that model the entire data (i.e. the lognormal, Fréchet and the stretched exponential distributions) consistently provide the best fit. None of them consistently gave the smallest AICc or smallest BIC values across the countries. The bootstrapped KS p–values in Table 5 indicate that the power law distribution is a plausible model for all the crop yield data. In general, the distribution with the smallest AICc and smallest BIC values corresponds to the distribution with the largest bootstrapped KS p–values. Fitting of such distributions to the tail of the data can be compared with that of the power law distribution by using Vuong’s test. The results of this comparison are presented in Table 6. We can observe that the stretched exponential distribution emerges as the best model for millet in Uganda and the power law distribution emerges as the best model for the rest of the crops except for a few cases where the winner is undecided. For instance, for sorghum in Burundi (power law and lognormal), sweet potato in Burundi (power law and Fréchet) and for banana in Somalia (power law and lognormal). The log-log plots of the fitted distributions superimposed with the empirical distributions are displayed in Figs 11 to 16. We can see that the power law distribution fits all the crop yield data well across the countries.
Table 5. Bootstrap KS-test p–values (as defined in Section 3) for the fitted distributions.
Country | Crop | Power law | Lognormal | Fréchet | Stretched exponential |
---|---|---|---|---|---|
Burundi | Banana | 0.966 | 0.732 | 0.925 | 0.094 |
Beans | 0.902 | 0.424 | 0.112 | 0.732 | |
Cassava | 0.832 | 0 | 0 | 0.021 | |
Coffee | 0.943 | 0.831 | 0.322 | 0.941 | |
Sorghum | 0.952 | 0.305 | 0.101 | 0.324 | |
Sweet potato | 0.903 | 0 | 0.221 | 0 | |
Kenya | Beans | 0.913 | 0.902 | 0.591 | 0.944 |
Coffee | 0.928 | 0.821 | 0 | 0.922 | |
Maize | 0.929 | 0.931 | 0.820 | 0.954 | |
Rice | 0.805 | 0.821 | 0.461 | 0.844 | |
Sugar Cane | 0.945 | 0.611 | 0.132 | 0.940 | |
Wheat | 0.905 | 0.962 | 0.901 | 0.922 | |
Somalia | Banana | 0.923 | 0.016 | 0.012 | 0 |
Maize | 0.933 | 0.942 | 0.912 | 0 | |
Sorghum | 0.955 | 0.813 | 0.828 | 0 | |
Sugar Cane | 0.903 | 0.523 | 0.631 | 0 | |
Tanzania | Maize | 0.956 | 0.787 | 0.511 | 0.723 |
Millet | 0.931 | 0.962 | 0.936 | 0.902 | |
Rice | 0.902 | 0.951 | 0.921 | 0.952 | |
Cotton Seed | 0.933 | 0.971 | 0.822 | 0.949 | |
Sorghum | 0.939 | 0.827 | 0.609 | 0.906 | |
Sweet potato | 0.804 | 0.965 | 0.919 | 0.915 | |
Uganda | Banana | 0.965 | 0.514 | 0.055 | 0.931 |
Cassava | 0.901 | 0.961 | 0.919 | 0.924 | |
Coffee | 0.955 | 0.980 | 0.911 | 0.923 | |
Millet | 0.201 | 0.612 | 0.423 | 0.919 | |
Plantain | 0.932 | 0.831 | 0.925 | 0.506 | |
Sweet potato | 0.909 | 0.810 | 0.424 | 0.734 | |
Rwanda | Beans | 0.958 | 0.951 | 0.901 | 0.911 |
Cassava | 0.987 | 0.612 | 0.301 | 0.812 | |
Coffee | 0.943 | 0.902 | 0.613 | 0.832 | |
Potato | 0.924 | 0.732 | 0.321 | 0.815 | |
Sorghum | 0.932 | 0.933 | 0.841 | 0.822 | |
Sweet potato | 0.949 | 0.910 | 0.906 | 0.931 |
Table 6. Vuong test statistic (Λ) and its p–value for comparing the upper tail (i.e. x > xmin) of the fitted power law distribution and the best among the rest of the competing distributions.
Country | Crop | Contest | Statistic (Λ) | p–value | Winner |
---|---|---|---|---|---|
Burundi | Banana | Power law vs Fréchet | 2.6671 | 0.0077 | Power law |
Beans | Power law vs Stretched exponential | 6.7366 | 0 | Power law | |
Cassava | Power law vs Stretched exponential | 10.4399 | 0 | Power law | |
Coffee | Power law vs Stretched exponential | 4.1021 | 0 | Power law | |
Sorghum | Power law vs Lognormal | 1.2454 | 0.2130 | Undecided | |
Sweet potato | Power law vs Fréchet | 0.7442 | 0.4568 | Undecided | |
Kenya | Beans | Power law vs Stretched exponential | 3.9727 | 1.0×10−4 | Power law |
Coffee | Power law vs Stretched exponential | 3.5192 | 4.0×10−4 | Power law | |
Maize | Power law vs Stretched exponential | 2.8513 | 0.0044 | Power law | |
Rice | Power law vs Stretched exponential | 4.4303 | 0 | Power law | |
Sugar Cane | Power law vs Stretched exponential | 4.3048 | 0 | Power law | |
Wheat | Power law vs Lognormal | 2.5905 | 0.0096 | Power law | |
Somalia | Banana | Power law vs Lognormal | 1.7339 | 0.0829 | Undecided |
Maize | Power law vs Lognormal | 2.6642 | 0.0077 | Power law | |
Sorghum | Power law vs Fréchet | 1.9648 | 0.0494 | Power law | |
Sugar Cane | Power law vs Fréchet | 4.5875 | 0 | Power law | |
Tanzania | Maize | Power law vs Lognormal | 5.4006 | 0 | Power law |
Millet | Power law vs Lognormal | 3.8995 | 1.0×10−4 | Power law | |
Rice | Power law vs Stretched exponential | 2.8193 | 0.0048 | Power law | |
Cotton Seed | Power law vs Lognormal | 2.8774 | 0.0040 | Power law | |
Sorghum | Power law vs Stretched exponential | 2.8893 | 0.0039 | Power law | |
Sweet potato | Power law vs Lognormal | 2.3193 | 0.0204 | Power law | |
Uganda | Banana | Power law vs stretched exponential | 2.9194 | 0.0035 | Power law |
Cassava | Power law vs Lognormal | 2.4766 | 0.0133 | Power law | |
Coffee | Power law vs Lognormal | 2.7274 | 0.0064 | Power law | |
Millet | Power law vs Stretched exponential | −2.8155 | 0.0049 | Stretched exponential | |
Plantain | Power law vs Fréchet | 3.1373 | 0.0017 | Power law | |
Sweet potato | Power law vs Lognormal | 3.9022 | 1.0×10−4 | Power law | |
Rwanda | Beans | Power law vs Stretched exponential | 3.5970 | 3.0×10−4 | Power law |
Cassava | Power law vs Stretched exponential | 2.9573 | 0.0031 | Power law | |
Coffee | Power law vs Lognormal | 4.3420 | 0 | Power law | |
Potato | Power law vs Stretched exponential | 5.4570 | 0 | Power law | |
Sorghum | Power law vs Lognormal | 4.1122 | 0 | Power law | |
Sweet potato | Power law vs Lognormal | 2.6648 | 0.0077 | Power law |
Fig 11. Log-log plots for crops yield in Burundi where the red line corresponds to the value of xmin in Table 7.
Fig 16. Log-log plots for crops yield in Rwanda where the red line corresponds to the value of xmin in Table 7.
Fig 12. Log-log plots for crops yield in Kenya where the red line corresponds to the value of xmin in Table 7.
Fig 13. Log-log plots for crops yield in Somalia where the red line corresponds to the value of xmin in Table 7.
Fig 14. Log-log plots for crops yield in Tanzania where the red line corresponds to the value of xmin in Table 7.
Fig 15. Log-log plots for crops yield in Uganda where the red line corresponds to the value of xmin in Table 7.
Since the power law model appears to be a plausible distribution for virtually all the crops across countries, we present the estimate for the parameters of the distribution in Table 7. We see that the power law mechanism may occur at varying degrees depending on the type of crop and country. See the ntail values for crops in Table 7, where ntail denotes the total number of observations equal to or above the threshold value xmin, i.e. the total number of data points following the power law distribution. The occurrence of such extremely high crop yield definitely has positive impact on farmers and food security. In this case, farmers can make huge profits. Crop yield risk insurance policies for such crops can attract relatively lower premium rates compared to crops with lower yields. The α value of the fitted power law model describes the heaviness of the tail distribution corresponding to extremely high crop yield events with yield > xmin. According to Table 7, the estimates of α are all > 2 indicating that the data in the right tail of the distribution show significant high inequality (i.e. large crop yield). However, there are two special cases satisfying 2 < α ≤ 3 specifically in Somalia for sugar cane and Tanzania for sweet potato. In these cases, the variance and higher-order moments for the crop yields are infinite regardless of whether their mean yield exists or not. Hence, the classical central limit theorem does not hold for these yield data. The consequence of the infinite variance and higher order moments is that empirical estimates of the means converge very slowly due to the regular occurrence of extremely large crop yield values. These characteristics suggest that crop harvest with extremely large yield could sometimes occur for sugar cane in Somalia and sweet potato in Tanzania. Such events could often be of great importance to the farmers and other investors in agribusiness. This behavior is referred to as the black swan mechanism (see [67]). The black swan mechanism describes events coming as a surprise. It has a major effect (positive or negative) and is often inappropriately rationalized. Farmers can have the tendency to break even and even enjoy lower crop yield risk insurance policies in Somalia and Tanzania if they invest in sugar cane and sweet potato, respectively, due to their potential for extremely high yield.
Table 7. Parameter estimates for the power law distribution for all the crop yield data sets (xmin and α are parameters of the power distribution; αse is the standard error corresponding to α; ntail is the number of data exceeding xmin).
Country | Crop | n | n tail | x min | α | α se |
---|---|---|---|---|---|---|
Burundi | Banana | 58 | 35 | 55455 | 7.4026 | 1.0822 |
Beans | 58 | 35 | 9888 | 19.9503 | 3.2032 | |
Cassava | 58 | 35 | 89591 | 52.1287 | 8.7685 | |
Coffee | 58 | 35 | 9180 | 15.9219 | 3.3366 | |
Sorghum | 58 | 5 | 12964 | 33.5816 | 14.5710 | |
Sweet potato | 58 | 6 | 89127 | 8.1898 | 2.9352 | |
Kenya | Beans | 58 | 32 | 5429 | 9.4680 | 1.4969 |
Coffee | 58 | 20 | 6669 | 11.9168 | 2.4411 | |
Maize | 58 | 21 | 16922 | 14.5952 | 2.9667 | |
Rice | 58 | 39 | 37568 | 6.7330 | 0.9180 | |
Sugar Cane | 58 | 30 | 806694 | 8.5033 | 1.3699 | |
Wheat | 58 | 22 | 19359 | 6.5818 | 1.1901 | |
Somalia | Banana | 58 | 13 | 222222 | 8.2251 | 2.0039 |
Maize | 58 | 32 | 9544 | 5.4249 | 0.7822 | |
Sorghum | 58 | 18 | 4143 | 5.3367 | 1.0222 | |
Sugar Cane | 58 | 57 | 300000 | 2.7445 | 0.2311 | |
Tanzania | Maize | 58 | 33 | 12427 | 6.0445 | 0.8781 |
Millet | 58 | 21 | 9530 | 8.3193 | 1.5972 | |
Rice | 58 | 34 | 15583 | 5.4751 | 0.7675 | |
Cotton Seed | 58 | 30 | 5132 | 7.2276 | 1.1370 | |
Sorghum | 58 | 15 | 10133 | 7.3606 | 1.6423 | |
Sweet potato | 58 | 37 | 25323 | 2.9081 | 0.3137 | |
Uganda | Banana | 58 | 16 | 44748 | 36.7026 | 8.9257 |
Cassava | 58 | 30 | 66988 | 4.0035 | 0.5484 | |
Coffee | 58 | 37 | 5873 | 6.3153 | 0.8738 | |
Millet | 58 | 39 | 12496 | 6.6790 | 0.9094 | |
Plantain | 58 | 43 | 53474 | 7.3752 | 0.9722 | |
Sweet potato | 58 | 33 | 40960 | 12.6778 | 2.0328 | |
Rwanda | Beans | 58 | 12 | 8914 | 28.4564 | 7.9260 |
Cassava | 58 | 7 | 120265 | 13.0159 | 4.5416 | |
Coffee | 58 | 36 | 5817 | 7.6296 | 1.1049 | |
Potato | 58 | 44 | 64088 | 6.0225 | 0.7572 | |
Sorghum | 58 | 37 | 10688 | 9.6451 | 1.4212 | |
Sweet potato | 58 | 14 | 75333 | 15.2064 | 3.7968 |
All the estimated α values for the power law distribution in Table 7 are > 3 except for sugar cane in Somalia and sweet potato in Tanzania. This indicates that the sample means for these crops are Gaussian distributed and that their variances are finite. Hence, the standard central limit theorem applies for these crop yield data. The finite mean and variance and the observed evidence of underdispersion in Table 1 suggest that east African regional food security does not seem to be extremely volatile as regular crop yields for these crops tend to cluster around the mean crop yield.
Ignoring the impacts of climate and environment, soil structures and compositions/nutrients, crop species, mechanization and technology, etc on crop yields, the observed black swan behaviour for the yields of sugar cane in Somalia and sweet potato in Tanzania could be explained by the so called “rich getting richer” principle or the “preferential attachment” principle. Based on these principles, these two crops have potentials for extremely high yield perhaps because of either high demand (so every farmer tends to make them their choice crops for cultivation) or common practice such as irrigation adopted by all the farmers being capable of increasing crop yield [36]. So, speaking of crop harvest, yield could follow the pattern of the rich getting richer or the preferential attachment principle. The extremely high yields for sugar cane in Somalia and sweet potato in Tanzania are not just a little bit higher than the normal yield for the same or different crops in the same or other countries. Instead they are so much higher that they cause their distributions to skew significantly.
5 Conclusions
We have analyzed the trend and tail of some yearly crop yield data such as banana, plantain, beans, cassava, coffee, sorghum, potato, sweet potato, maize, rice, sugar cane, wheat, millet and cotton seed from 1961 to 2018 in six east African countries: Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda. An exploratory analysis of the crop yield data reveals three structural patterns in each of the series. They are: increasing, decreasing and stagnant trends. Ten years (2019–2028) time series point forecast based on the fitted ARIMA models shows that majority of the crops will experience stagnant yield in different countries with only sorghum and coffee showing the tendency for significant and persistent upward trend in Burundi and Rwanda, respectively, while beans indicates significant and persistent yield decrease in Burundi, Kenya and Rwanda.
We used the power law, lognormal, Fréchet and stretched exponential distributions to describe high yields in all the crops across the countries. Based on Vuong’s test, we observed that the stretched exponential distribution gave the best fit for millet in Uganda while the power law distribution gave the best fit for the other crops except for a few undecided cases. The log-log plots were used to visually inspect the performance of the fitted distributions. The power law distribution appeared to fit the upper tail of all the crop yield data better than the other distributions in all the countries. Based on the estimated α value of the fitted power law model, we found potential for extremely high yield in sugar cane in Somalia and sweet potato in Tanzania indicating the inappropriateness of the Gaussian distribution for describing these crop yields. Other crops in Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda can produce only high but not extremely high yields. Though the time series point forecasts for majority of the crops show yield stagnancy with a few exceptions, the evidence from the power law analysis indicates the potential for high yield for all the crops and provides specific calibrations for the yield of all the crops in terms of what quantity of yield is considered high.
We characterize the evidence for extremely high yield for sugar cane and sweet potato in Somalia and Tanzania, respectively, as black swan where the “rich getting richer” or the “preferential attachment” could be the underlying generating process, meaning that either the two crops are increasingly at lower risk of climate change and environmental challenges such as being drought resistant or farmers are constantly doing many things right (such as adopting favorable planting strategies, large crop areas, etc) as far as the cultivation of the two crops are concerned in the two countries.
ARIMA(0,1,1) was used to model and predict coffee in Burundi and Kenya; beans in Burundi; wheat in Kenya; rice, cotton seed, and sweet potato in Tanzania; millet in Uganda. ARIMA(2,1,0) was used to model and predict sorghum in Tanzania. ARIMA(2,1,2) was used to model and predict banana in Burundi. ARIMA(0,1,2) was used to model and predict banana in Uganda and potato in Rwanda. ARIMA(0,1,0) was used to model and predict cassava in Burundi and Uganda; sugarcane in Kenya and Somalia; rice in Kenya; maize in Somalia; plantain and sweet potato in Uganda. ARIMA(1,0,0) was used to model and predict Sorghum in Burundi and Rwanda; beans in Kenya and Rwanda; millet in Tanzania; coffee in Uganda. ARIMA(2,1,1) was used to model and predict sweet potato in Burundi. ARIMA(0,1,3) was used to model and predict maize in Kenya. ARIMA(0,0,4) was used to model and predict banana in Somalia. ARIMA(1,1,1) was used to model and predict sorghum in Somalia; maize in Tanzania. ARIMA(2,0,2) was used to model and predict sweet potato in Rwanda.
The yield forecast in Burundi shows an initial quick decline in 2019 followed by an increase for banana; a sharp increase in 2019 followed by an increase for sweet potato; sorghum shows a quick increase from 2019 to 2028; beans shows a sharp decrease from 2019 to 2028; neither cassava nor coffee show any tendency to increase or decrease from 2019 to 2028. The forecast of the crop yield in Kenya indicates continuous decline of beans yield from 2019 to 2028; no decrease or increase pattern in yield is evident for coffee, rice, wheat and sugar cane from 2019 to 2028; maize shows a sharp decline in 2019 with an immediate increase followed by a stable trend. In Somalia, the yield forecast for maize and sugar cane does not indicate any pattern; banana shows an initial moderate increase in 2019 followed by the lack of pattern until 2028; sorghum experienced a sharp drop in 2019 followed by a period of no trend up to 2028. The yield forecast in Tanzania indicates no significant trend for maize, rice, sweet potato and cotton seed for the whole forecast period; millet is slightly decreased in 2019 and remained stagnant until 2028. The yield forecast in Uganda indicates that banana, cassava, millet, plantain and sweet potato did not show any significant pattern from 2019 to 2028; coffee shows a slight increase in 2019 followed by a period of no change in yield. The yield forecast in Rwanda indicates that beans persistently decreased from 2019 to 2028; sweet potato shows initial increase followed by a slow decrease; coffee indicated an upward trend from 2019 to 2028; cassava, potato and sorghum did not show any significant pattern.
In our discussion in Section 4, we saw how the literature points in the direction of climate change as the major cause of the observed yield stagnancy and decline. On this backdrop, we suggest that a promising future in favour of high crop yield could await east Africa if urgent changes or improvements on the cropping systems and infrastructures that currently exist in east Africa could be made in order to meet up with the inevitable future demand of agricultural produce due to the increasing population and the challenge of negative impacts of climate change. Science and technology could be useful in showing how agricultural production can be significantly improved in east Africa. For instance, the construction of irrigation systems and rainwater harvesting structures could help cushion the impact of climate change.
Further, various climate adaptation/smart strategies could be adapted to increase yields in east Africa. According to [68], short-duration pigeon pea varieties developed by the International Crops Research Institute for Semi-Arid Tropics and the Kenya Agricultural Research Institute can give high yields and escape drought, but require non-traditional management practices (for example, sole-cropping, spraying against insect pests). According to [69], NERICA, a new rice for Africa, has shown high potential to revolutionize rice farming, producing high yield with minimum inputs in stress-afflicted ecologies. [70] observed that cassava mosaic disease (CMD) resistant cassava varieties released in western Kenya and Uganda yielded up to three times more than local varieties. [71] demonstrated that high yields of maize were recorded from certain varieties (Pwani Hybrid 4-PH4, Coast Composite Maize-CCM and the local check-Mdzihana) but they usually required relatively high rainfall amounts in order for them to produce better yields. [72] showed that increased knowledge of varieties, environment and management factors can double total yield of maize, sorghum, millet and groundnut from 1.67 to 3.29 tons per hectare from the average 5.1 hectares that farmers usually crop in south east Zimbabwe. [73] showed that improved maize varieties outyielded the traditional control variety by 26–46% across sites and season in central Mozambique. [74] showed that the use of organic soil management practices such as reduced tillage, mulching and leguminous crops in the northern part of Tanzania increased the production of food crops from an average of 0.5 ton per hectare to 1.5 ton per hectare; subsequently, maize yields increased from 12,000 kilogram to 20,000 kilogram per 4.8 hectares. [75] suggested that relaxing liquidity constraints could help to encourage farmers’ adaptation through the implementation of soil, water and land management strategies; thereby, positioning east Africa for food sufficiency in the face of the current global food crisis. [76] noted that intensive manuring with a combination of green and poultry manure produced high yields of maize in central Uganda that were comparable to those with mineral fertilizers. [77] demonstrated that households in Kenya adapting to climate change and climate variability through uptake of technologies such as early planting, use of improved crop varieties, and crop diversification produced 4877 kilograms of maize yield equivalent / hectare per year against 3238 kilograms of maize yield equivalent / hectare per year for households that did not adapt (a 33.6% difference between the two groups). [78] found that fertilizer application in the intercropping system is eastern and southern Africa improved cereal yields by 71–282% and pigeon pea yields by 32–449%, increased benefit-cost ratios by 10–40%, and reduced variability in cereal yields by 40–56% and pigeon pea yields by 5–52% compared with unfertilized intercrops. [79] showed that drought resistant climate-smart maize hybrids in Kenya increased yields 33 to 54% relative to conventional hybrids. According to [80], climate adaptation strategies in the central highlands of Kenya included the use of fertilizer and manure in combination (71%), terracing (66%), and crop rotation (60%). [81] showed that climate-smart adaptation practices significantly enhanced wheat yield by 34.35% in southern Ethiopia. [82] showed that use of mulching and permanent planting basin dimensions on maize in western Uganda relatively increased yield by 11–66% and water use efficiency by 33–94% compared to conventional practices.
The findings in this paper underscore the importance of using climate-smart agricultural alternatives to improve resilience farming system and the livelihood of subsistence farmers due to the impact of climate change in east Africa. Currently, crop yield for majority of the crops in different countries has been confirmed to neither increase nor decrease with only few crops experiencing all time increase or decrease in yield. Urgent attention should be paid to beans production in the affected countries in order to reverse the persistent downward trend of its yield. This paper brings good news of hope for crop yield increase in east Africa if adaptive farming methods and strategies are adequately harnessed in the region in the face of climate and environmental challenges and rising global demand for agricultural produce.
The data from 1961 to 2018 consist of only 58 observations. Hence, the results and forecasts in this paper should be treated conservatively. A future work is to see if more frequent and more up-to-date data are available. Another is to consider multivariate modelling of yield by considering country and crop. The disadvantage of the length of the observed series can be interpolated by explaining the common factor for each country and crop.
Supporting information
(CSV)
(CSV)
(CSV)
(CSV)
(CSV)
(CSV)
(CSV)
(CSV)
(TXT)
Acknowledgments
The authors would like to thank the Editor and the two referees for careful reading and comments which greatly improved the paper.
Data Availability
All relevant data are within the paper and its Supporting information files.
Funding Statement
The authors received no specific funding for this work.
References
- 1. Buerkert A, Hiernaux P. Nutrients in the west African Sudano-Sahelian zone: Losses, transfers and role of external inputs. Zeitschrift fuer Pflanzenernaehrung und Bodenkunde. 1988; 161: 365–383. doi: 10.1002/jpln.1998.3581610405 [DOI] [Google Scholar]
- 2. Nair KPP, Ngachie V, Nzetchoung F. The importance of liming acid soils of the Sahel regions of Africa to sustain high crop productivity—A case-study from the republic of Cameroon. Experimental Agriculture. 1992; 28: 473–476. doi: 10.1017/S0014479700020184 [DOI] [Google Scholar]
- 3. Daniel IO, Adetumbi JA. Maize seed supply systems and implications for seed sector development in southwestern Nigeria. Journal of Sustainable Agriculture. 2006; 28: 25–40. doi: 10.1300/J064v28n02_04 [DOI] [Google Scholar]
- 4. Mofoke ALE, Adewumi JK, Babatunde FE, Mudiare OJ, Ramalan AA. Yield of tomato grown under continuous-flow drip irrigation in Bauchi state of Nigeria. Agricultural Water Management. 2006; 84: 166–172. doi: 10.1016/j.agwat.2006.02.001 [DOI] [Google Scholar]
- 5. Mtambanengwe F, Mapfumo P. Effects of organic resource quality on soil profile N dynamics and maize yields on sandy soils in Zimbabwe. Plant and Soil. 2006; 281: 173–191. doi: 10.1007/s11104-005-4182-3 [DOI] [Google Scholar]
- 6. Mahasi MJ, Kamundia JW. Cluster analysis in rapeseed (Brassica napus L.). African Journal of Agricultural Research. 2007; 9: 409–411. [Google Scholar]
- 7. Howell CL, Myburgh PA, Conradie WJ. Comparison of three different fertigation strategies for drip irrigated table grapes—Part III: Growth, yield and quality. South African Journal of Enology and Viticulture. 2013; 34: 21–29. [Google Scholar]
- 8. Shrestha RP, Ligonja PJ. Social perception of soil conservation benefits in Kondoa eroded area of Tanzania. International Soil and Water Conservation Research. 2015; 3: 183–195. doi: 10.1016/j.iswcr.2015.08.001 [DOI] [Google Scholar]
- 9. Ondieki DK, Nyaboga EN, Wagacha JM, Mwaura FB. Morphological and genetic diversity of Rhizobia Nodulating Cowpea (Vigna unguiculata L.) from agricultural soils of lower eastern Kenya. International Journal of Microbiology. 2017; article number 8684921. doi: 10.1155/2017/8684921 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Wassie A, Pauline N. Evaluating small holder farmers’ preferences for climate smart agricultural practices in Tehuledere district, northeastern Ethiopia. Singapore Journal of Tropical Geography. 2018; 39: 300–316. doi: 10.1111/sjtg.12240 [DOI] [Google Scholar]
- 11. Rakotoarisoa NM, Tsujimoto Y, Oo AZ. Dipping rice seedlings in p-enriched slurry increases grain yield and shortens days to heading on p-deficient lowlands in the central highlands of Madagascar. Field Crops Research. 2020; 254: article number 107806. doi: 10.1016/j.fcr.2020.107806 [DOI] [Google Scholar]
- 12. Kiboi MN, Ngetich FK, Mucheru-Muna MW, Diels J, Mugendi DN. Soil nutrients and crop yield response to conservation-effective management practices in the sub-humid highlands agro-ecologies of Kenya. Heliyon. 2021; 7: article number 07156. doi: 10.1016/j.heliyon.2021.e07156 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Ray DK, Ramankutty N, Mueller ND, West PC, Foley JA. Recent patterns of crop yield growth and stagnation. Nature Communications. 2012; 3: 1–7. doi: 10.1038/ncomms2296 [DOI] [PubMed] [Google Scholar]
- 14. Box GE, Jenkins GM. Time series analysis, forecasting and control. San Francisco: Holden-Day; 1976. [Google Scholar]
- 15. Akouegnonhou O, Demirbas N. Forecasting of rice self-sufficiency in the Benin Republic using ARIMA model. Selcuk Journal of Agriculture and Food Sciences. 2019; 33: 204–214. [Google Scholar]
- 16. Mgaya JF. Application of ARIMA models in forecasting livestock products consumption in Tanzania. Cogent Food and Agriculture. 2019; 5: article number 1607430. doi: 10.1080/23311932.2019.1607430 [DOI] [Google Scholar]
- 17. Ajetomobi JO, Olaleye AO. Auto-regressive integrated moving average (ARIMA) modelling of cocoa production in Nigeria: 1900-2025. Journal of Crop Improvement. 2019; 33: 445–455. doi: 10.1080/15427528.2019.1610534 [DOI] [Google Scholar]
- 18. Sharma PK, Dwivedi S, Ali L, Arora RK. Forecasting maize production in India using ARIMA model. Agro Economist-An International Journal. 2018; 5: 1–6. [Google Scholar]
- 19.Sikalubya M, Xu S, Yu W, Moonga P. Study on forecasting soybean production: An application of ARIMA model. In: Proceedings of the 2019 International Conference on Intelligent Computing, Automation and Systems (ICICAS). 2019. pp. 447-452.
- 20. Biswas R. Study on ARIMA modelling to forecast area and production of Kharif rice in West Bengal. Cutting-edge Research in Agricultural Sciences. 2021; 12: 142–151. doi: 10.9734/bpi/cras/v12/10700D [DOI] [Google Scholar]
- 21. Shoko RR, Belete A. Efficient planning of sorghum production in South Africa–Application of the Box-Jenkin’s method. Journal of Agribusiness and Rural Development. 2017; 4: 835–841. doi: 10.17306/J.JARD.2017.00352 [DOI] [Google Scholar]
- 22. Mehmood Q, Sial M, Riaz M, Shaheen N. Forecasting the production of sugarcane crop of Pakistan for the year 2018–2030, using Box-Jenkin’s methodology. Journal of Animal and Plant Sciences. 2019; 5: 1396–1401. [Google Scholar]
- 23.Chen SL, Miranda MJ. Modelling multivariate crop yield densities with frequent extreme events (No. 377-2016-20672). In: Proceedings of the Agricultural and Applied Economics Association Conference. 2004.
- 24. Warusawitharana M. Time-varying volatility and the power law distribution of stock returns. Journal of Empirical Finance. 2018; 49: 123–141. doi: 10.1016/j.jempfin.2018.09.004 [DOI] [Google Scholar]
- 25. Masseran N, Yee LH, Safari MAM, Ibrahim K. Power law behavior and tail modelling on low income distribution. Mathematics and Statistics. 2019; 7: 70–77. doi: 10.13189/ms.2019.070303 [DOI] [Google Scholar]
- 26. Safari MAM, Masseran N, Ibrahim K, Al-Dhurafi NA. The power-law distribution for the income of poor households. Physica A: Statistical Mechanics and Its Applications. 2020; 557: article number 124893. doi: 10.1016/j.physa.2020.124893 [DOI] [Google Scholar]
- 27. Asif M, Hussain Z, Asghar Z, Hussain MI, Raftab M, Shah SF, et al. A statistical evidence of power law distribution in the upper tail of world billionaires’ data 2010–20. Physica A: Statistical Mechanics and Its Applications. 2021; 581: article number 126198. doi: 10.1016/j.physa.2021.126198 [DOI] [Google Scholar]
- 28. Simsek E, Kim M. Power-law tail in lag time distribution underlies bacterial persistence. Proceedings of the National Academy of Sciences. 2019; 116: 17635–17640. doi: 10.1073/pnas.1903836116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Masseran N. Power-law behaviors of the duration size of unhealthy air pollution events. Stochastic Environmental Research and Risk Assessment. 2021; 35: 1499–1508. doi: 10.1007/s00477-021-01978-2 [DOI] [Google Scholar]
- 30. Chen X, Pan Y, Luo B. Research on power-law distribution of long-tail data and its application to tourism recommendation. Industrial Management and Data Systems. 2021; 121: 1268–1286. doi: 10.1108/IMDS-10-2019-0584 [DOI] [Google Scholar]
- 31. Balthrop A, Quan S. The power-law distribution of cumulative coal production. Physica A: Statistical Mechanics and Its Applications. 2019; 530: article number 121573. doi: 10.1016/j.physa.2019.121573 [DOI] [Google Scholar]
- 32.Akhundjanov SB, Chamberlain L. The power-law distribution of agricultural land size. In: Proceedings of the 2019 Annual Meeting, Agricultural and Applied Economics Association. 2019.
- 33. Campolieti M. The distribution of union size: Canada, 1913–2014. Physica A: Statistical Mechanics and Its Applications. 2020; 558: article number 125007. doi: 10.1016/j.physa.2020.125007 [DOI] [Google Scholar]
- 34. Campolieti M, Ramos A. The distribution of strike size: Empirical evidence from Europe and North America in the 19th and 20th centuries. Physica A: Statistical Mechanics and Its Applications. 2021; 563: article number 125424. doi: 10.1016/j.physa.2020.125424 [DOI] [Google Scholar]
- 35. Pena G, Puente-Ajovín M, Ramos A, Sanz-Gracia F. Log-growth rates of CO2: An empirical analysis. Physica A: Statistical Mechanics and Its Applications. 2022; 588: article number 126550. doi: 10.1016/j.physa.2021.126550 [DOI] [Google Scholar]
- 36. Hennessy DA. Crop yield skewness under law of the minimum technology. American Journal of Agricultural Economics. 2009; 91: 197–208. doi: 10.1111/j.1467-8276.2008.01181.x [DOI] [Google Scholar]
- 37. Koning AJ, Peng L. Goodness-of-fit tests for a heavy tailed distribution. Journal of Statistical Planning and Inference. 2008; 138: 3960–3981. doi: 10.1016/j.jspi.2008.02.013 [DOI] [Google Scholar]
- 38. Zhang H, Wu Z. The general goodness-of-fit tests for correlated data. Computational Statistics and Data Analysis. 2022; 167: article number 107379. doi: 10.1016/j.csda.2021.107379 [DOI] [Google Scholar]
- 39. Pankratz A. Forecasting with univariate Box-Jenkins models: Concepts and cases. New York: John Wiley and Sons; 2009. [Google Scholar]
- 40. Box GE, Jenkins GM, Reinsel GC, Ljung GM. Time series analysis: Forecasting and control. New York: John Wiley and Sons; 2015. [Google Scholar]
- 41. Box GE, Cox DR. An analysis of transformations. Journal of the Royal Statistical Society, B. 1964; 26: 211–243. [Google Scholar]
- 42. Kwiatkowski D, Phillips PCB, Schmidt P, Shin Y. Testing the null hypothesis of stationarity against the alternative of a unit root. Journal of Econometrics. 1992; 54: 159–178. doi: 10.1016/0304-4076(92)90104-Y [DOI] [Google Scholar]
- 43. Cox DR, Hinkley DV. (1979). Theoretical statistics. London: Chapman and Hall; 1979. [Google Scholar]
- 44. Pareto V. La legge della domanda. Giornale Degli Economisti. 1895; 10: 59–68. [Google Scholar]
- 45. Fréchet M. Sur la loi de probabilite de l’ecart maximum. Annales de la Societe Polonaise de Mathematique. 1927; 6: 93. [Google Scholar]
- 46.R Core Team R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2022. https://www.R-project.org/.
- 47. Clauset A, Shalizi CR, Newman ME. Power law distributions in empirical data. SIAM Review. 2009; 51: 661–703. doi: 10.1137/070710111 [DOI] [Google Scholar]
- 48. Gabaix X, Ibragimov R. Rank—1 / 2: A simple way to improve the OLS estimation of tail exponents. Journal of Business and Economic Statistics. 2011; 29: 24–39. doi: 10.1198/jbes.2009.06157 [DOI] [Google Scholar]
- 49. Hill BM. A simple general approach to inference about the tail of a distribution. Annals of Statistics. 1975; 3: 1163–1174. doi: 10.1214/aos/1176343247 [DOI] [Google Scholar]
- 50. Theil H. A rank-invariant method of linear and polynomial regression analysis, I. Proceedings of the Koninklijke Nederlandse Akademie van Wetenschappen A. 1950; 53: 386–392. [Google Scholar]
- 51. Schwarz GE. Estimating the dimension of a model. Annals of Statistics. 1978; 6: 461–464. doi: 10.1214/aos/1176344136 [DOI] [Google Scholar]
- 52. Hurvich CM, Tsai CL. Regression and time series model selection in small samples. Biometrika. 1989; 76: 297–307. doi: 10.1093/biomet/76.2.297 [DOI] [Google Scholar]
- 53. Vuong QH. Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica: Journal of the Econometric Society. 1989; 57: 307–333. doi: 10.2307/1912557 [DOI] [Google Scholar]
- 54. Rowhani P, Lobell DB, Linderman M, Ramankutty N. Climate variability and crop production in Tanzania. Agricultural and Forest Meteorology. 2011; 151: 449–460. doi: 10.1016/j.agrformet.2010.12.002 [DOI] [Google Scholar]
- 55. Knox J, Hess T, Daccache A, Wheeler T. Climate change impacts on crop productivity in Africa and south Asia. Environmental Research Letters. 2012; 7: article number 034032. doi: 10.1088/1748-9326/7/3/034032 [DOI] [Google Scholar]
- 56. Craparo ACW, Van Asten PJA, Laederach P, Jassogne LTP, Grab SW. Coffea arabica yields decline in Tanzania due to climate change: Global implications. Agricultural and Forest Meteorology. 2015; 207: 1–10. doi: 10.1016/j.agrformet.2015.03.005 [DOI] [Google Scholar]
- 57. Munishi LK, Lema AA, Ndakidemi PA. Decline in maize and beans production in the face of climate change at Hai district in Kilimanjaro region, Tanzania. International Journal of Climate Change Strategies and Management. 2015; 7: 17–26. doi: 10.1108/IJCCSM-07-2013-0094 [DOI] [Google Scholar]
- 58. Kahsay GA, Hansen LG. The effect of climate change and adaptation policy on agricultural production in eastern Africa. Ecological Economics. 2016; 121: 54–64. doi: 10.1016/j.ecolecon.2015.11.016 [DOI] [Google Scholar]
- 59. Wolka K, Mulder J, Biazin B. Effects of soil and water conservation techniques on crop yield, runoff and soil loss in sub-Saharan Africa: A review. Agricultural Water Management. 2018; 207: 67–79. doi: 10.1016/j.agwat.2018.05.016 [DOI] [Google Scholar]
- 60. Mukamuhirwa A, Hovmalm HP, Bolinsson H, Ortiz R, Nyamangyoku O, Johansson E. Concurrent drought and temperature stress in rice. A possible result of the predicted climate change: Effects on yield attributes, eating characteristics, and health promoting compounds. International Journal of Environmental Research and Public Health. 2019; 16: article number 1043. doi: 10.3390/ijerph16061043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Chapman S, Birch CE, Pope E, Sallu S, Bradshaw C, Davie J, et al. Impact of climate change on crop suitability in sub-Saharan Africa in parameterized and convection-permitting regional climate models. Environmental Research Letters. 2020; 15: article number 094086. doi: 10.1088/1748-9326/ab9daf [DOI] [Google Scholar]
- 62. Mkonda MY, He XH. The influence of soil organic carbon and climate variability on crop yields in Kongwa district, Tanzania. Environmental Management. 2022. [DOI] [PubMed] [Google Scholar]
- 63. Narita D, Sato I, Ogawada D, Matsumura A. Integrating economic measures of adaptation effectiveness into climate change interventions: A case study of irrigation development in Mwea, Kenya. Plos ONE. 2020; 15: article number e0243779. doi: 10.1371/journal.pone.0243779 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Ojara MA, Lou YS, Ongoma V, Mumo L, Akodi D, Ayugi B, et al. Projected changes in east African climate and its impacts on climatic suitability of maize production areas by the mid-twenty-first century. Environmental Monitoring and Assessment. 2021; 193: article number 831. doi: 10.1007/s10661-021-09547-4 [DOI] [PubMed] [Google Scholar]
- 65. Shabani Y, Pauline NM. Perceived effective adaptation strategies against climate change impacts: Perspectives of maize growers in the southern highlands of Tanzania. Environmental Management. 2022. [DOI] [PubMed] [Google Scholar]
- 66. Warsame AA, Sheik-Ali IA, Jama OM, Hassan AA, Barre GM. Assessing the effects of climate change and political instability on sorghum production: Empirical evidence from Somalia. Journal of Cleaner Production. 2022; 360: article number 131893. doi: 10.1016/j.jclepro.2022.131893 [DOI] [Google Scholar]
- 67. Newman ME. Power laws, Pareto distributions and Zipf’s law. Contemporary Physics. 2005; 46: 323–351. doi: 10.1080/00107510500052444 [DOI] [Google Scholar]
- 68. Muli JM, Omanga PA, Jones RB. Farmer-to-farmer seed supply: Case study of pigeon pea seed distribution in Kenya. In: Rohrbach DD, Bishaw Z, Vangastel AJG, editors. Alternative strategies for smallholder seed supply. 1997. pp. 188–191. [Google Scholar]
- 69. Atera EA, Onyango JC, Azuma T, Asanuma S, Itoh K. Field evaluation of selected NERICA rice cultivars in western Kenya. African Journal of Agricultural Research. 2011; 6: 60–66. [Google Scholar]
- 70.Fermont AM, Obiero HM, van Asten PJA, Bagurna Y, Okwuosa E. Improved cassava varieties increase the risk of soil nutrient mining: An ex-ante analysis for western Kenya and Uganda. In: Bationo A, Waswa, B, Kihara J, Kimetu J, editors, Advances in integrated soil fertility management in sub-Saharan Africa—Challenges and opportunities. 2007. pp. 511-519.
- 71.Pole FN, Saha HM, Mangale N, Mzingirwa AM, Munyambu P. On-Station evaluation of maize genotypes for nutrient and water use efficiency in the semi arid lands of coastal Kenya. In: Filho WL, Esilaba AO, Rao KPC, Sridhar G, editors, Adapting African agriculture to climate change—Transforming rural livelihoods. 2015. pp. 199-206.
- 72. Murungweni C, Van Wijk MT, Smaling EMA, Giller KE. Climate-smart crop production in semi-arid areas through increased knowledge of varieties, environment and management factors. Nutrient Cycling in Agroecosystems. 2016; 105: 183–197. doi: 10.1007/s10705-015-9695-4 [DOI] [Google Scholar]
- 73. Thierfelder C, Rusinamhodzi L, Setimela P, Walker F, Eash NS. Conservation agriculture and drought-tolerant germplasm: Reaping the benefits of climate-smart agriculture technologies in central Mozambique. Renewable Agriculture and Food Systems. 2016; 31: 414–428. doi: 10.1017/S1742170515000332 [DOI] [Google Scholar]
- 74. Mkonda MY, He XH. Conservation agriculture in Tanzania. Sustainable Agriculture Reviews. 2017; 25: 309–324. doi: 10.1007/978-3-319-48006-0_10 [DOI] [Google Scholar]
- 75. Shikuku KM, Winowiecki L, Twyman J, Eitzinger A, Perez JG, Mwongera C, et al. Small holder farmers’ attitudes and determinants of adaptation to climate risks in east Africa. Climate Risk Management. 2017; 16: 234–245. doi: 10.1016/j.crm.2017.03.001 [DOI] [Google Scholar]
- 76. Alibu S, Neuhoff D, Senthilkumar K, Becker M, Kopke U. Potential of cultivating dry season maize along a hydrological gradient of an inland valley in Uganda. Agronomy-Basel. 2019; 9: article number 606. [Google Scholar]
- 77. Kabubo-Mariara J, Mulwa R. Adaptation to climate change and climate variability and its implications for household food security in Kenya. Food Security. 2019; 11: 1289–1304. doi: 10.1007/s12571-019-00965-4 [DOI] [Google Scholar]
- 78. Kiwia A, Kimani D, Harawa R, Jama B, Sileshi GW. Sustainable intensification with cereal-legume intercropping in eastern and southern Africa. Sustainability. 2019; 11: article number 2891. doi: 10.3390/su11102891 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Obunyali CO, Karanja J, Oikeh SO, Omanya GO, Mugo S, Beyene Y, et al. On-farm performance and farmers’ perceptions of DroughtTEGO climate-smart maize hybrids in Kenya. Agronomy Journal. 2019; 111: 2754–2768. doi: 10.2134/agronj2019.08.0600 [DOI] [Google Scholar]
- 80. Mairura FS, Musafiri CM, Kiboi MN, Macharia JM, Ng’etich OK, Shisanya CA, et al. Determinants of farmers’ perceptions of climate variability, mitigation, and adaptation strategies in the central highlands of Kenya. Weather and Climate Extremes. 2021; 34: article number 100374. doi: 10.1016/j.wace.2021.100374 [DOI] [Google Scholar]
- 81. Sedebo DA, Li GC, Etea BG, Abebe KA, Ahiakpa JK, Arega Y, et al. Impact of small holder farmers’ climate-smart adaptation practices on wheat yield in southern Ethiopia. Climate and Development. 2022; 14: 282–296. doi: 10.1080/17565529.2021.2014777 [DOI] [Google Scholar]
- 82. Zizinga A, Mwanjalolo JGM, Tietjen B, Bedadi B, Gabiri G, Luswata KC. Effect of mulching and permanent planting basin dimensions on maize (Zea mays L.) production in a sub-humid climate. Water. 2022; 14: article number 79. doi: 10.3390/w14010079 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
(CSV)
(CSV)
(CSV)
(CSV)
(CSV)
(CSV)
(CSV)
(CSV)
(TXT)
Data Availability Statement
All relevant data are within the paper and its Supporting information files.