Skip to main content
PLOS One logoLink to PLOS One
. 2023 Jun 13;18(6):e0287011. doi: 10.1371/journal.pone.0287011

Time series and power law analysis of crop yield in some east African countries

Idika E Okorie 1, Emmanuel Afuecheta 2,3, Saralees Nadarajah 4,*
Editor: Steven Arthur Loiselle5
PMCID: PMC10263363  PMID: 37310978

Abstract

We carry out a time series analysis on the yearly crop yield data in six east African countries (Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda) using the autoregressive integrated moving average (ARIMA) model. We describe the upper tail of the yearly crop yield data in those countries using the power law, lognormal, Fréchet and stretched exponential distributions. The forecast of the fitted ARIMA models suggests that the majority of the crops in different countries will experience neither an increase nor a decrease in yield from 2019 to 2028. A few exceptional cases correspond to significant increase in the yield of sorghum and coffee in Burundi and Rwanda, respectively, and significant decrease in the yield of beans in Burundi, Kenya and Rwanda. Based on Vuong’s similarity test p–value, we find that the power law distribution captured the upper tails of yield distribution better than other distributions with just one exceptional case in Uganda, suggesting that these crops have the tendency for producing high yield. We find that only sugar cane in Somalia and sweet potato in Tanzania have the potential of producing extremely high yield. We describe the yield behaviour of these two crops as black swan, where the “rich getting richer” or the “preferential attachment” could be the underlying generating process. Other crops in Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda can only produce high but not extremely high yields. Various climate adaptation/smart strategies (use of short-duration pigeon pea varieties, use of cassava mosaic disease resistant cassava varieties, use of improved maize varieties, intensive manuring with a combination of green and poultry manure, early planting, etc) that could be adapted to increase yields in east Africa are suggested. The paper could be useful for future agricultural planning and rates calibration in crop risk insurance.

1 Introduction

Africa is the poorest continent. It is struggling to feed its people. Hence, enhancement of crop production is important.

Furthermore, farmers are more interested in investing in crops that are capable of producing high yields not crops that can produce extremely low yield. They want to maximize the profit on their investment. Crops that have the potential for high yield are likely to attract low premium in crop yield insurance.

There have been several papers on high crop yield in African countries. While discussing nutrients in the west African Sudano-Sahelian zone, [1] noted that “shrubs and trees with their alternating periods of nutrient storing and recycling in leaves and wood, micro-depressions, termite mounts and ant nests become localised points of nutrient concentration and high crop productivity”. While investigating the importance of liming acid soils, [2] demonstrated that “severely acidified soils of the western highlands of Cameroon should be limed at moderate rates to sustain crop productivity”. While examining the seed supply system for maize production in southwestern Nigeria, [3] observed that “about 39% of farmers used improved varieties for high crop yields, 24% for disease resistance and 22% for market preferences, whereas local varieties were cultivated by 37% of farmers because of market preferences and availability, 16% because of low cost and 12% because of disease resistance”. [4] demonstrated that continuous-flow drip irrigation in Bauchi state of Nigeria delivers “high crop yields especially if the crops are grown under appropriate agronomic practices that enable protraction of the growth season”. [5] demonstrated that high maize yields on sandy soils in Zimbabwe can be achieved by using mineral fertilizers. According to [6], among many oilseed crops (for example, sunflower, soybeans, rapeseed/mustard, sesame, groundnuts, etc) grown in Kenya, oilseed rape is preferred because of its high yields (1.5 tons—4.0 tons / hectare) with high oil content of 42–46%. While comparing three fertigation strategies of grapes in the Berg River Valley region of South Africa, [7] found that “less berry crack contributed to a higher yield and higher export percentage of grapes”. While analysing the benefits of soil conservation in the Kondoa eroded area of Tanzania by conducting a household survey of 240 households, [8] observed that 56% of the respondents gained high crop yields. [9] investigated limited nitrogen content, a major challenge to sustainable and high crop production, for agricultural soils of lower eastern Kenya. While evaluating small holder farmers’ preferences for climate smart agricultural practices in Tehuledere district, northeastern Ethiopia, [10] found that “high and moderate climate resilience and high crop yield agricultural practices had a positive utility”. [11] demonstrated that phosphorus treatment for rice fields in lowlands in the central highlands of Madagascar significantly and consistently accelerated initial production with high crop growth rate and shortened days to heading. According to [12], “rain fed agriculture has a high crop yield potential if rainfall and soil nutrient input resources are utilized effectively”.

But none of these papers discuss the distribution of crop yield or forecasts. The distributions of crop yields is very useful in agribusiness. These distributions can help to tackle food shortages and insecurity by understanding how natural resources and farmers attitude towards crops selection and cultivation can control agricultural productivity, in agricultural policy assessment and to calibrate rates and premiums in crop insurance. Similarly, understanding the trend of crop yield and the insights gained from crop yield predictions can go a long way in helping to address the current global issue of increase in food prices and demand as well as to understand the associated risk of food production by helping farmers to make informed decisions especially on what and where to grow.

We are also not aware of any previous research that has focused on predicting crop yield in east Africa let alone doing so in such an almost holistic manner as we have done in this paper; so, to bridge this research gap, we follow [13] to provide some crop yield forecast in some east African countries. We believe that the results herein will be of extreme importance to east African regional farmers.

The aim of this paper is two folded. First, to forecast the crop yield and secondly to identify cash crops that are capable of producing extremely high yield in some east African countries by modelling the tail region of crop yield data. The remainder of this paper contains data in Section 2, methods in Section 3, results and discussion in Section 4 and conclusions in Section 5.

We use two methods for analyzing the data: time series analysis and fit of heavy tailed distributions. Time series analysis and forecasting is a branch of statistics. Time series forecasting uses models to predict future outcomes based on past observations. With time series visualizations, trends and seasonal patterns could be identified. We could then seek to gain deeper insight as regards to the reason behind these trends. Several time series models have been developed, studied and widely applied in many fields. Box-Jenkins’ auto-regressive integrated moving average (ARIMA) model [14] arguably stands out among others as the most widely used perhaps due to its simplistic application appeal and high precision in modelling. For instance, [15] used the ARIMA model to forecast rice production, consumption, importation, exportation and self-sufficiency in the Benin Republic. [16] used the ARIMA model to forecast the consumption of some livestock products such as eggs, milk, chicken and cow meat to see if the forecast of consumption was on the increase. [17] highlighted that the past century has witnessed significant rise and fall of cocoa production in Nigeria due to diverse institutional and climate changes. They used the ARIMA model to predict cocoa production in Nigeria between 2018 and 2025. Their forecast showed a decreasing trend where cocoa production is expected to fall by more than 20% in 2025 against the 2017 value. [18] used the ARIMA model to forecast maize production in India from 2018 to 2022. The model predicted about 13.76% increase in maize production in India. [19] used the ARIMA model to forecast soybean yield in Zambia. The forecast suggested 23430.3 hectogram / hectare yield increase in 2020 compared to the 2016 figure of 19624 hectogram / hectare. [20] used the ARIMA model to forecast Kharif rice production in West Bengal, India which contributes about 15% of the total paddy in India. [21] used the ARIMA model to forecast sorghum production in South Africa from 2017 to 2020. Their forecast depicted an increasing trend. [22] used the ARIMA model to forecast sugar cane production in Pakistan from 2019 to 2030. Their forecast indicated a significant increase.

Quantifying the tail of the crop yield distribution is vital for managing agricultural production risk and rating crop insurance [23]. The simplest and the most widely used distribution for modelling rare outcomes occurring in the tail region is the power law distribution. Many processes follow the power law over large magnitude of values. Recent examples are the distribution of stock returns [24], income [25, 26], wealth of world billionaires [27], persisters-antibiotic-tolerant cells [28], duration size of unhealthy air pollution events [29], tourism recommendations [30], cumulative coal production [31], agricultural land size [32], rates of wetland loss [18], union size [33], strike size [34] and growth rate of CO2 [35]. Popular alternatives to the power law distribution are the lognormal, stretched exponential, and Fréchet distributions.

2 Data

Yearly data from 1961 to 2018 on the yield of cash crops like banana, plantain, beans, cassava, coffee, sorghum, potato, sweet potato, maize, rice, sugar cane, wheat, millet and cotton seed from six countries in east Africa (namely, Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda) were obtained from Food and Agriculture Organization of United Nations-FAO, see http://www.fao.org/faostat/en/#home. The data obtained were yields aggregated at national levels.

The time plots of the crops in different countries are shown in Figs 1 and 2. Some sudden changes, particularly big drops and falls could be seen at different times indicating periods of high and low yields. These changes could be as a result of the global economic outlook, environmental/climate changes or even changes in farming practices.

Fig 1. Time series plots for crop yield in different countries.

Fig 1

Fig 2. Time series plots for crop yield in different countries.

Fig 2

Some descriptive statistics of the data for crops are presented in Table 1. The statistics include the mean, median, standard deviation, minimum, maximum, skewness and kurtosis. The discrepancy between the mean and the median values appears not to be large for almost all the crops across the countries. The mean is larger than the standard deviation for all the crops across the countries. This suggests that the data are underdispersed. Note that underdispersion could be as a result of serial correlation which is typical of time series data. We can remove serial correlation by random variable transformation. But, this may lead to (a) loss of data information and (b) limits us to specific class of models to use. The data exhibit varying degrees of skewness and kurtosis across crops and countries. The lowest (highest) positive skewness of 0.0261 (3.1150) corresponds to maize (sweet potato) in Kenya (Burundi). The lowest (highest) negative skewness of -0.0934 (-2.0220) corresponds to rice (cassava) in Kenya (Burundi). The lowest (highest) positive kurtosis of 0.0410 (13.0916) corresponds to sorghum (banana) in Rwanda (Burundi). The lowest (highest) negative kurtosis of -0.0043 (-1.0478) corresponds to coffee (maize) in Uganda (Kenya). Crop yield skewness has been used to characterize crop yield tendencies. [36] reported that crop yield is positively skewed in the presence of independent, identical and uniform resource availability distribution. Crop yield is negatively skewed whenever the distributions are Gaussian, i.e. skewness depends on asymmetries in resource availabilities, meaning that a negatively skewed yield occurs whenever production is tightly controlled so that the left tails of some resources availabilities distributions are thin [36]. However, in addition to the observable similarities between the mean and the median crop yield values, we notice that for majority of the cases, the skewness and kurtosis values are close to zero, suggesting possible symmetry and mesokurtosis.

Table 1. Descriptive measures for the crop yield data sets.

Country Crop Min. 1st qu. Median Mean 3rd qu. Max. Std. dev. Skewness Kurtosis
Burundi Banana 46915 54433 56947 60700 62984 127352 12204.970 3.0704 13.0916
Beans 6044 9067 10036 9599 10380 13184 1375.755 -0.8491 0.9814
Cassava 41867 85880 89890 85078 90894 112378 12870.850 -2.0220 4.2142
Coffee 2687 6961 8251 8100 9379 11598 1769.862 -0.5861 0.1126
Sorghum 5890 9837 10000 10426 11881 14042 1886.384 -0.3327 -0.1633
Sweet potato 59048 63302 63837 68215 65848 133015 13063.090 3.1150 10.0345
Kenya Beans 3127 4783 5556 5413 6122 8382 1034.896 0.0585 0.3111
Coffee 9212 14918 16839 18293 21078 31991 5003.894 0.7453 0.2387
Maize 10713 12957 15813 15507 17266 20712 2605.723 0.0261 -1.0478
Rice 13076 34802 39621 39988 46510 61813 10166.130 -0.0934 -0.0593
Sugar Cane 297552 689809 808548 774305 885805 1211845 217545.200 -0.4537 -0.0124
Wheat 9212 14918 16839 18293 21078 31991 5003.894 0.7453 0.2387
Somalia Banana 88430 169641 170374 185025 198616 317500 46006.370 0.7969 0.7965
Maize 4149 8173 9758 10065 11765 17901 3029.413 0.5619 -0.1600
Sorghum 2040 3320 3522 3992 4280 9824 1306.699 2.2982 6.9370
Sugar Cane 272727 350000 407143 604201 887500 1455975 346839.200 1.0554 -0.2674
Tanzania Maize 4808 9170 12722 12734 14414 31359 4893.517 1.3077 2.9604
Millet 4522 7010 8308 8697 9982 19507 2401.514 1.5969 5.1457
Rice 7143 12826 16286 16396 19172 27382 4933.184 0.2870 -0.7743
Cotton Seed 2328 4356 5136 5117 5783 7936 1245.132 0.2257 -0.4752
Sorghum 4423 6554 9151 8832 10100 17963 2717.268 0.4293 0.5903
Sweet potato 10448 18029 29252 34621 49412 72759 19284.670 0.5541 -1.0485
Uganda Banana 23298 39412 42070 40891 44927 48333 5636.359 -1.3614 1.613
Cassava 32973 44771 66988 71711 89971 144083 32354.340 0.7664 -0.5467
Coffee 3839 5443 6131 6402 7150 10283 1408.237 0.6347 -0.0043
Millet 8092 11486 14017 13341 15986 16751 2637.066 -0.6120 -0.945
Plantain 42971 52141 56585 59014 60867 84235 11223.650 0.8573 -0.0373
Sweet potato 24009 35504 41558 40660 44017 62075 6398.938 0.3221 1.5048
Rwanda Beans 5606 6980 8020 7858 8522 10258 1037.677 -0.0566 -0.8005
Cassava 11778 55212 91644 82820 116873 164000 37331.23 -0.2683 -1.0465
Coffee 2678 5267 5994 6051 6776 11019 1473.642 0.6369 1.6200
Potato 22821 64313 68656 71994 82871 130600 20435.600 0.2417 0.7926
Sorghum 6850 10014 11000 11109 12086 15084 1754.313 0.2655 0.0410
Sweet potato 34388 53682 62550 63978 75039 96163 13269.530 0.0977 -0.4860

Figs 3 and 4 show boxplots to support the descriptive statistics in Table 1 and to compare the yield performance of some of the crops that are produced in more than one east African country. We see that Somalia recorded the highest banana and sugar cane yields. Burundi recorded the highest beans, coffee and sweet potato yields. Rwanda recorded the highest cassava yield. Kenya recorded the highest rice yield. Tanzania recorded the highest sorghum, maize and millet yields. Also, evident enough in Figs 3 and 4 are the presence of extreme (high and low) yields for some of the crops which are indicated by observations lying outside of the whiskers in the box plots. The power law distribution discussed later is especially useful for modelling unusually high yields.

Fig 3. Box plots for crop yield in different countries.

Fig 3

Fig 4. Box plots for crop yield in different countries.

Fig 4

We tested heavytailedness of the each data set using [37]’s test based on Kolmogorov-Smirnov statistic corrected for correlation [38]. The p–values of this test for banana, beans, cassava, coffee, sorghum and sweet potato in Burundi were 0.182, 0.0664, 0.151, 0.102, 0.156 and 0.115, respectively. The p–values for the crops in Kenya were 0.162, 0.171, 0.059, 0.120, 0.166 and 0.145. The p–values for the crops in Somalia were 0.167, 0.157, 0.098 and 0.115. The p–values for the crops in Tanzania were 0.168, 0.112, 0.095, 0.068, 0.096 and 0.125. The p–values for the crops in Uganda were 0.177, 0.114, 0.087, 0.171, 0.105 and 0.077. The p–values for the crops in Rwanda were 0.075, 0.169, 0.068, 0.098, 0.061 and 0.158. The p–values reported show that there is no significant evidence against the fact that each data has a heavy tail. Hence, unusually high yields can be modeled by heavy tailed distributions as done in Section 4.

3 Methods

3.1 Time series analysis of crop yields

One possible technique for time series analysis is to assume that the overall mean is either constantly increasing or constantly decreasing with respect to time. In this case, the fit of a sloping line might be appropriate for the time series. This type of line is typically referred to as a linear trend model or a trend-line model and it is a special case of a simple linear regression model with time index t as the only predictor variable, i.e. t = 1, 2, 3, …. The estimated trend line is the line that minimizes the sum of the squared vertical deviations from the data. Trend lines serve as important visual aids. However, they often perform poorly in forecasting beyond the historical data. In practice, majority of the time series data that arise in different areas cannot be described by some straight lines because their trends often undergo evolution. Given the past observations, the trend-line model attempts to find the intercept and slope that give the best average fit to the data. Unfortunately, the deviation of the linear trend model from the data is usually greatest at the end of the time series where the forecasting starts. Therefore, in time series analysis and forecasting, the important question ‘what is the appropriate model?’ can first be addressed by visually inspecting the time series data for any constantly changing trend or randomly changing trend. Based on Figs 1 and 2, we see that assuming a steady upward or downward linear trend for any of the crop yield data is apparently illogical and out of place because a randomly changing trend is overwhelmingly evident for all the time series data. To model the nonlinear trend in all the time series, we may need to regress the time series on second or higher order terms of t and this may require some trial and errors which may possibly lead to some overestimated or underestimated models. To circumvent the issue of model selection, we consider the most reliable models for nonlinear trends in time series and they are referred to as stochastic time-series models. Examples of such models are the one proposed by [14] which involve straightforward laid down iterative procedures for model fitting unlike the nonlinear regression method mentioned earlier.

In this section, we carry out a time series analysis to study the yield pattern of crops over a specified period of time. We need to isolate first the impact of trends (the overall pattern in the series) and second the impact of random disturbances (the vigorous wiggles in the series). The impact of trends could be due to planting strategies and techniques, advanced mechanized farming, farm management, irrigation, the use of fertilizers and genetically improved seedlings/crops. The impact of random disturbances could be due to pandemics, crop disease outbreaks, wars, recessions, environmental degradations (for example, erosion) and extreme weather conditions such as droughts and floods.

Let xt denote the observed yield of a crop at time t. Suppose we denote all the observed information up to time t by It. We are interested in forecasting xt. We can specify the forecast as xt|It or more specifically as x^t+h|t. The forecast of xt+h given all previous observations up to time t (x1, x2, …, xt) is known as the h–step forecast. The h–step forecasting method can be easily implemented through the famous Box-Jenkins autoregressive integrated moving average (ARIMA) modelling framework. ARIMA models are used for trend analysis and forecasting. The ARIMA (p, d, q) model is defined by

[1-i=1pϕiBi](1-B)dxt=c+[1+j=1qθjBj]ξt,

where ϕ’s are the autoregressive (AR) parts of the model, θ’s are the moving average (MA) parts of the model, d is the order of difference, B is known as the backshift operator, c is a constant which is equal to μ(1 − ϕ1 − ⋯ − ϕp), μ is the mean of the dth differenced series (1 − B)dxt and ξt is white noise. ξt are generally assumed to be independent, identically distributed variables sampled from a normal distribution with zero mean. In ARIMA modelling, we make the following assumptions about the time series: there are no seasonality or cyclical trends, there are no outliers, and that the variation about the mean is consistent. After fitting the ARIMA model, we can check the model adequacy viz-a-viz a popular portmanteau test called Ljung-Box test by simply testing whether the residuals from the fitted model are white noise. For Ljung–Box test, we test the hypothesis H0: ρk = 0 versus H1: ρk ≠ 0. The test statistic of Ljung-Box test is

Q=n(n+2)k=1hρ^k2(n-k),

where n is the sample size, ρ^k is the sample autocorrelation at lag k, and h is the number of lags being tested. Under H0, the statistic Q is asymptotically chi-square distributed with h degrees of freedom. At α significance level, the critical region for rejecting the hypothesis of randomness is Q>χ1-α,h2, where χ1-α,h2 denotes the (1 − α)th quantile of the chi-squared distribution with h degrees of freedom.

A detailed discussion of Box-Jenkins ARIMA (p, d, q) model could be read from [39] and [40]. In Figs 1 and 2, we find some evidence of changing variance in some of the series. Each series appears clearly non-stationary as the series wanders up and down. Before proceeding with the data analysis, we ensured that the variance for each series is stabilized by the Box-Cox transformation [41].

The Box Cox transformation involves an exponent, λ ∈ [−5, 5]. In this paper, all values of λ are considered but the optimal value for each data is applied. The optimal value of λ is the one that gives the best approximation of the Gaussian distribution. The transformation of xt has the form:

xt(λ)={xtλ-1λ,ifλ0,ln(xt),ifλ=0. (1)

The formula in (1) is not as simple as it appears because testing for all possible values one by one is unnecessarily time consuming. However, most software packages include an option for a Box-Cox transformation. In this paper, we used the auto.arima function in the forecast package in the R (R Core Team, 2022) software to fit the ARIMA (p, d, q) models. Setting the lambda argument to auto allows a transformation to be automatically selected and implemented using the Box-Cox method. The routinely transformed data are then coerced into stationarity by implementing first or second order differences whenever there is any need to do so before estimating the appropriate model.

Each coerced series was tested for stationarity using [42]’s test. The null hypothesis was that the series is stationary. The p–values for banana, beans, cassava, coffee, sorghum and sweet potato in Burundi were 0.085, 0.085, 0.089, 0.095, 0.075 add 0.083, respectively. The p–values for the crops in Kenya were 0.057, 0.081, 0.051, 0.069, 0.086 and 0.056. The p–values for the crops in Somalia were 0.098, 0.078, 0.089 and 0.083. The p–values for the crops in Tanzania were 0.067, 0.078, 0.082, 0.083, 0.081 and 0.052. The p–values for the crops in Uganda were 0.094, 0.086, 0.095, 0.051, 0.092 and 0.090. The p–values for the crops in Rwanda were 0.098, 0.099, 0.053, 0.073, 0.090 and 0.080.

3.2 Analysis of the maximum crop yields

Suppose we denote the crop yield random variable by X with realizations xi, i = 1, 2, …, n, where n represents the number of observations. For the convenience of fitting distributions to the available data, we assume that the xi are random. The assumption of independence is not technically correct as the data are actually serially correlated. But ignoring dependence in a data set and treating the data as being independent has no effect on parameter estimates, it only affects standard errors (see, for example, [43]). Hence, the results presented later on the fit of heavy tailed distributions are correct as accuracy of estimation is not taken into account.

The probability density functions (PDFs) of the fitted heavy tailed distributions are

  1. The power law distribution also known as Pareto distribution of type I [44] specified by the PDF
    f(x)=α-1xmin(xxmin)-α
    for xxmin > 0, where xmin is the lower bound and α > 0 is the exponent. At or above xmin, the distribution exhibits properties of a power law distribution.
  2. The lognormal distribution specified by the PDF
    f(x)=1bx2πexp[-(lnx-a)22b2]
    for x > 0, where −∞ < a < ∞ and b > 0 are the location and scale parameters, respectively.
  3. The stretched exponential distribution specified by the PDF
    f(x)=ba(xa)b-1exp[-(xa)b]
    for x > 0, where a > 0 is the scale parameter and b > 0 is the shape parameter.
  4. Fréchet distribution [45] specified by the PDF
    f(x)=babx-1-bexp[-(xa)-b]
    for x > 0, where a > 0 is the scale parameter and b > 0 is the shape parameter.

We estimated the parameters of all the distributions by the method of maximum likelihood through the optim routine in R [46]. We estimated xmin in the power law distribution by following the method in [47]. That is, we chose xmin that minimized

KS=maxxxmin|Fn(x)-F^(x)|,

where Fn(x) and F^(x) denote, respectively, the empirical and fitted power law distribution functions for xxmin.

We have used the method of maximum likelihood because of its popularity. There are other methods for estimation; in particular, for estimating α of the power law distribution. Some of these estimators include the rank estimator due to [48], [49]’s estimator and the median estimator due to [50].

Note that each of the four distributions has two free parameters. So, no one distribution is more flexible than the others in terms of the number of parameters. Unlike the power law distribution, the lognormal, Fréchet and stretched exponential distributions model the entire data. We can compare their fits by the following goodness-of-fit measures:

  1. Bayes information criterion (BIC) due to [51] defined by
    BIC=-2L^+kln(n),
  2. Akaike information criterion with a correction (AICc) due to [52] defined by
    AICc=AIC+2k(k+1)n-k-1,

where L^ and k denote, respectively, the maximized log likelihood value and the number of unknown parameters.

We can also compare all of the fitted distributions through the Kolmogorov–Smirnov test. Its statistic is given by

KS=maxxData|Fn(x)-F^(x)|,

which was corrected as in [38] to account for correlation in the data. The larger the value of the corresponding KS p–value the better the fitted distribution. We require the p–value of the Kolmogorov–Smirnov test to be greater than 0.05 to conclude that the distribution is a reasonable model for the data. A p–value less than 0.05 suggests an absolute rejection of the distribution as a candidate for the data. However, one major drawback of the Kolmogorov–Smirnov p–value is that it depends on fixed parameters, hence it does not reflect sampling variability. We can calculate more conservative p–values by a bootstrapping method in [47]. We implemented this method by using 5000 bootstrap replications to obtain the final p–value for the Kolmogorov–Smirnov test. In this paper, we shall use the non-bootstrapped KS p–value to verify the plausibility of each distribution as a candidate model for data. We use the bootstrapped KS p–value to discriminate among competing distributions and to generalize our findings.

Vuong test [53] can be used to discriminate between two non-nested models by testing the null hypothesis that the models provide indistinguishable fits for the same data. Suppose we denote the probabilities for models 1 and 2 by P(x|Θ1^) and P(x|Θ2^), respectively, where Θ1^ and Θ2^ denote the parameter estimates for models 1 and 2, respectively. Let d=lnP(x|Θ1^)-lnP(x|Θ2^). The test statistic for Voung’s test is Λ=nd¯sd, where d¯ and sd denote the mean and standard deviation of d, respectively. A large, positive test statistic value provides evidence that model 1 is superior to model 2. A large, negative test statistic value gives evidence that model 2 is superior to model 1. Under the null hypothesis that the models are inseparable, the test statistic Λ is asymptotically standard normal distributed. Two finite sample corrections of Vuong’s test are sometimes considered based on the AIC and BIC penalty terms, depending on the complexity of the two models. However, these corrections sometimes generate conflicting conclusions.

4 Results and discussion

Ljung–Box p–values in Table 2 are > 0.05 suggesting that the residuals of the fitted ARIMA models are not statistically significant from white noise at 0.05 significance level for all the crops except for plantain in Uganda which is not statistically significant from white noise at 0.01 significance level. All of the fitted models are suitable for prediction based on the residual analysis. From the 10 years (2019–2028) point forecast (solid blue lines) of the fitted ARIMA models in Figs 5 to 10, we observe the following for Burundi: an initial sharp drop in 2019 followed by an upward swing of yield for banana; a sharp increase in 2019 followed by increasing oscillations of yield for sweet potato; the yield for sorghum shows a quick increase from 2019 to 2028; the yield for beans shows an immediate decline from 2019 to 2028; neither cassava nor coffee indicate any increasing or decreasing pattern from 2019 to 2028. In Kenya, we observe the following: the yield for beans shows a continuous decline from 2019 to 2028; neither upward nor downward yield trend is evident for coffee, rice, wheat and sugar cane from 2019 to 2028; the yield of maize shows a sharp drop in 2019 followed by an increase and then a stable trend. In Somalia, we observe the following: the yield for maize or sugar cane does not indicate any pattern; the yield for banana shows an initial moderate increase in 2019 followed by a period of no trend up to 2028; the yield for sorghum first experienced a sharp drop in 2019 followed by a stable period of no trend up to 2028. In Tanzania, we observe the following: no significant trend could be identified for maize, rice, sweet potato and cotton seed for the entire forecast period; millet is characterized by a slight yield decrease in 2019 followed by a period of no significant trend up to 2028. In Uganda, we observe the following: the forecast for banana, cassava, millet, plantain and sweet potato did not show any significant trend from 2019 to 2028; the yield for coffee shows a slight increase in 2019 followed by a period of neither increase nor decrease. In Rwanda, we observe the following: the yield for beans shows a persistent decline from 2019 to 2028; the yield for sweet potato shows initial jump followed by a slow decline; coffee indicated an upward trend tendency from 2019 to 2028; cassava, potato and sorghum did not indicate any significant trend.

Table 2. Ljung–Box test statistic (Q), its degree of freedom and its p–value for the fitted ARIMA models at lag 10 (i.e. h = 10).

Country Crop Fitted ARIMA (p, d, q) a Q df p–value
Burundi Banana ARIMA(2,1,2) 1.5746 6 0.9544
Beans ARIMA(0,1,1) with drift 5.9108 8 0.6572
Cassava ARIMA(0,1,0) 12.0820 10 0.2796
Coffee ARIMA(0,1,1) 8.3159 9 0.5027
Sorghum ARIMA(1,0,0) with non-zero mean 8.9597 8 0.3457
Sweet potato ARIMA(2,1,1) 7.8161 7 0.3491
Kenya Beans ARIMA(1,0,0) 12.0780 8 0.1478
Coffee ARIMA(0,1,1) 13.2300 9 0.1525
Maize ARIMA(0,1,3) 3.4473 7 0.8408
Rice ARIMA(0,1,0) 10.4230 10 0.4042
Sugar Cane ARIMA(0,1,0) 17.9800 10 0.0553
Wheat ARIMA(0,1,1) 13.9030 9 0.1258
Somalia Banana ARIMA(0,0,4) with non-zero mean 4.1658 5 0.5258
Maize ARIMA(0,1,0) 15.1300 10 0.1274
Sorghum ARIMA(1,1,1) 3.8376 8 0.8715
Sugar Cane ARIMA(0,1,0) 13.8200 10 0.1814
Tanzania Maize ARIMA(1,1,1) 11.8720 8 0.1570
Millet ARIMA(1,0,0) with non-zero mean 6.2974 8 0.6140
Rice ARIMA(0,1,1) 7.7560 9 0.5589
Cotton Seed ARIMA(0,1,1) 18.8250 9 0.0267
Sorghum ARIMA(2,1,0) 4.2521 8 0.8337
Sweet potato ARIMA(0,1,1) 12.8150 9 0.1711
Uganda Banana ARIMA(0,1,2) 3.8909 8 0.8668
Cassava ARIMA(0,1,0) 12.1600 10 0.2745
Coffee ARIMA(1,0,0) with non-zero mean 15.3660 8 0.0524
Millet ARIMA(0,1,1) 6.1006 9 0.7298
Plantain ARIMA(0,1,0) 19.0490 10 0.0397
Sweet potato ARIMA(0,1,0) 17.7340 10 0.0596
Rwanda Beans ARIMA(1,0,0) with non-zero mean 7.9098 8 0.4423
Cassava ARIMA(1,1,0) 2.3680 9 0.9842
Coffee ARIMA(1,0,0) with non-zero mean 3.5770 8 0.8931
Potato ARIMA(0,1,2) 4.4913 8 0.8103
Sorghum ARIMA(1,0,0) with non-zero mean 4.4304 8 0.8164
Sweet potato ARIMA(2,0,2) with non-zero mean 2.4531 5 0.7835

aDue to space constraints, we omit the coefficients of the fitted ARIMA models; interested readers can obtain them from the authors upon reasonable request.

Fig 5. Time series plots and 10 years yield forecast with fitted ARIMA models showing 80% and 95% confidence bands for crops in Burundi.

Fig 5

Fig 10. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Rwanda.

Fig 10

Fig 6. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Kenya.

Fig 6

Fig 7. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Somalia.

Fig 7

Fig 8. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Tanzania.

Fig 8

Fig 9. Time series plots along with 10 years yield forecast for the fitted ARIMA models showing 80% and 95% prediction confidence bands for crops in Uganda.

Fig 9

The changes observed in Figs 5 to 10 are consistent with findings in the literature. [54] established that both intra- and interseasonal changes in temperature and precipitation influence cereal yields in Tanzania. [55] reported that climate change will reduce mean yields in Africa by 17% for wheat, 5% for maize, 15% for sorghum and 10% for millet. No mean change in yield for rice was detected. Using data from the northern Tanzanian highlands, [56] demonstrated that increasing night time temperature is the most significant climatic variable responsible for diminishing coffea arabica yields between 1961 and 2012. According to [57], annual food crops in the Kilimanjaro region of Tanzania were particularly sensitive to the drought and maize and beans yields were lower than perennial crops during the years of drought. Through a simulation study, [58] predicted climate change in east Africa and found its negative impact on crop production in that region. They projected that the crop output decrease will lie between 1.2% and 4.5%. [59] identified soil erosion by water as one of the major causes of land degradation and dwindling agricultural produce in Africa resulting in an estimated yearly crop yield loss of about 280 million tons. [60] provided evidence to suggest that climate change severely impacted rice production in Rwanda. [61] produced evidence to suggest that temperature increases lead to decline in maize and cassava crops for Tanzania, Malawi, Zambia and South Africa. [62] observed that the yields for maize, sorghum or millet fluctuated at a decreasing trend in the Kongwa district of Tanzania. According to [63], increased temperatures in Kenya due to climate change have a general tendency to reduce rice yields. [64] showed that the impacts of projected changes in climate on maize production areas are the reduction in the suitability of the crop, especially around central and western Tanzania, mid-northern and western Uganda, and parts of western Kenya by 20–40%, and patches of east Africa will experience a reduction as high as 40–60%, especially in northern Uganda, and western Kenya. According to [65], maize production in southern highlands of Tanzania has decreased during the past two decades, since the year 2000. According to [66], climate change has induced a devastating effect on agricultural production in Somalia leading to crop yield to decline including sorghum.

Tables 3 and 4 give the BIC, AICc and the KS p–values of the fitted distributions. The BIC and AICc values for the power law distribution are smaller than those for the remaining distributions. The KS p–value > 0.05 in all the cases except for Millet in Uganda indicating that the power law is not a plausible distribution in this case. We cannot compare the values of the goodness-of-fit measures of the power law distribution with those of the other distributions because the power law distribution fits only the tails whereas the lognormal, Fréchet, and the stretched exponential distributions fit the entire data. Thus, we can only compare the BIC and AICc values of the lognormal, Fréchet, and the stretched exponential distributions. Based on the KS p–value, we can observe that the lognormal distribution could be a plausible distribution for banana and coffee in Burundi, all the crops in Kenya except for sugar cane, maize and sorghum in Somalia, all the crops in Tanzania, all but banana in Uganda and all but cassava in Rwanda. Fréchet distribution appears to be a plausible distribution for banana in Burundi, maize and wheat in Kenya, maize and sorghum in Somalia, all except for maize and sorghum in Tanzania and cassava, coffee and plantain in Uganda and all except for cassava and potato in Rwanda. The stretched exponential distribution appears to be a plausible distribution for beans and coffee in Burundi, all the crops in Kenya, maize in Somalia, all the crops in Tanzania, all except for plantain in Uganda and all the crops in Rwanda.

Table 3. AIC, BIC, AICc and KS p–values (as defined in Section 3) for the fitted distributions.

Country—Crop Power law Lognormal Fréchet Stretched exponential
BIC AICc p–value BIC AICc p–value BIC AICc p–value BIC AICc p–value
Burundi—Banana 723.717 719.814 0.954 1239.032 1235.129 0.051 1214.761 1210.859 0.629 1282.202 1278.299 0.000
Beans 519.823 515.920 0.532 1019.932 1016.029 0.004 1048.091 1044.188 0.001 1005.110 1001.208 0.092
Cassava 585.320 581.417 0.132 1293.669 1289.766 0.000 1337.395 1333.492 0.000 1250.241 1246.338 0.000
Coffee 307.680 303.777 0.931 1054.181 1050.278 0.112 1091.141 1087.238 0.001 1036.031 1032.129 0.645
Sorghum 78.290 74.387 0.930 1053.391 1049.489 0.001 1076.397 1072.495 0.000 1045.201 1041.298 0.000
Sweet potato 134.892 130.989 0.901 1243.862 1239.959 0.000 1184.906 1181.003 0.000 1290.785 1286.882 0.000
Kenya—Beans 493.325 489.422 0.504 980.215 976.313 0.223 999.376 995.474 0.010 979.803 975.900 0.432
Coffee 308.382 304.479 0.721 1043.274 1039.371 0.109 1390.577 1386.674 0.000 1038.009 1034.106 0.323
Maize 352.530 348.627 0.756 1084.795 1080.893 0.332 1093.394 1089.491 0.112 1085.083 1081.180 0.532
Rice 785.164 781.262 0.303 1251.779 1247.876 0.121 1282.416 1278.513 0.012 1242.221 1238.318 0.231
Sugar Cane 771.239 767.336 0.944 1613.395 1609.492 0.007 1639.809 1635.906 0.000 1595.751 1591.848 0.321
Wheat 418.665 414.762 0.835 1153.186 1149.283 0.821 1160.712 1156.810 0.712 1162.411 1158.508 0.402
Somalia—Banana 306.400 302.498 0.889 1412.801 1408.899 0.000 1430.688 1426.786 0.000 1421.742 1417.839 0.000
Maize 577.876 573.973 0.811 1097.639 1093.737 0.943 1109.941 1106.038 0.423 1102.289 1098.386 0.434
Sorghum 299.456 295.554 0.954 978.390 974.487 0.131 976.162 972.259 0.203 1006.621 1002.718 0.015
Sugar Cane 1561.743 1557.841 0.252 1621.627 1617.724 0.009 1608.998 1605.095 0.019 1636.254 1632.351 0.003
Tanzania—Maize 602.620 598.718 0.632 1146.010 1142.107 0.100 1157.454 1153.551 0.005 1155.097 1151.194 0.103
Millet 357.070 353.168 0.922 1061.506 1057.604 0.966 1066.259 1062.356 0.533 1080.704 1076.801 0.121
Rice 645.884 641.981 0.532 1157.416 1153.513 0.802 1169.649 1165.746 0.242 1157.795 1153.893 0.499
Cotton Seed 480.611 476.708 0.332 999.445 995.543 0.854 1016.260 1012.358 0.166 999.811 995.908 0.664
Sorghum 264.040 260.138 0.834 1088.711 1084.809 0.211 1100.409 1096.506 0.023 1089.489 1085.586 0.512
Sweet potato 823.410 819.507 0.200 1303.455 1299.553 0.661 1307.969 1304.066 0.183 1305.698 1301.795 0.402
Uganda—Banana 269.292 265.389 0.909 1187.916 1184.013 0.005 1224.341 1220.438 0.000 1158.594 1154.691 0.443
Cassava 688.846 684.944 0.518 1361.801 1357.898 0.535 1361.721 1357.818 0.515 1369.707 1365.804 0.323
Coffee 614.601 610.698 0.833 1007.740 1003.837 0.907 1013.23 1009.328 0.516 1017.461 1013.558 0.420
Millet 700.174 696.271 0.003 1094.174 1090.271 0.074 1113.196 1109.294 0.008 1079.048 1075.145 0.187
Plantain 884.580 880.678 0.667 1245.936 1242.033 0.138 1242.476 1238.573 0.384 1261.018 1257.115 0.004
Sweet potato 618.508 614.605 0.654 1189.226 1185.324 0.087 1209.564 1205.662 0.012 1196.161 1192.258 0.065

Table 4. Continuation of Table 3.

Country—Crop Power law Lognormal Fréchet Stretched exponential
BIC AICc p–value BIC AICc p–value BIC AICc p–value BIC AICc p–value
Rwanda—Beans 171.782 167.879 0.935 978.580 974.677 0.524 989.914 986.011 0.165 978.688 974.785 0.642
Cassava 152.243 148.340 0.909 1410.743 1406.840 0.019 1433.857 1429.954 0.004 1392.647 1388.744 0.176
Coffee 578.925 575.022 0.903 1017.965 1014.062 0.332 1040.145 1036.243 0.054 1022.827 1018.925 0.105
Potato 945.602 941.699 0.810 1332.154 1328.251 0.051 1363.027 1359.124 0.001 1324.835 1320.932 0.142
Sorghum 617.552 613.650 0.732 1038.192 1034.289 0.543 1054.371 1050.468 0.223 1043.405 1039.502 0.091
Sweet potato 278.219 274.317 0.983 1274.881 1270.979 0.675 1292.205 1288.303 0.263 1274.463 1270.560 0.243

Based on the AICc and BIC values in Tables 3 and 4, we can see that none of the three distributions that model the entire data (i.e. the lognormal, Fréchet and the stretched exponential distributions) consistently provide the best fit. None of them consistently gave the smallest AICc or smallest BIC values across the countries. The bootstrapped KS p–values in Table 5 indicate that the power law distribution is a plausible model for all the crop yield data. In general, the distribution with the smallest AICc and smallest BIC values corresponds to the distribution with the largest bootstrapped KS p–values. Fitting of such distributions to the tail of the data can be compared with that of the power law distribution by using Vuong’s test. The results of this comparison are presented in Table 6. We can observe that the stretched exponential distribution emerges as the best model for millet in Uganda and the power law distribution emerges as the best model for the rest of the crops except for a few cases where the winner is undecided. For instance, for sorghum in Burundi (power law and lognormal), sweet potato in Burundi (power law and Fréchet) and for banana in Somalia (power law and lognormal). The log-log plots of the fitted distributions superimposed with the empirical distributions are displayed in Figs 11 to 16. We can see that the power law distribution fits all the crop yield data well across the countries.

Table 5. Bootstrap KS-test p–values (as defined in Section 3) for the fitted distributions.

Country Crop Power law Lognormal Fréchet Stretched exponential
Burundi Banana 0.966 0.732 0.925 0.094
Beans 0.902 0.424 0.112 0.732
Cassava 0.832 0 0 0.021
Coffee 0.943 0.831 0.322 0.941
Sorghum 0.952 0.305 0.101 0.324
Sweet potato 0.903 0 0.221 0
Kenya Beans 0.913 0.902 0.591 0.944
Coffee 0.928 0.821 0 0.922
Maize 0.929 0.931 0.820 0.954
Rice 0.805 0.821 0.461 0.844
Sugar Cane 0.945 0.611 0.132 0.940
Wheat 0.905 0.962 0.901 0.922
Somalia Banana 0.923 0.016 0.012 0
Maize 0.933 0.942 0.912 0
Sorghum 0.955 0.813 0.828 0
Sugar Cane 0.903 0.523 0.631 0
Tanzania Maize 0.956 0.787 0.511 0.723
Millet 0.931 0.962 0.936 0.902
Rice 0.902 0.951 0.921 0.952
Cotton Seed 0.933 0.971 0.822 0.949
Sorghum 0.939 0.827 0.609 0.906
Sweet potato 0.804 0.965 0.919 0.915
Uganda Banana 0.965 0.514 0.055 0.931
Cassava 0.901 0.961 0.919 0.924
Coffee 0.955 0.980 0.911 0.923
Millet 0.201 0.612 0.423 0.919
Plantain 0.932 0.831 0.925 0.506
Sweet potato 0.909 0.810 0.424 0.734
Rwanda Beans 0.958 0.951 0.901 0.911
Cassava 0.987 0.612 0.301 0.812
Coffee 0.943 0.902 0.613 0.832
Potato 0.924 0.732 0.321 0.815
Sorghum 0.932 0.933 0.841 0.822
Sweet potato 0.949 0.910 0.906 0.931

Table 6. Vuong test statistic (Λ) and its p–value for comparing the upper tail (i.e. x > xmin) of the fitted power law distribution and the best among the rest of the competing distributions.

Country Crop Contest Statistic (Λ) p–value Winner
Burundi Banana Power law vs Fréchet 2.6671 0.0077 Power law
Beans Power law vs Stretched exponential 6.7366 0 Power law
Cassava Power law vs Stretched exponential 10.4399 0 Power law
Coffee Power law vs Stretched exponential 4.1021 0 Power law
Sorghum Power law vs Lognormal 1.2454 0.2130 Undecided
Sweet potato Power law vs Fréchet 0.7442 0.4568 Undecided
Kenya Beans Power law vs Stretched exponential 3.9727 1.0×10−4 Power law
Coffee Power law vs Stretched exponential 3.5192 4.0×10−4 Power law
Maize Power law vs Stretched exponential 2.8513 0.0044 Power law
Rice Power law vs Stretched exponential 4.4303 0 Power law
Sugar Cane Power law vs Stretched exponential 4.3048 0 Power law
Wheat Power law vs Lognormal 2.5905 0.0096 Power law
Somalia Banana Power law vs Lognormal 1.7339 0.0829 Undecided
Maize Power law vs Lognormal 2.6642 0.0077 Power law
Sorghum Power law vs Fréchet 1.9648 0.0494 Power law
Sugar Cane Power law vs Fréchet 4.5875 0 Power law
Tanzania Maize Power law vs Lognormal 5.4006 0 Power law
Millet Power law vs Lognormal 3.8995 1.0×10−4 Power law
Rice Power law vs Stretched exponential 2.8193 0.0048 Power law
Cotton Seed Power law vs Lognormal 2.8774 0.0040 Power law
Sorghum Power law vs Stretched exponential 2.8893 0.0039 Power law
Sweet potato Power law vs Lognormal 2.3193 0.0204 Power law
Uganda Banana Power law vs stretched exponential 2.9194 0.0035 Power law
Cassava Power law vs Lognormal 2.4766 0.0133 Power law
Coffee Power law vs Lognormal 2.7274 0.0064 Power law
Millet Power law vs Stretched exponential −2.8155 0.0049 Stretched exponential
Plantain Power law vs Fréchet 3.1373 0.0017 Power law
Sweet potato Power law vs Lognormal 3.9022 1.0×10−4 Power law
Rwanda Beans Power law vs Stretched exponential 3.5970 3.0×10−4 Power law
Cassava Power law vs Stretched exponential 2.9573 0.0031 Power law
Coffee Power law vs Lognormal 4.3420 0 Power law
Potato Power law vs Stretched exponential 5.4570 0 Power law
Sorghum Power law vs Lognormal 4.1122 0 Power law
Sweet potato Power law vs Lognormal 2.6648 0.0077 Power law

Fig 11. Log-log plots for crops yield in Burundi where the red line corresponds to the value of xmin in Table 7.

Fig 11

Fig 16. Log-log plots for crops yield in Rwanda where the red line corresponds to the value of xmin in Table 7.

Fig 16

Fig 12. Log-log plots for crops yield in Kenya where the red line corresponds to the value of xmin in Table 7.

Fig 12

Fig 13. Log-log plots for crops yield in Somalia where the red line corresponds to the value of xmin in Table 7.

Fig 13

Fig 14. Log-log plots for crops yield in Tanzania where the red line corresponds to the value of xmin in Table 7.

Fig 14

Fig 15. Log-log plots for crops yield in Uganda where the red line corresponds to the value of xmin in Table 7.

Fig 15

Since the power law model appears to be a plausible distribution for virtually all the crops across countries, we present the estimate for the parameters of the distribution in Table 7. We see that the power law mechanism may occur at varying degrees depending on the type of crop and country. See the ntail values for crops in Table 7, where ntail denotes the total number of observations equal to or above the threshold value xmin, i.e. the total number of data points following the power law distribution. The occurrence of such extremely high crop yield definitely has positive impact on farmers and food security. In this case, farmers can make huge profits. Crop yield risk insurance policies for such crops can attract relatively lower premium rates compared to crops with lower yields. The α value of the fitted power law model describes the heaviness of the tail distribution corresponding to extremely high crop yield events with yield > xmin. According to Table 7, the estimates of α are all > 2 indicating that the data in the right tail of the distribution show significant high inequality (i.e. large crop yield). However, there are two special cases satisfying 2 < α ≤ 3 specifically in Somalia for sugar cane and Tanzania for sweet potato. In these cases, the variance and higher-order moments for the crop yields are infinite regardless of whether their mean yield exists or not. Hence, the classical central limit theorem does not hold for these yield data. The consequence of the infinite variance and higher order moments is that empirical estimates of the means converge very slowly due to the regular occurrence of extremely large crop yield values. These characteristics suggest that crop harvest with extremely large yield could sometimes occur for sugar cane in Somalia and sweet potato in Tanzania. Such events could often be of great importance to the farmers and other investors in agribusiness. This behavior is referred to as the black swan mechanism (see [67]). The black swan mechanism describes events coming as a surprise. It has a major effect (positive or negative) and is often inappropriately rationalized. Farmers can have the tendency to break even and even enjoy lower crop yield risk insurance policies in Somalia and Tanzania if they invest in sugar cane and sweet potato, respectively, due to their potential for extremely high yield.

Table 7. Parameter estimates for the power law distribution for all the crop yield data sets (xmin and α are parameters of the power distribution; αse is the standard error corresponding to α; ntail is the number of data exceeding xmin).

Country Crop n n tail x min α α se
Burundi Banana 58 35 55455 7.4026 1.0822
Beans 58 35 9888 19.9503 3.2032
Cassava 58 35 89591 52.1287 8.7685
Coffee 58 35 9180 15.9219 3.3366
Sorghum 58 5 12964 33.5816 14.5710
Sweet potato 58 6 89127 8.1898 2.9352
Kenya Beans 58 32 5429 9.4680 1.4969
Coffee 58 20 6669 11.9168 2.4411
Maize 58 21 16922 14.5952 2.9667
Rice 58 39 37568 6.7330 0.9180
Sugar Cane 58 30 806694 8.5033 1.3699
Wheat 58 22 19359 6.5818 1.1901
Somalia Banana 58 13 222222 8.2251 2.0039
Maize 58 32 9544 5.4249 0.7822
Sorghum 58 18 4143 5.3367 1.0222
Sugar Cane 58 57 300000 2.7445 0.2311
Tanzania Maize 58 33 12427 6.0445 0.8781
Millet 58 21 9530 8.3193 1.5972
Rice 58 34 15583 5.4751 0.7675
Cotton Seed 58 30 5132 7.2276 1.1370
Sorghum 58 15 10133 7.3606 1.6423
Sweet potato 58 37 25323 2.9081 0.3137
Uganda Banana 58 16 44748 36.7026 8.9257
Cassava 58 30 66988 4.0035 0.5484
Coffee 58 37 5873 6.3153 0.8738
Millet 58 39 12496 6.6790 0.9094
Plantain 58 43 53474 7.3752 0.9722
Sweet potato 58 33 40960 12.6778 2.0328
Rwanda Beans 58 12 8914 28.4564 7.9260
Cassava 58 7 120265 13.0159 4.5416
Coffee 58 36 5817 7.6296 1.1049
Potato 58 44 64088 6.0225 0.7572
Sorghum 58 37 10688 9.6451 1.4212
Sweet potato 58 14 75333 15.2064 3.7968

All the estimated α values for the power law distribution in Table 7 are > 3 except for sugar cane in Somalia and sweet potato in Tanzania. This indicates that the sample means for these crops are Gaussian distributed and that their variances are finite. Hence, the standard central limit theorem applies for these crop yield data. The finite mean and variance and the observed evidence of underdispersion in Table 1 suggest that east African regional food security does not seem to be extremely volatile as regular crop yields for these crops tend to cluster around the mean crop yield.

Ignoring the impacts of climate and environment, soil structures and compositions/nutrients, crop species, mechanization and technology, etc on crop yields, the observed black swan behaviour for the yields of sugar cane in Somalia and sweet potato in Tanzania could be explained by the so called “rich getting richer” principle or the “preferential attachment” principle. Based on these principles, these two crops have potentials for extremely high yield perhaps because of either high demand (so every farmer tends to make them their choice crops for cultivation) or common practice such as irrigation adopted by all the farmers being capable of increasing crop yield [36]. So, speaking of crop harvest, yield could follow the pattern of the rich getting richer or the preferential attachment principle. The extremely high yields for sugar cane in Somalia and sweet potato in Tanzania are not just a little bit higher than the normal yield for the same or different crops in the same or other countries. Instead they are so much higher that they cause their distributions to skew significantly.

5 Conclusions

We have analyzed the trend and tail of some yearly crop yield data such as banana, plantain, beans, cassava, coffee, sorghum, potato, sweet potato, maize, rice, sugar cane, wheat, millet and cotton seed from 1961 to 2018 in six east African countries: Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda. An exploratory analysis of the crop yield data reveals three structural patterns in each of the series. They are: increasing, decreasing and stagnant trends. Ten years (2019–2028) time series point forecast based on the fitted ARIMA models shows that majority of the crops will experience stagnant yield in different countries with only sorghum and coffee showing the tendency for significant and persistent upward trend in Burundi and Rwanda, respectively, while beans indicates significant and persistent yield decrease in Burundi, Kenya and Rwanda.

We used the power law, lognormal, Fréchet and stretched exponential distributions to describe high yields in all the crops across the countries. Based on Vuong’s test, we observed that the stretched exponential distribution gave the best fit for millet in Uganda while the power law distribution gave the best fit for the other crops except for a few undecided cases. The log-log plots were used to visually inspect the performance of the fitted distributions. The power law distribution appeared to fit the upper tail of all the crop yield data better than the other distributions in all the countries. Based on the estimated α value of the fitted power law model, we found potential for extremely high yield in sugar cane in Somalia and sweet potato in Tanzania indicating the inappropriateness of the Gaussian distribution for describing these crop yields. Other crops in Burundi, Kenya, Somalia, Tanzania, Uganda and Rwanda can produce only high but not extremely high yields. Though the time series point forecasts for majority of the crops show yield stagnancy with a few exceptions, the evidence from the power law analysis indicates the potential for high yield for all the crops and provides specific calibrations for the yield of all the crops in terms of what quantity of yield is considered high.

We characterize the evidence for extremely high yield for sugar cane and sweet potato in Somalia and Tanzania, respectively, as black swan where the “rich getting richer” or the “preferential attachment” could be the underlying generating process, meaning that either the two crops are increasingly at lower risk of climate change and environmental challenges such as being drought resistant or farmers are constantly doing many things right (such as adopting favorable planting strategies, large crop areas, etc) as far as the cultivation of the two crops are concerned in the two countries.

ARIMA(0,1,1) was used to model and predict coffee in Burundi and Kenya; beans in Burundi; wheat in Kenya; rice, cotton seed, and sweet potato in Tanzania; millet in Uganda. ARIMA(2,1,0) was used to model and predict sorghum in Tanzania. ARIMA(2,1,2) was used to model and predict banana in Burundi. ARIMA(0,1,2) was used to model and predict banana in Uganda and potato in Rwanda. ARIMA(0,1,0) was used to model and predict cassava in Burundi and Uganda; sugarcane in Kenya and Somalia; rice in Kenya; maize in Somalia; plantain and sweet potato in Uganda. ARIMA(1,0,0) was used to model and predict Sorghum in Burundi and Rwanda; beans in Kenya and Rwanda; millet in Tanzania; coffee in Uganda. ARIMA(2,1,1) was used to model and predict sweet potato in Burundi. ARIMA(0,1,3) was used to model and predict maize in Kenya. ARIMA(0,0,4) was used to model and predict banana in Somalia. ARIMA(1,1,1) was used to model and predict sorghum in Somalia; maize in Tanzania. ARIMA(2,0,2) was used to model and predict sweet potato in Rwanda.

The yield forecast in Burundi shows an initial quick decline in 2019 followed by an increase for banana; a sharp increase in 2019 followed by an increase for sweet potato; sorghum shows a quick increase from 2019 to 2028; beans shows a sharp decrease from 2019 to 2028; neither cassava nor coffee show any tendency to increase or decrease from 2019 to 2028. The forecast of the crop yield in Kenya indicates continuous decline of beans yield from 2019 to 2028; no decrease or increase pattern in yield is evident for coffee, rice, wheat and sugar cane from 2019 to 2028; maize shows a sharp decline in 2019 with an immediate increase followed by a stable trend. In Somalia, the yield forecast for maize and sugar cane does not indicate any pattern; banana shows an initial moderate increase in 2019 followed by the lack of pattern until 2028; sorghum experienced a sharp drop in 2019 followed by a period of no trend up to 2028. The yield forecast in Tanzania indicates no significant trend for maize, rice, sweet potato and cotton seed for the whole forecast period; millet is slightly decreased in 2019 and remained stagnant until 2028. The yield forecast in Uganda indicates that banana, cassava, millet, plantain and sweet potato did not show any significant pattern from 2019 to 2028; coffee shows a slight increase in 2019 followed by a period of no change in yield. The yield forecast in Rwanda indicates that beans persistently decreased from 2019 to 2028; sweet potato shows initial increase followed by a slow decrease; coffee indicated an upward trend from 2019 to 2028; cassava, potato and sorghum did not show any significant pattern.

In our discussion in Section 4, we saw how the literature points in the direction of climate change as the major cause of the observed yield stagnancy and decline. On this backdrop, we suggest that a promising future in favour of high crop yield could await east Africa if urgent changes or improvements on the cropping systems and infrastructures that currently exist in east Africa could be made in order to meet up with the inevitable future demand of agricultural produce due to the increasing population and the challenge of negative impacts of climate change. Science and technology could be useful in showing how agricultural production can be significantly improved in east Africa. For instance, the construction of irrigation systems and rainwater harvesting structures could help cushion the impact of climate change.

Further, various climate adaptation/smart strategies could be adapted to increase yields in east Africa. According to [68], short-duration pigeon pea varieties developed by the International Crops Research Institute for Semi-Arid Tropics and the Kenya Agricultural Research Institute can give high yields and escape drought, but require non-traditional management practices (for example, sole-cropping, spraying against insect pests). According to [69], NERICA, a new rice for Africa, has shown high potential to revolutionize rice farming, producing high yield with minimum inputs in stress-afflicted ecologies. [70] observed that cassava mosaic disease (CMD) resistant cassava varieties released in western Kenya and Uganda yielded up to three times more than local varieties. [71] demonstrated that high yields of maize were recorded from certain varieties (Pwani Hybrid 4-PH4, Coast Composite Maize-CCM and the local check-Mdzihana) but they usually required relatively high rainfall amounts in order for them to produce better yields. [72] showed that increased knowledge of varieties, environment and management factors can double total yield of maize, sorghum, millet and groundnut from 1.67 to 3.29 tons per hectare from the average 5.1 hectares that farmers usually crop in south east Zimbabwe. [73] showed that improved maize varieties outyielded the traditional control variety by 26–46% across sites and season in central Mozambique. [74] showed that the use of organic soil management practices such as reduced tillage, mulching and leguminous crops in the northern part of Tanzania increased the production of food crops from an average of 0.5 ton per hectare to 1.5 ton per hectare; subsequently, maize yields increased from 12,000 kilogram to 20,000 kilogram per 4.8 hectares. [75] suggested that relaxing liquidity constraints could help to encourage farmers’ adaptation through the implementation of soil, water and land management strategies; thereby, positioning east Africa for food sufficiency in the face of the current global food crisis. [76] noted that intensive manuring with a combination of green and poultry manure produced high yields of maize in central Uganda that were comparable to those with mineral fertilizers. [77] demonstrated that households in Kenya adapting to climate change and climate variability through uptake of technologies such as early planting, use of improved crop varieties, and crop diversification produced 4877 kilograms of maize yield equivalent / hectare per year against 3238 kilograms of maize yield equivalent / hectare per year for households that did not adapt (a 33.6% difference between the two groups). [78] found that fertilizer application in the intercropping system is eastern and southern Africa improved cereal yields by 71–282% and pigeon pea yields by 32–449%, increased benefit-cost ratios by 10–40%, and reduced variability in cereal yields by 40–56% and pigeon pea yields by 5–52% compared with unfertilized intercrops. [79] showed that drought resistant climate-smart maize hybrids in Kenya increased yields 33 to 54% relative to conventional hybrids. According to [80], climate adaptation strategies in the central highlands of Kenya included the use of fertilizer and manure in combination (71%), terracing (66%), and crop rotation (60%). [81] showed that climate-smart adaptation practices significantly enhanced wheat yield by 34.35% in southern Ethiopia. [82] showed that use of mulching and permanent planting basin dimensions on maize in western Uganda relatively increased yield by 11–66% and water use efficiency by 33–94% compared to conventional practices.

The findings in this paper underscore the importance of using climate-smart agricultural alternatives to improve resilience farming system and the livelihood of subsistence farmers due to the impact of climate change in east Africa. Currently, crop yield for majority of the crops in different countries has been confirmed to neither increase nor decrease with only few crops experiencing all time increase or decrease in yield. Urgent attention should be paid to beans production in the affected countries in order to reverse the persistent downward trend of its yield. This paper brings good news of hope for crop yield increase in east Africa if adaptive farming methods and strategies are adequately harnessed in the region in the face of climate and environmental challenges and rising global demand for agricultural produce.

The data from 1961 to 2018 consist of only 58 observations. Hence, the results and forecasts in this paper should be treated conservatively. A future work is to see if more frequent and more up-to-date data are available. Another is to consider multivariate modelling of yield by considering country and crop. The disadvantage of the length of the observed series can be interpolated by explaining the common factor for each country and crop.

Supporting information

S1 Data

(CSV)

S2 Data

(CSV)

S3 Data

(CSV)

S4 Data

(CSV)

S5 Data

(CSV)

S6 Data

(CSV)

S7 Data

(CSV)

S8 Data

(CSV)

S1 File

(TXT)

Acknowledgments

The authors would like to thank the Editor and the two referees for careful reading and comments which greatly improved the paper.

Data Availability

All relevant data are within the paper and its Supporting information files.

Funding Statement

The authors received no specific funding for this work.

References

  • 1. Buerkert A, Hiernaux P. Nutrients in the west African Sudano-Sahelian zone: Losses, transfers and role of external inputs. Zeitschrift fuer Pflanzenernaehrung und Bodenkunde. 1988; 161: 365–383. doi: 10.1002/jpln.1998.3581610405 [DOI] [Google Scholar]
  • 2. Nair KPP, Ngachie V, Nzetchoung F. The importance of liming acid soils of the Sahel regions of Africa to sustain high crop productivity—A case-study from the republic of Cameroon. Experimental Agriculture. 1992; 28: 473–476. doi: 10.1017/S0014479700020184 [DOI] [Google Scholar]
  • 3. Daniel IO, Adetumbi JA. Maize seed supply systems and implications for seed sector development in southwestern Nigeria. Journal of Sustainable Agriculture. 2006; 28: 25–40. doi: 10.1300/J064v28n02_04 [DOI] [Google Scholar]
  • 4. Mofoke ALE, Adewumi JK, Babatunde FE, Mudiare OJ, Ramalan AA. Yield of tomato grown under continuous-flow drip irrigation in Bauchi state of Nigeria. Agricultural Water Management. 2006; 84: 166–172. doi: 10.1016/j.agwat.2006.02.001 [DOI] [Google Scholar]
  • 5. Mtambanengwe F, Mapfumo P. Effects of organic resource quality on soil profile N dynamics and maize yields on sandy soils in Zimbabwe. Plant and Soil. 2006; 281: 173–191. doi: 10.1007/s11104-005-4182-3 [DOI] [Google Scholar]
  • 6. Mahasi MJ, Kamundia JW. Cluster analysis in rapeseed (Brassica napus L.). African Journal of Agricultural Research. 2007; 9: 409–411. [Google Scholar]
  • 7. Howell CL, Myburgh PA, Conradie WJ. Comparison of three different fertigation strategies for drip irrigated table grapes—Part III: Growth, yield and quality. South African Journal of Enology and Viticulture. 2013; 34: 21–29. [Google Scholar]
  • 8. Shrestha RP, Ligonja PJ. Social perception of soil conservation benefits in Kondoa eroded area of Tanzania. International Soil and Water Conservation Research. 2015; 3: 183–195. doi: 10.1016/j.iswcr.2015.08.001 [DOI] [Google Scholar]
  • 9. Ondieki DK, Nyaboga EN, Wagacha JM, Mwaura FB. Morphological and genetic diversity of Rhizobia Nodulating Cowpea (Vigna unguiculata L.) from agricultural soils of lower eastern Kenya. International Journal of Microbiology. 2017; article number 8684921. doi: 10.1155/2017/8684921 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Wassie A, Pauline N. Evaluating small holder farmers’ preferences for climate smart agricultural practices in Tehuledere district, northeastern Ethiopia. Singapore Journal of Tropical Geography. 2018; 39: 300–316. doi: 10.1111/sjtg.12240 [DOI] [Google Scholar]
  • 11. Rakotoarisoa NM, Tsujimoto Y, Oo AZ. Dipping rice seedlings in p-enriched slurry increases grain yield and shortens days to heading on p-deficient lowlands in the central highlands of Madagascar. Field Crops Research. 2020; 254: article number 107806. doi: 10.1016/j.fcr.2020.107806 [DOI] [Google Scholar]
  • 12. Kiboi MN, Ngetich FK, Mucheru-Muna MW, Diels J, Mugendi DN. Soil nutrients and crop yield response to conservation-effective management practices in the sub-humid highlands agro-ecologies of Kenya. Heliyon. 2021; 7: article number 07156. doi: 10.1016/j.heliyon.2021.e07156 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Ray DK, Ramankutty N, Mueller ND, West PC, Foley JA. Recent patterns of crop yield growth and stagnation. Nature Communications. 2012; 3: 1–7. doi: 10.1038/ncomms2296 [DOI] [PubMed] [Google Scholar]
  • 14. Box GE, Jenkins GM. Time series analysis, forecasting and control. San Francisco: Holden-Day; 1976. [Google Scholar]
  • 15. Akouegnonhou O, Demirbas N. Forecasting of rice self-sufficiency in the Benin Republic using ARIMA model. Selcuk Journal of Agriculture and Food Sciences. 2019; 33: 204–214. [Google Scholar]
  • 16. Mgaya JF. Application of ARIMA models in forecasting livestock products consumption in Tanzania. Cogent Food and Agriculture. 2019; 5: article number 1607430. doi: 10.1080/23311932.2019.1607430 [DOI] [Google Scholar]
  • 17. Ajetomobi JO, Olaleye AO. Auto-regressive integrated moving average (ARIMA) modelling of cocoa production in Nigeria: 1900-2025. Journal of Crop Improvement. 2019; 33: 445–455. doi: 10.1080/15427528.2019.1610534 [DOI] [Google Scholar]
  • 18. Sharma PK, Dwivedi S, Ali L, Arora RK. Forecasting maize production in India using ARIMA model. Agro Economist-An International Journal. 2018; 5: 1–6. [Google Scholar]
  • 19.Sikalubya M, Xu S, Yu W, Moonga P. Study on forecasting soybean production: An application of ARIMA model. In: Proceedings of the 2019 International Conference on Intelligent Computing, Automation and Systems (ICICAS). 2019. pp. 447-452.
  • 20. Biswas R. Study on ARIMA modelling to forecast area and production of Kharif rice in West Bengal. Cutting-edge Research in Agricultural Sciences. 2021; 12: 142–151. doi: 10.9734/bpi/cras/v12/10700D [DOI] [Google Scholar]
  • 21. Shoko RR, Belete A. Efficient planning of sorghum production in South Africa–Application of the Box-Jenkin’s method. Journal of Agribusiness and Rural Development. 2017; 4: 835–841. doi: 10.17306/J.JARD.2017.00352 [DOI] [Google Scholar]
  • 22. Mehmood Q, Sial M, Riaz M, Shaheen N. Forecasting the production of sugarcane crop of Pakistan for the year 2018–2030, using Box-Jenkin’s methodology. Journal of Animal and Plant Sciences. 2019; 5: 1396–1401. [Google Scholar]
  • 23.Chen SL, Miranda MJ. Modelling multivariate crop yield densities with frequent extreme events (No. 377-2016-20672). In: Proceedings of the Agricultural and Applied Economics Association Conference. 2004.
  • 24. Warusawitharana M. Time-varying volatility and the power law distribution of stock returns. Journal of Empirical Finance. 2018; 49: 123–141. doi: 10.1016/j.jempfin.2018.09.004 [DOI] [Google Scholar]
  • 25. Masseran N, Yee LH, Safari MAM, Ibrahim K. Power law behavior and tail modelling on low income distribution. Mathematics and Statistics. 2019; 7: 70–77. doi: 10.13189/ms.2019.070303 [DOI] [Google Scholar]
  • 26. Safari MAM, Masseran N, Ibrahim K, Al-Dhurafi NA. The power-law distribution for the income of poor households. Physica A: Statistical Mechanics and Its Applications. 2020; 557: article number 124893. doi: 10.1016/j.physa.2020.124893 [DOI] [Google Scholar]
  • 27. Asif M, Hussain Z, Asghar Z, Hussain MI, Raftab M, Shah SF, et al. A statistical evidence of power law distribution in the upper tail of world billionaires’ data 2010–20. Physica A: Statistical Mechanics and Its Applications. 2021; 581: article number 126198. doi: 10.1016/j.physa.2021.126198 [DOI] [Google Scholar]
  • 28. Simsek E, Kim M. Power-law tail in lag time distribution underlies bacterial persistence. Proceedings of the National Academy of Sciences. 2019; 116: 17635–17640. doi: 10.1073/pnas.1903836116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Masseran N. Power-law behaviors of the duration size of unhealthy air pollution events. Stochastic Environmental Research and Risk Assessment. 2021; 35: 1499–1508. doi: 10.1007/s00477-021-01978-2 [DOI] [Google Scholar]
  • 30. Chen X, Pan Y, Luo B. Research on power-law distribution of long-tail data and its application to tourism recommendation. Industrial Management and Data Systems. 2021; 121: 1268–1286. doi: 10.1108/IMDS-10-2019-0584 [DOI] [Google Scholar]
  • 31. Balthrop A, Quan S. The power-law distribution of cumulative coal production. Physica A: Statistical Mechanics and Its Applications. 2019; 530: article number 121573. doi: 10.1016/j.physa.2019.121573 [DOI] [Google Scholar]
  • 32.Akhundjanov SB, Chamberlain L. The power-law distribution of agricultural land size. In: Proceedings of the 2019 Annual Meeting, Agricultural and Applied Economics Association. 2019.
  • 33. Campolieti M. The distribution of union size: Canada, 1913–2014. Physica A: Statistical Mechanics and Its Applications. 2020; 558: article number 125007. doi: 10.1016/j.physa.2020.125007 [DOI] [Google Scholar]
  • 34. Campolieti M, Ramos A. The distribution of strike size: Empirical evidence from Europe and North America in the 19th and 20th centuries. Physica A: Statistical Mechanics and Its Applications. 2021; 563: article number 125424. doi: 10.1016/j.physa.2020.125424 [DOI] [Google Scholar]
  • 35. Pena G, Puente-Ajovín M, Ramos A, Sanz-Gracia F. Log-growth rates of CO2: An empirical analysis. Physica A: Statistical Mechanics and Its Applications. 2022; 588: article number 126550. doi: 10.1016/j.physa.2021.126550 [DOI] [Google Scholar]
  • 36. Hennessy DA. Crop yield skewness under law of the minimum technology. American Journal of Agricultural Economics. 2009; 91: 197–208. doi: 10.1111/j.1467-8276.2008.01181.x [DOI] [Google Scholar]
  • 37. Koning AJ, Peng L. Goodness-of-fit tests for a heavy tailed distribution. Journal of Statistical Planning and Inference. 2008; 138: 3960–3981. doi: 10.1016/j.jspi.2008.02.013 [DOI] [Google Scholar]
  • 38. Zhang H, Wu Z. The general goodness-of-fit tests for correlated data. Computational Statistics and Data Analysis. 2022; 167: article number 107379. doi: 10.1016/j.csda.2021.107379 [DOI] [Google Scholar]
  • 39. Pankratz A. Forecasting with univariate Box-Jenkins models: Concepts and cases. New York: John Wiley and Sons; 2009. [Google Scholar]
  • 40. Box GE, Jenkins GM, Reinsel GC, Ljung GM. Time series analysis: Forecasting and control. New York: John Wiley and Sons; 2015. [Google Scholar]
  • 41. Box GE, Cox DR. An analysis of transformations. Journal of the Royal Statistical Society, B. 1964; 26: 211–243. [Google Scholar]
  • 42. Kwiatkowski D, Phillips PCB, Schmidt P, Shin Y. Testing the null hypothesis of stationarity against the alternative of a unit root. Journal of Econometrics. 1992; 54: 159–178. doi: 10.1016/0304-4076(92)90104-Y [DOI] [Google Scholar]
  • 43. Cox DR, Hinkley DV. (1979). Theoretical statistics. London: Chapman and Hall; 1979. [Google Scholar]
  • 44. Pareto V. La legge della domanda. Giornale Degli Economisti. 1895; 10: 59–68. [Google Scholar]
  • 45. Fréchet M. Sur la loi de probabilite de l’ecart maximum. Annales de la Societe Polonaise de Mathematique. 1927; 6: 93. [Google Scholar]
  • 46.R Core Team R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2022. https://www.R-project.org/.
  • 47. Clauset A, Shalizi CR, Newman ME. Power law distributions in empirical data. SIAM Review. 2009; 51: 661–703. doi: 10.1137/070710111 [DOI] [Google Scholar]
  • 48. Gabaix X, Ibragimov R. Rank—1 / 2: A simple way to improve the OLS estimation of tail exponents. Journal of Business and Economic Statistics. 2011; 29: 24–39. doi: 10.1198/jbes.2009.06157 [DOI] [Google Scholar]
  • 49. Hill BM. A simple general approach to inference about the tail of a distribution. Annals of Statistics. 1975; 3: 1163–1174. doi: 10.1214/aos/1176343247 [DOI] [Google Scholar]
  • 50. Theil H. A rank-invariant method of linear and polynomial regression analysis, I. Proceedings of the Koninklijke Nederlandse Akademie van Wetenschappen A. 1950; 53: 386–392. [Google Scholar]
  • 51. Schwarz GE. Estimating the dimension of a model. Annals of Statistics. 1978; 6: 461–464. doi: 10.1214/aos/1176344136 [DOI] [Google Scholar]
  • 52. Hurvich CM, Tsai CL. Regression and time series model selection in small samples. Biometrika. 1989; 76: 297–307. doi: 10.1093/biomet/76.2.297 [DOI] [Google Scholar]
  • 53. Vuong QH. Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica: Journal of the Econometric Society. 1989; 57: 307–333. doi: 10.2307/1912557 [DOI] [Google Scholar]
  • 54. Rowhani P, Lobell DB, Linderman M, Ramankutty N. Climate variability and crop production in Tanzania. Agricultural and Forest Meteorology. 2011; 151: 449–460. doi: 10.1016/j.agrformet.2010.12.002 [DOI] [Google Scholar]
  • 55. Knox J, Hess T, Daccache A, Wheeler T. Climate change impacts on crop productivity in Africa and south Asia. Environmental Research Letters. 2012; 7: article number 034032. doi: 10.1088/1748-9326/7/3/034032 [DOI] [Google Scholar]
  • 56. Craparo ACW, Van Asten PJA, Laederach P, Jassogne LTP, Grab SW. Coffea arabica yields decline in Tanzania due to climate change: Global implications. Agricultural and Forest Meteorology. 2015; 207: 1–10. doi: 10.1016/j.agrformet.2015.03.005 [DOI] [Google Scholar]
  • 57. Munishi LK, Lema AA, Ndakidemi PA. Decline in maize and beans production in the face of climate change at Hai district in Kilimanjaro region, Tanzania. International Journal of Climate Change Strategies and Management. 2015; 7: 17–26. doi: 10.1108/IJCCSM-07-2013-0094 [DOI] [Google Scholar]
  • 58. Kahsay GA, Hansen LG. The effect of climate change and adaptation policy on agricultural production in eastern Africa. Ecological Economics. 2016; 121: 54–64. doi: 10.1016/j.ecolecon.2015.11.016 [DOI] [Google Scholar]
  • 59. Wolka K, Mulder J, Biazin B. Effects of soil and water conservation techniques on crop yield, runoff and soil loss in sub-Saharan Africa: A review. Agricultural Water Management. 2018; 207: 67–79. doi: 10.1016/j.agwat.2018.05.016 [DOI] [Google Scholar]
  • 60. Mukamuhirwa A, Hovmalm HP, Bolinsson H, Ortiz R, Nyamangyoku O, Johansson E. Concurrent drought and temperature stress in rice. A possible result of the predicted climate change: Effects on yield attributes, eating characteristics, and health promoting compounds. International Journal of Environmental Research and Public Health. 2019; 16: article number 1043. doi: 10.3390/ijerph16061043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Chapman S, Birch CE, Pope E, Sallu S, Bradshaw C, Davie J, et al. Impact of climate change on crop suitability in sub-Saharan Africa in parameterized and convection-permitting regional climate models. Environmental Research Letters. 2020; 15: article number 094086. doi: 10.1088/1748-9326/ab9daf [DOI] [Google Scholar]
  • 62. Mkonda MY, He XH. The influence of soil organic carbon and climate variability on crop yields in Kongwa district, Tanzania. Environmental Management. 2022. [DOI] [PubMed] [Google Scholar]
  • 63. Narita D, Sato I, Ogawada D, Matsumura A. Integrating economic measures of adaptation effectiveness into climate change interventions: A case study of irrigation development in Mwea, Kenya. Plos ONE. 2020; 15: article number e0243779. doi: 10.1371/journal.pone.0243779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Ojara MA, Lou YS, Ongoma V, Mumo L, Akodi D, Ayugi B, et al. Projected changes in east African climate and its impacts on climatic suitability of maize production areas by the mid-twenty-first century. Environmental Monitoring and Assessment. 2021; 193: article number 831. doi: 10.1007/s10661-021-09547-4 [DOI] [PubMed] [Google Scholar]
  • 65. Shabani Y, Pauline NM. Perceived effective adaptation strategies against climate change impacts: Perspectives of maize growers in the southern highlands of Tanzania. Environmental Management. 2022. [DOI] [PubMed] [Google Scholar]
  • 66. Warsame AA, Sheik-Ali IA, Jama OM, Hassan AA, Barre GM. Assessing the effects of climate change and political instability on sorghum production: Empirical evidence from Somalia. Journal of Cleaner Production. 2022; 360: article number 131893. doi: 10.1016/j.jclepro.2022.131893 [DOI] [Google Scholar]
  • 67. Newman ME. Power laws, Pareto distributions and Zipf’s law. Contemporary Physics. 2005; 46: 323–351. doi: 10.1080/00107510500052444 [DOI] [Google Scholar]
  • 68. Muli JM, Omanga PA, Jones RB. Farmer-to-farmer seed supply: Case study of pigeon pea seed distribution in Kenya. In: Rohrbach DD, Bishaw Z, Vangastel AJG, editors. Alternative strategies for smallholder seed supply. 1997. pp. 188–191. [Google Scholar]
  • 69. Atera EA, Onyango JC, Azuma T, Asanuma S, Itoh K. Field evaluation of selected NERICA rice cultivars in western Kenya. African Journal of Agricultural Research. 2011; 6: 60–66. [Google Scholar]
  • 70.Fermont AM, Obiero HM, van Asten PJA, Bagurna Y, Okwuosa E. Improved cassava varieties increase the risk of soil nutrient mining: An ex-ante analysis for western Kenya and Uganda. In: Bationo A, Waswa, B, Kihara J, Kimetu J, editors, Advances in integrated soil fertility management in sub-Saharan Africa—Challenges and opportunities. 2007. pp. 511-519.
  • 71.Pole FN, Saha HM, Mangale N, Mzingirwa AM, Munyambu P. On-Station evaluation of maize genotypes for nutrient and water use efficiency in the semi arid lands of coastal Kenya. In: Filho WL, Esilaba AO, Rao KPC, Sridhar G, editors, Adapting African agriculture to climate change—Transforming rural livelihoods. 2015. pp. 199-206.
  • 72. Murungweni C, Van Wijk MT, Smaling EMA, Giller KE. Climate-smart crop production in semi-arid areas through increased knowledge of varieties, environment and management factors. Nutrient Cycling in Agroecosystems. 2016; 105: 183–197. doi: 10.1007/s10705-015-9695-4 [DOI] [Google Scholar]
  • 73. Thierfelder C, Rusinamhodzi L, Setimela P, Walker F, Eash NS. Conservation agriculture and drought-tolerant germplasm: Reaping the benefits of climate-smart agriculture technologies in central Mozambique. Renewable Agriculture and Food Systems. 2016; 31: 414–428. doi: 10.1017/S1742170515000332 [DOI] [Google Scholar]
  • 74. Mkonda MY, He XH. Conservation agriculture in Tanzania. Sustainable Agriculture Reviews. 2017; 25: 309–324. doi: 10.1007/978-3-319-48006-0_10 [DOI] [Google Scholar]
  • 75. Shikuku KM, Winowiecki L, Twyman J, Eitzinger A, Perez JG, Mwongera C, et al. Small holder farmers’ attitudes and determinants of adaptation to climate risks in east Africa. Climate Risk Management. 2017; 16: 234–245. doi: 10.1016/j.crm.2017.03.001 [DOI] [Google Scholar]
  • 76. Alibu S, Neuhoff D, Senthilkumar K, Becker M, Kopke U. Potential of cultivating dry season maize along a hydrological gradient of an inland valley in Uganda. Agronomy-Basel. 2019; 9: article number 606. [Google Scholar]
  • 77. Kabubo-Mariara J, Mulwa R. Adaptation to climate change and climate variability and its implications for household food security in Kenya. Food Security. 2019; 11: 1289–1304. doi: 10.1007/s12571-019-00965-4 [DOI] [Google Scholar]
  • 78. Kiwia A, Kimani D, Harawa R, Jama B, Sileshi GW. Sustainable intensification with cereal-legume intercropping in eastern and southern Africa. Sustainability. 2019; 11: article number 2891. doi: 10.3390/su11102891 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Obunyali CO, Karanja J, Oikeh SO, Omanya GO, Mugo S, Beyene Y, et al. On-farm performance and farmers’ perceptions of DroughtTEGO climate-smart maize hybrids in Kenya. Agronomy Journal. 2019; 111: 2754–2768. doi: 10.2134/agronj2019.08.0600 [DOI] [Google Scholar]
  • 80. Mairura FS, Musafiri CM, Kiboi MN, Macharia JM, Ng’etich OK, Shisanya CA, et al. Determinants of farmers’ perceptions of climate variability, mitigation, and adaptation strategies in the central highlands of Kenya. Weather and Climate Extremes. 2021; 34: article number 100374. doi: 10.1016/j.wace.2021.100374 [DOI] [Google Scholar]
  • 81. Sedebo DA, Li GC, Etea BG, Abebe KA, Ahiakpa JK, Arega Y, et al. Impact of small holder farmers’ climate-smart adaptation practices on wheat yield in southern Ethiopia. Climate and Development. 2022; 14: 282–296. doi: 10.1080/17565529.2021.2014777 [DOI] [Google Scholar]
  • 82. Zizinga A, Mwanjalolo JGM, Tietjen B, Bedadi B, Gabiri G, Luswata KC. Effect of mulching and permanent planting basin dimensions on maize (Zea mays L.) production in a sub-humid climate. Water. 2022; 14: article number 79. doi: 10.3390/w14010079 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Data

(CSV)

S2 Data

(CSV)

S3 Data

(CSV)

S4 Data

(CSV)

S5 Data

(CSV)

S6 Data

(CSV)

S7 Data

(CSV)

S8 Data

(CSV)

S1 File

(TXT)

Data Availability Statement

All relevant data are within the paper and its Supporting information files.


Articles from PLOS ONE are provided here courtesy of PLOS

RESOURCES