Skip to main content
Entropy logoLink to Entropy
. 2018 Oct 12;20(10):783. doi: 10.3390/e20100783

A New and Stable Estimation Method of Country Economic Fitness and Product Complexity

Vito D P Servedio 1,*, Paolo Buttà 2, Dario Mazzilli 3, Andrea Tacchella 4, Luciano Pietronero 3
PMCID: PMC7512345  PMID: 33265871

Abstract

We present a new metric estimating fitness of countries and complexity of products by exploiting a non-linear non-homogeneous map applied to the publicly available information on the goods exported by a country. The non homogeneous terms guarantee both convergence and stability. After a suitable rescaling of the relevant quantities, the non homogeneous terms are eventually set to zero so that this new metric is parameter free. This new map almost reproduces the results of the original homogeneous metrics already defined in literature and allows for an approximate analytic solution in case of actual binarized matrices based on the Revealed Comparative Advantage (RCA) indicator. This solution is connected with a new quantity describing the neighborhood of nodes in bipartite graphs, representing in this work the relations between countries and exported products. Moreover, we define the new indicator of country net-efficiency quantifying how a country efficiently invests in capabilities able to generate innovative complex high quality products. Eventually, we demonstrate analytically the local convergence of the algorithm involved.

Keywords: economic complexity, non-linear map, bipartite networks

1. Introduction

In the last decade a new approach to macroeconomics has been developed to better understand the growth of countries [1,2]. The key idea is to consider the international trade of countries as a proxy of their internal production system. By describing the international trade as a bipartite network, where countries and products are sites of the two layers, new metrics for the economy of countries and the quality of products can be constructed by leveraging the network structures only [1] . These metrics quantify the fitness of countries, the quality of their industrial system, and the complexity of commodities by indirectly inferring the technological requirements needed to produce them. The mathematical properties of the algorithm involved in the evaluation of the metrics, as well as the economic meaning of the metrics and possible applications, have been discussed in previous papers [3,4,5]. Moreover, these two new metrics have been successfully used to develop state-of-the-art forecasting approaches for economic growth [6,7,8].

The very same approach has been applied to different social and ecological systems presenting a bipartite network structure and a competition between the components of the system [9,10]. Thus, it is natural to interpret fitness and complexity as properties of the network underlying those systems. The revised version of the fitness-complexity estimation metric that we show here, results in a clear and natural interpretation in terms of network properties and helps to better understand the different components that contribute to the fitness of countries.

In the following, we first describe the original metrics of fitness and complexity and their properties, underlining some critical issues that we solve with this new version. Then, we define the new procedure step by step and highlight its advantages in the case of countries-products bipartite networks. Finally, we devise an approximated solution and discuss its interpretation. In Appendix B we list the main quantities appearing in the text.

2. Metric Definition

2.1. The Original Metric

Object of this work is the network of countries and their exported goods. This network is of bipartite type (countries and products are mutually linked, but no link exists between countries as well as between products) and weighted (links carry a weight scp, i.e., the exported volume of product p of country c, measured in US$). Data ranging from year 1995 to year 2015 can be freely retrieved from the Web [11], though we use them after a procedure to enhance their consistency [8]. Eventually, we come up with data about 161 countries and more than 4000 products, which were categorized according to the Harmonized System 2007 coding system, at 6 digits level of coarse-graining. The weighted bipartite network of countries and products can be projected onto an unweighted network described solely by the Mcp matrix with elements set to unity when a given country c meaningfully exports a good p and zero otherwise (See Methods).

The original metric estimating the fitness of countries and complexity of products was defined by the following non-linear iterative map:

Fc(n)=pMcpQp(n1)with1cCQp(n)=cMcp/Fc(n1)1with1pP, (1)

with initial values Fc(0)=Qp(0)=1,c,p. In the previous expression, Fc and Qp stand for the fitness of a country c and quality (complexity) of a product p; C and P are the total number of countries and exported products, respectively; and from the dataset we have that CP.

By multiplying all Fc and Qp by the same numerical factor k, the map remains unaltered, so that the fixed point of the map (as n) is defined up to a normalization constant. In the original method this constant is chosen at each iteration n such that fitness and complexity are constrained to lie on the double simplex defined by:

cFc(n)=CandpQp(n)=P. (2)

The metric defined in Equations (1) and (2) successfully ranks the countries of our world according to their potential technological development and, when applied to different yearly time intervals, can be used to suggest precise strategies to improve country economies. It has also been proved to give the correct ranking of importance of species in a complex ecological system [9]. Despite its success, some points can still be improved:

  1. Convergence issues: As stated in a recent paper dealing with the stability of calculating this metric [12]:
    • If the belly of the matrix [Mcp] is outward, all the fitnesses and complexities converge to numbers greater than zero. If the belly is inward, some of the fitnesses will converge to zero.
    • Since an inward belly is the rule rather than the exception, some countries will have zero fitness and as a result all the products exported by them get zero complexity (quality). This is mathematically acceptable, though it heavily underestimates the quality of such products: Even natural resources need the right know-how to be extracted so that their quality would be better represented by a positive quantity. To cure this issue one has to introduce the notion of “rank convergence” rather than absolute convergence, i.e., the fixed point is considered achieved when the ranking of countries stays unaltered step by step.
  2. Zero exports: The countries that do not export any good do have zero fitness independently from their finite capabilities.

  3. Specialized world: In an hypothetical world where each country would export only one product, different from all other products exported by other countries, this metric would assign a unity fitness and quality to all countries and products. Though mathematically acceptable, this solution does not take into account the intrinsic complexity of products.

  4. Equation symmetry: This is rather an aesthetic point, in that Equation (1) are not cast in a symmetric form.

2.2. The New Metric

First, we reshape Equation (1) in a symmetric form by introducing the variable Pp=Qp1, i.e.,:

Fc(n)=pMcp/Pp(n1)with1cCPp(n)=cMcp/Fc(n1)with1pP. (3)

Now the quality of products are given by the quantities Pp1 and the metric is trivially equivalent to the original one provided one uses the normalization conditions cFc(n)=C and p(Pp(n))1=P.

Next, we introduce two set of quantities ϕc>0 and πp>0 and consider the inhomogeneous non-linear map defined as:

Fc(n)=ϕc+pMcp/Pp(n1)with1cCPp(n)=πp+cMcp/Fc(n1)with1pP. (4)

Since the map is no more defined up to a multiplicative constant, the normalization condition is not required anymore, while the initial condition can be set as in the original metric Fc(0)=Pp(0)=1,c,p. The fixed point of the transformation is now trivially characterized by the conditions:

Fcϕc,Ppπp,FcPp>Mcp. (5)

The parameters ϕc and πp can be interpreted as follows. The parameter ϕc represents the intrinsic fitness of a country. In fact, for a country k that does not export any good we have Mkp=0p so that its fitness is simply equal to ϕk. Irrespective of its exports any country has a set of capabilities that characterize it.

The parameter πp is more intriguing. If no country exports it (probably because no country produces it), the product q has not been invented yet and its quality lies at its maximum value πq1 since Mcq=0c. Therefore, the inverse of πq may be interpreted as a sort of innovation threshold: The smaller the parameter is, the higher is the quality of the product in his outset and more sophisticated capabilities are necessary to produce it. On the other hand, products like natural resources may be associated with a larger value of the parameter since require less complex capabilities for their extraction.

In order to keep the metric evaluating algorithm simple and parameter free as in the original case, we set a common value ϕc=πp=δ, then we study the dependence of the metrics on δ, and finally we set δ=0 (in fact renouncing to cure the issue number iii. listed above).

3. Results

3.1. Dependence on the Non-Homogeneous Parameter

We consider ϕc=πp=δc,p and address the dependence of the fixed point upon δ. To outline the dependence of Fc and Pp from the parameter δ, we use the relations defined in Equation (4) and introduce the rescaled quantities P˜p=Pp/δ and F˜c=Fcδ. After some trivial algebra we get from Equation (4):

F˜c(n)=δ2+pMcp/P˜p(n1)with1cCP˜p(n)=1+cMcp/F˜c(n1)with1pP, (6)

from which we deduce that, as soon as the parameter δ2 is much smaller than the typical value of Mcp matrix elements, i.e., much smaller than unity, the fixed point in terms of F˜c and P˜p almost does not depend on δ (see Figure 1).

Figure 1.

Figure 1

Dependence on the non-homogeneous parameter: Dependence of fitness and quality at the fixed point on the parameter δ. One country (Afghanistan) and one product (live horses) were chosen arbitrarily from the sample of year 2014.

It is worth noting that the values of fitness Fc and quality Qp=Pp1 of the original map defined by Equations (1) and (2) cannot be obtained in this new metric when the parameter δ tends to zero. In terms of F˜c and P˜p the fitness and quality obtained in the original metric can be expressed as Fc=F˜cδ1 and Qp=P˜p1δ1. Since the new metric provides finite non vanishing values of F˜c and P˜p, by taking the limit δ0 would deliver infinite values of Fc and Qp. We might think that the normalization procedure necessary in the old metric in order to fix the arbitrary constant would get rid of the common factor δ1 and deliver the same values of the new metric. Unfortunately, this is not the case since the new metric does not rely on a normalization procedure. Therefore, since a self-consistent procedure of normalization, i.e., a projection on the double simplex defined by Equation (2), is missing in the new metric, the results cannot coincide. Since the quantities F˜c and P˜p are well defined in the limit δ0, we shall focus on them only, in the following. We remind that the complexities of products delivered by the original metric are connected to the set of Pp1 and thus to the P˜p1. In particular, the second of Equation (6) can be interpreted at the fixed point as: P˜p=1+Q˜p1 with the Q˜p expressed as in the second of Equation (1), but with the tilde quantities calculated in the new metric. Therefore, we shall assign to Q˜p=(P˜p1)1 the meaning of complexity of products in our new metric. The differences between the old and new metrics are depicted in Figure 2.

Figure 2.

Figure 2

Comparison between the original and the revised metric: Differences in country fitness (left panel) and product complexity (right panel) calculated in the original metric of Ref. [1] (vertical axes) and new metric (horizontal axes) as referred to year 2007. The green line in the left panel is the best least square approximation of power-law type (correlation coefficient 0.989) with exponent ca. 1.53. The dark line in the right panel is the best power-law approximation (correlation coefficient 0.971) resulting with an exponent of ca. 1.38. The year 2007 was chosen randomly. Similar results apply to all the years considered. In particular, the correlation coefficient and the exponent of the green line in the left panel lie between 0.987 and 0.990, and 1.48 and 1.61 respectively throughout the years. For the black line in the right panel we find a correlation coefficient between 0.950 and 0.979, and an exponent between 1.34 and 1.47.

3.2. Analytic Approximate Solution

In this section we shall provide an approximate analytic solution that can be used to estimate the values attained by the map of Equation (6) at the fixed point. Despite their symmetric shape, Equation (4) are not symmetric at all since in case of actual countries and products, the matrix Mcp is rectangular with the number of its rows C being much less than the number of its columns P. To estimate the effect of this asymmetry, we first consider Equation (4) in a mean field fashion, where each element of Mcp is set to the average value M=c,pMcp/CP, and write, at the fixed point:

f˜=δ2+PMp˜1p˜=1+CMf˜1, (7)

with now all F˜c and P˜p set to be equal to their mean field value f˜ and p˜, respectively. By setting δ=0, we find p˜=1/(1CP)1+CP and f˜=PC.

Indeed, an approximate expression for the fixed point of Equation (6) in the regime δ1 and CP can be derived also beyond the mean field approximation. To this end, we set again δ=0 and consider the corresponding fixed point equation associated to Equation (6), i.e.,:

F˜c=pMcp/P˜pwith1cCP˜p=1+cMcp/F˜cwith1pP. (8)

From the empirical structure of the matrix M, we observe that the quantity Dc=pMc,p, representing the diversification of country c, i.e., the number of different products exported by c, is of the order of P, at least for the majority of countries (as an average over all the years considered we find that 70% of the countries have 0.1Dc/P1). Therefore, setting P˜*=maxpP˜p and F˜*=mincF˜c, Equation (8) implies:

F˜cDc/P˜*constP/P˜*with1cCP˜*1+C/F˜*.

From the first estimate, F˜*constP/P˜*, and therefore, by the second estimate, P˜*1+constCPP˜*. As Pp1, we conclude that P˜p=1+Wp with Wp in the order of magnitude of C/P, and, as a consequence, F˜c is of the order of magnitude of P.

We next compute explicitly the values of F˜c and P˜p at the first order in this approximation. The calculation of second order terms can be found in Appendix A. By using the first order approximation (1+a)11a twice, from Equation (8) we have:

WpcMcpDc1+1DcpMcpWp.

Now let H be the square matrix of elements Hpp=cMpcTDc2Mcp. Letting D1 be the column vector with components 1/Dc and 1 the identity matrix, the last displayed formula reads:

(1H)WMTD1.

We now observe that: Hppc1/Dc2constC/P2.

Therefore, the matrix (1H) is close to the identity (the correction is of order C/P2) and hence invertible (with also the inverse close to the identity). In this approximation, W=MTD1, so that the rescaled (reciprocals of the) qualities of products are given by:

P˜=1+MTD1. (9)

In the same approximation, we obtain the rescaled fitnesses F˜c; since:

F˜c=pMcp1+WppMcp(1Wp),

we have:

F˜=DKD1, (10)

having introduced the co-production matrix K=MMT with elements Kcc=pMcpMcp, representing the number of the same products exported by the two countries c and c.

It is interesting to note how, up to the first order approximation, the values of the fitness of countries are depending on the co-production matrix and diversification only. The goodness of the approximations above can be appreciated in Figure 3 that shows how the relative difference between the numerical values at the fixed point and the approximate solution of Equation (10) is below 0.5% for more than 85% of the countries.

Figure 3.

Figure 3

Numerical vs Analytic relative error: The histogram of the relative difference (F˜c(fixedpoint)F˜c(approximated))/F˜c(fixedpoint) is plotted with the number of countries on the vertical axis. The approximated values are calculated using Equation (10).

It is worth noting that in a recent work the Economic Complexity Index (ECI) defined in Ref. [2] has been connected to the spectral properties of a weighted similarity matrix M˜ resembling our co-production matrix K [13]. This similarity is only apparent since in ECI the matrix M˜ is defined as:

M˜cc=pMcpMcp/DcUpwithUp=cMcpubiquityofproductp,

i.e., it contains a further weighting term (the ubiquity) in the sum defining it. Besides, the two metrics of ECI and Fitness-Complexity differ very much from each other: ECI relies on a linear homogeneous map, while Fitness-Complexity relies on a non-linear and in this work also non-homogeneous map.

3.3. Country Inefficiency and Net-Efficiency

From Equation (10) we deduce that the leading part of fitness F˜c is given by the diversification Dc. The diversification of a country is indeed an important quantity, for the calculation of which we do not need any complicated algorithm. On the other hand, what the non-linear map proposed does, is to quantify how a country manages to successfully differentiate its products, and indirectly offers an estimate of the capabilities of a nation. In fact, a country exporting mainly raw materials would be less efficient with respect to a country exporting high technological goods, when they have the same diversification value. For this reason, we introduce the new quantity Ic=DcF˜c, inefficiency of country c: the smaller the value Ic the more efficient is the diversification it chooses. From the approximate solution displayed in Equation (10), we get that IccKcc/Dc, so that the inefficiency of a country is a weighted average of its co-production matrix elements. The dependence of the country inefficiency on the diversification is displayed in Figure 4, while a visual representation of it is displayed in Figure 5.

Figure 4.

Figure 4

Role of diversification: The country inefficiency Ic=DcF˜c is plotted vs. the diversification Dc with the black line representing the power-law relation IcDc0.75 (linear regression with correlation coefficient 0.994). In the inset the net efficiency Nc, defined as the difference between the black line and the inefficiency of the main graph, is shown. Plotted data pertain to year 2007. We find a similar behaviour for all the years considered with the exponent of Dc between 0.73 and 0.76, and the correlation coefficient between 0.993 and 0.995.

Figure 5.

Figure 5

Inefficiency cartoon: Large ovals represent three countries, while small circles represent products. In this simple example, the inefficiency I1 of country 1 is I1=K12/D2+K13/D3. From the figure we get K12=2 and K13=4, i.e., the number of products exported by both countries (the cardinality of the intersection sets), and the diversifications D1=17, D2=5, D3=20. Thus, I1=2/5+4/20=0.6 and the approximated fitness F˜116.4.

It is interesting to notice how a clear power-law dependence exists between the inefficiency and the diversification of a country.

The structure of the M matrix is such that those countries with high diversification also export low quality goods in average. Therefore to a large diversification would statistically correspond a large inefficiency, though the found power-law is not trivial and depends on the structure of the M. A similar power-law behaviour is found between the fitness calculated with the traditional metric and the diversification, but with a different exponent (from the left panel of Figure 2 we deduce that there is a power-law relation between the fitnesses calculated with the original metric and this new metric, and the exponent is around 1.53; since the fitness Fc calculated with the new metric goes as Dc at the first order, then the old fitnesses also go as Dc1.53). In order to better appreciate the production strategies of countries, we subtracted the common power-law trend of the dependency of the inefficiency on the diversification for each year, changed its sign and plotted the result in the right panel of Figure 6, which thus shows the time evolution of a quantity that we call country net-efficiency Nc (net in the sense opposed to gross) over the years 1995–2014. It interesting to note how countries behave differently over the time lapse considered. Some countries display a decreasing net-efficiency, others an increasing or a constant one. What many of these curves have in common is the decreasing set up around year 2000, more pronounced in the case of higher developed countries, which lie at high net-efficiency in the graph.

Figure 6.

Figure 6

Time evolution of fitness and net-efficiency: (Left panel) Country fitness yearly evolution as estimated by the new metric. (Right panel) yearly time evolution of country net efficiency. The net efficiency is a detrended version of the inefficiency defined in the text and displayed in the inset of Figure 4 for the year 2007. Curves were artificially smoothed by a cubic spline for a better visual representation.

3.4. Local Convergence

From the simulations it is clear that the fixed point obtained by iterating Equation (4) is locally stable. We can also prove it by resorting to the Jacobian of the transformation, in the case of countries and products. First we recall that the sum over the indexes c and p of Equation (4) run from 1 to C and P, respectively, with usually CP. In the case of countries and products C/P101. We also fix ϕc=πp=δ1, so that the fitnesses and the (reciprocals of the) qualities at the fixed point are approximately given by: Fc=F˜c/δ and Pp=δP˜p with F˜c and P˜p the components of the vectors F˜ and P˜ given in Equations (9) and (10) respectively.

Next, we calculate the Jacobian of the transformation at the fixed point, which can be simply expressed as the block anti-diagonal matrix:

J=0MTF2MP20, (11)

having introduced the diagonal matrices F=diag(F1,F2,,Fc) and P=diag(P1,P2,,Pp), respectively. We claim that the spectral radius ρ(J) of the square matrix J is strictly smaller than one. Denoting by σ(J) the spectrum of J, this means that ρ(J):=max{|λ|:λσ(J)}<1. From this it follows [14] that the fixed point is asymptotically stable and the convergence exponentially fast. To prove the claim we consider the square of the Jacobian that can be written as a block diagonal matrix,

J2=MTF2MP200MP2MTF2, (12)

and note that the traces of the two matrices on the diagonal is the same by applying a cyclic permutation. Noticing that FcPp=F˜cP˜p and using the approximate solutions in Equations (9) and (10), we find with simple algebra that:

Tr(J2)=2c,pMc,p2Fc2Pp22c,pMc,p2Dc2=2c1DcCP<1. (13)

Moreover, we can write the two non trivial matrices composing J2 as:

MTF2MP2=P(P1MTF1)(F1MP1)P1=PATAP1 (14)

and:

MP2MTF2=F(F1MP1)(P1MTF1)F1=FAATF1, (15)

with A=F1MP1. The matrices AAT and ATA are symmetric and positive-semidefinite so that their eigenvalues are real and non negative, and the matrices FAATF1 and PATAP1 have the same eigenvalues. Therefore, the eigenvalues of J2 are real and non negative and we can write according to Equation (13):

Tr(J2)=iλi2<1, (16)

with λi eigenvalues of J. Finally, from the preceding equation we have maxλi2<max|λi|<1 so that at the fixed point ρ(J)<1.

3.5. Robustness to Noise

Fitness and complexity (quality) values depend on the structure of the matrix Mcp. Noise can affect its elements by flipping their value. Thus, we test the robustness of the new metric to noise as described in [15]. The idea is to introduce random noise by flipping each single bit of the matrix with probability η, which then is a parameter tuning the noise level. The rank of country fitnesses in presence of noise Rcη is then compared with the rank obtained without noise Rc0. The Spearman correlation ρs is then evaluated between these two sets and shown in Figure 7 as a function of η for both the original and the new metrics: The new metrics show a perfect stability to random noise as the original ones with an unavoidable transition around η0.5, where noise is so strong to alter significantly the structure of the matrix Mcp.

Figure 7.

Figure 7

Noise robustness: Spearman correlation between the ranking of countries based on fitness at zero noise and at different noise levels η (see Section 3.5 in the main text). The performance of the two metrics is practically indistinguishable. Note that at η=1 all the elements of matrix M are flipped so that the perturbed system is perfectly anti-correlated with the original one.

4. Discussion

The proposed new inhomogeneous non-linear metric to estimate economic fitness and complexity defined in Equations (4) and (6) carries many advantages with respect to the original one. The fitnesses and complexities resulting from these two approaches are not identical, but highly correlated to each other as witnessed by the plots in Figure 2. This high correlation between the two metrics ensures that all the studies carried on with the original metric so far, can be obtained by applying this new metric as well.

Besides the stability of the metric and its robustness, one more advantage is that the fitness is well defined also for those countries that have low exportation volumes and that in the original metric had their fitness tending to zero. For those countries it is now possible to undertake a comparative study based on hypothetical investments (changing the elements of the M matrix) so to make predictions on their economic impact.

By first symmetrising the original equations, by adding an inhomogeneous parameter and by rescaling the quantities, one obtains Equation (6), where the parameter can be safely set to zero. This ensures that this new metric is parameter free as the original one. As a pleasant side effect, the fixed point of the map can be well approximated analytically, with an error with respect to the iterative fixed point of less than 3% (see Figure 3). The result is represented by Equations (9) and (10) at the first order (Equations (A2) and (A3) at the second order), which allow for a simple intuitive explanation of the complexity of products and fitness of countries.

Let us discuss Equation (10) first. The result suggests that the fitness of a country is trivially related, at the first order, to its diversification: The more products a country exports, the larger is its fitness, i.e., the more developed its capabilities. This simple explicit dependence of the fitness on the diversification is also an advantage with respect to the original metric, where the dependence was not explicitly clear. The second term of Equation (10), which we call inefficiency, is also very interesting. If a country is the only one to export a given product, the contribution of this product to its fitness is a full one, or in other words, the contribution to the inefficiency is zero. This situation mimics a condition of monopoly on that product and it is logical that the exporting country has the full benefit of it. When a product is exported by multiple nations then it is critical to assess whether those countries export few or many other products (see Figure 5). If a product is exported by a country c with low diversification (low capabilities), then that product is not supposed to be of high complexity. The result is that the ratio Kcc/Dc can be close to one (c=1,c=2 in the figure) and the inefficiency associated to the common products is high, resulting in a small contribution to the fitness of c. The inefficiency can be interpreted in terms of the bipartite network of countries and products: The Kcc counts the number of links that connect countries c and c to the same products, while the differentiation Dc is the node degree of country c. In other words, for a country c the inefficiency counts the links to common products of all other countries and weights them according to the degree of those. To our knowledge, this kind of measure has never been considered in complex networks so far.

Since, statistically, countries with an high diversification also export many less complex products, the inefficiency is an increasing function of the diversification (Figure 4, main graph). If we subtract the general trend, which stems from the structure of the matrix Mcp, we can appreciate the net effect of selecting the goods to export. We call this new de-trended quantity net-efficiency. In this way we somehow remove the negative effect of less valuable products and highlight the contribution of more sophisticated goods. In the inset of Figure 4 we show the net-efficiency as a function of diversification and underline the three nations (Japan, Korea, and Switzerland) that stand out among the others. The time evolution of this new quantity is shown in the right panel of Figure 6. We can combine the time evolution of both fitness and net-efficiency for a given country to determine to what degree they are correlated. Figure 8 shows the two quantities for selected countries. It is clear how these two quantities are not related to each other and represent two complementary information. In fact, the fitness is mainly connected to the product diversification of a country (Equation (10)) while the net-efficiency is connected to how complex are the exported products. In the figure we see two extreme cases represented by Switzerland (CHE) and South Korea (KOR), whose lines are practically orthogonal. While Switzerland have decreased the number of different exported goods in the years but have still exported complex products, South Korea have kept the number of exported products as almost constant but have increased their complexity. The opposite situation of South Korea we notice for Germany (DEU), where the complexity of the exported goods has decreased in time. Interestingly, China (CHN) has systematically increased both the number of exported goods and their complexity, which we interpret as a symptom of a solid economy in expansion.

Figure 8.

Figure 8

Net-efficiency vs fitness: each line corresponds to the time evolution of the connection between the fitness of a country and its net-efficiency in the period between 1995 and 2014. This figure connects the quantities on the vertical axes of the plots displayed in Figure 6. Lines start from a large circle (year 1995) and end with a small one (year 2014).

The complexity of products is estimated by Equation (9) as the reciprocal of the second term of the sum. Since the diversification of a country Dc is a direct measure of its capabilities, we expect to find a simple relation between it and the complexities of products Q˜p. Indeed, if we indicate with ci those countries exporting the product p, for which obviously we have Mcip=1, and with m=cMcp, we can write:

Q˜p1Dc1+1Dc2++1Dcm1

from which we corroborate the main idea that the complexities of products are driven by the countries with low diversification (capabilities) that export it. Just for amusement, we observe how the complexity of products can be considered as the equivalent resistor of a parallel of resistors each one with resistance Dc. Somehow, a high Dc represents an effective resistance to the creation of a product and its export, so that if a country exists with a low diversification exporting it, the effort (resistance) of producing that product is also low.

5. Materials and Methods

5.1. Construction of the M Matrix

We exploit the UN-COMTRADE data set [11], where re-export and re-import fluxes are explicitly declared, allowing us to exclude them from the analysis. As reported by UNSTAT in Ref. [16], the 81.8% of the whole data set (96.8% in case of developed countries) does not account for goods in transit. Moreover, commodities that do not cross borders are not included in the data.

Given the export volumes scp of a country c in a product p one can evaluate the Revealed Comparative Advantage (RCA) indicator [17] defined as the ratio:

RCAcp=scpcscp/pscpcpscp (17)

in this way one can filter out size effects. As described in the Supplementary information of [8], from the time series of the RCA we can evaluate the productive competitiveness of each country in each product by assigning to it a productivity state from 1 to 4. State 1 means that the country does not produce (or is very uncompetitive in producing) a product, state 4 means that it is one of the main producer in the world. We can then project this states onto the binarized matrix Mcp by simply setting its elements to unity whenever a state larger than 2 is encountered, and set them to null otherwise.

Acknowledgments

We acknowledge A. Zaccaria and E. Pugliese for interesting discussions. We are grateful to an anonymous reviewer for suggesting to show Figure 8.

Appendix A. Second Order Expansion of Fitness and Qualities

In this section we compute explicitly the values of F˜c and P˜p for CP up to the second order of magnitude of C/P. Letting ε=C/P, we expand Wp=εWp(1)+ε2Wp(2)+O(ε3). By assuming Dc of the order of P and by using the second order approximation (1+a)11a+a2 twice, Equation (8) implies that:

F˜c=Dc1εpMcpDcWp(1)ε2pMcpDcWp(2)(Wp(1))2+O(ε3),with1cC, (A1)

and:

εWp(1)+ε2Wp(2)=cMcpDc+εc,pMcpDcMcpDcWp(1)+ε2c,pMcpDcMcpDcWp(2)(Wp(1))2+ε2c,p,pMcpDcMcpDcMcpDcWp(1)Wp(1)+O(ε3),with1pP.

By the assumption on the magnitude of Dc, the first sum in the right-hand side is of the order of ε, the second one is of the order of ε2, while the last two sums are of the order ε3. Therefore:

εWp(1)=cMcpDc,ε2Wp(2)=c,pMcpDcMcpDcεWp(1).

Recalling H denotes the square matrix of elements Hpp=cMpcTDc2Mcp (hence Hppε/P) and D1 the column vector with components 1/Dc, we have just showed that W=MTD1+HMTD1+O(ε3). Therefore, in the second order approximation, the rescaled (reciprocals of the) qualities of products are given by:

P˜=1+MTD1+HMTD1. (A2)

In the same approximation, from Equation (A1) we finally calculate the rescaled fitnesses F˜c. Denoting by (MTD1)2 the column vector with components (MTD1)p2 we get:

F˜=DKD1+M(MTD1)2MHMTD1, (A3)

where the co-production matrix K=MMT has been introduced just below Equation (10).

Appendix B. Important Quantities Defined Throughout the Text

C, P: Total number of countries and products
M: Binary matrix with element Mcp=1 if country c is a competitive country in exporting product p; Mcp=0 otherwise; export competitiveness is estimated by means of export volumes
Fc, Qp: Fitness of country c and quality (complexity) of product p at the fixed point
Pp: Inverse of the quality of product p; it is a sort of product “simplicity” (Pp=(Qp)1)
δ: Inhomogeneous parameter; this parameter is crucial in achieving a stable algorithm to evaluate the metrics; it will be eventually let go to 0 to get a parameter free metric
F˜c,P˜p: Rescaled versions of the corresponding un-tilded quantities: F˜c=Fcδ, P˜p=Pp/δ; these quantities do not depend on δ as soon as δ0 and are better suited to represent fitness and complexity rather than the un-tilded ones
Q˜p: Similar to the complexity Qp above, but for the new metrics calculated with the inhomogeneous algorithm: Q˜p=(P˜p1)1
K: Coproduction matrix with element Kcc equal to the number of the same products exported by countries c and c; K=MMT
Dc: Diversification of country c, i.e., the number of products the country c is competitive in exporting
Ic: Inefficiency of country c defined as Ic=DcF˜c; it represents the fitness penalty resulting from exporting goods that are also exported by other countries
Nc: Net-efficiency of country c; it is a de-trended version of the inefficiency; in the dataset considered NcDc0.75Ic; it represents how effectively a country diversifies its exported goods by focusing on products not exported by others, which are usually among the most complex ones

Author Contributions

Formal analysis, P.B.; Investigation, V.D.P.S., D.M. and A.T.; Methodology, V.D.P.S. and L.P.; Project administration, L.P.; Supervision, L.P.; Writing—original draft, V.D.P.S., P.B., D.M., A.T. and L.P.

Funding

This research was partially funded by the Austrian Research Promotion Agency FFG, the EU FP7 Grant 611272 project GROWTHCOM and the CNR PNR Project “CRISIS Lab”. FFG is also covering the costs to publish in open access.

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.Tacchella A., Cristelli M., Caldarelli G., Gabrielli A., Pietronero L. A New Metrics for Countries’ Fitness and Products’ Complexity. Sci. Rep. 2012;2:723. doi: 10.1038/srep00723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hidalgo C.A., Hausmann R. The building blocks of economic complexity. Proc. Natl. Acad. Sci. USA. 2009;106:10570–10575. doi: 10.1073/pnas.0900943106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Tacchella A., Cristelli M., Caldarelli G., Gabrielli A., Pietronero L. Economic complexity: conceptual grounding of a new metrics for global competitiveness. J. Econ. Dyn. Control. 2013;37:1683–1691. doi: 10.1016/j.jedc.2013.04.006. [DOI] [Google Scholar]
  • 4.Cristelli M., Gabrielli A., Tacchella A., Caldarelli G., Pietronero L. Measuring the intangibles: A metrics for the economic complexity of countries and products. PLoS ONE. 2013;8:e70726. doi: 10.1371/journal.pone.0070726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Zaccaria A., Cristelli M., Kupers R., Tacchella A., Pietronero L. A case study for a new metrics for economic complexity: The Netherlands. J. Econ. Interact. Coor. 2016;11:151–169. doi: 10.1007/s11403-015-0145-9. [DOI] [Google Scholar]
  • 6.Saracco F., Di Clemente R., Gabrielli A., Squartini T. Detecting early signs of the 2007–2008 crisis in the world trade. Sci. Rep. 2016;6:30286. doi: 10.1038/srep30286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Cristelli M., Tacchella A., Cader M., Roster K., Pietronero L. On the Predictability of Growth. The World Bank; Washington, DC, USA: 2017. Policy Research Working Paper 8117. [DOI] [Google Scholar]
  • 8.Tacchella A., Mazzilli D., Pietronero L. A dynamical systems approach to gross domestic product forecasting. Nat. Phys. 2018;14:861. doi: 10.1038/s41567-018-0204-y. [DOI] [Google Scholar]
  • 9.Domínguez-García V., Muñoz M.A. Ranking species in mutualistic networks. Sci. Rep. 2015;5:8182. doi: 10.1038/srep08182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cimini G., Gabrielli A., Sylos Labini F. The Scientific Competitiveness of Nations. PLoS ONE. 2014;9:1–11. doi: 10.1371/journal.pone.0113470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.UN Comtrade—International Trade Statistics Database. [(accessed on 8 October 2018)]; Available online: http://comtrade.un.org/
  • 12.Pugliese E., Zaccaria A., Pietronero L. On the convergence of the Fitness-Complexity algorithm. Eur. Phys. J. Spec. Top. 2016;225:1893–1911. doi: 10.1140/epjst/e2015-50118-1. [DOI] [Google Scholar]
  • 13.Mealy P., Doyne Farmer J., Teytelboym A. A New Interpretation of the Economic Complexity Index. [(accessed on 30 September 2018)]; doi: 10.2139/ssrn.3075591. Available online: [DOI]
  • 14.Hasselblatt B., Katok A. A First Course in Dynamics: With a Panorama of Recent Developments. Cambridge University Press; New York, NY, USA: 2003. [Google Scholar]
  • 15.Battiston F., Cristelli M., Tacchella A., Pietronero L. How metrics for economic complexity are affected by noise. Complex. Econ. 2014;3:1–22. [Google Scholar]
  • 16.United Nations, Statistical Division . International Merchandise Trade Statistics: Supplement to the Compliers Manual. United Nations publications; New York, NY, USA: 2008. Overview of national compilation and dissemination practices; pp. 1–26. Chapter 1. [Google Scholar]
  • 17.Balassa B. Trade Liberalisation and “Revealed” Comparative Advantage. Manch. Sch. 1965;33:99–123. doi: 10.1111/j.1467-9957.1965.tb00050.x. [DOI] [Google Scholar]

Articles from Entropy are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES