Abstract
We show that the conventional income inequality indexes assess income inequality incorrectly because of three problems. The unequally distributed (UD) income-based approach solves the problems, decomposes income inequality into two kinds of departure from equality, and provides two indexes. The comprehensive assessment of income inequality requires the integration of two kinds of departure. This paper proposes the relative UD (RUD) income-based approach. The RUD income-based approach combines the cumulative distribution function and quantile function of the RUD income and produces a new index integrating two kinds of departure. We investigate the properties of the new index and demonstrate its applicability through example income distributions.
Keywords: Cumulative distribution function, Income inequality, Norm, Progressive transfer, Quantile function, Unequally distributed income
1. Introduction
The measurement of income inequality has been an important topic in economics. Since the introduction of the Lorenz curve by Lorenz [1], many indexes have been developed to measure the degree of income inequality. We refer the readers to Hao and Naiman [2], Jenkins and Kerm [3], and Cowell [4] for a general overview of income inequality measurement. Though Hao and Naiman [2, p. 42] and Cowell [4, p. 155] provide lists of income inequality indexes, the list is still expanding. The Palma ratio [5], [6] was added by Cobham and Sumner [7]. Gallegati et al. [8] and Clementi et al. [9] added the Zanardi index developed by Zanardi [10] to measure the asymmetry of the Lorenz curve. Henceforth, we refer to these indexes as conventional indexes. We will look into expressions for the conventional indexes in Section 2.
This paper will show that the conventional income inequality indexes assess income inequality incorrectly because of three problems, propose a new approach to measuring income inequality, and develop a new index. This paper is organized as follows. Section 2 presents three problems that make the conventional income inequality indexes incorrect. Section 3 reviews the unequally distributed (UD) income-based approach proposed by Park et al. [11], [12]. The UD income-based approach solves the problems and provides two indexes for two kinds of departure from equality. Section 4 discusses the insufficiency of the UD income-based approach and proposes the relative UD (RUD) income-based approach. The RUD income-based approach provides a new index by evaluating the discrepancy between equality and the combination of the cumulative distribution function (CDF) and quantile function (QF) of the RUD income. We investigate the properties of the new index in Section 5 and demonstrate the applicability of the new index through example income distributions in Section 6. Section 7 presents concluding remarks.
2. Problems of conventional indexes
Suppose that are the incomes of n individuals in a population. The income distribution of the population is written as . The total income and mean income of the income distribution y are denoted by and , respectively. As Cowell (2011, p. 1) defined, inequality is a departure from equality [4]. Equality in income inequality is an income distribution in which all individuals in a population have the same income. Such an income distribution is referred to as perfect equality and is denoted by . Therefore, the income inequality of y is the departure of y from .
In this section, we present three problems of the conventional indexes.
2.1. Mixture of information about equality and inequality
All the conventional indexes do not consider that an income distribution includes information about equality and inequality. To assess income inequality, we need to extract information about inequality from the income distribution. Consider, for example, income distribution , where the total income is 15 and the mean income is 3. Each individual's income is at least 1. That is, 5, one-third of the total income, is equally distributed over five individuals. We can represent this information about equality as . Therefore, decomposes into two distributions and . The distribution carries information about equality, while the distribution carries information about inequality. Similarly, perfect equality corresponding to , , decomposes into and . We should measure the income inequality of by comparing and along with .
The direct comparison between y and without information separation will lead us to the measurement of some mixture of equality and inequality. All the conventional indexes have the information mixture problem. They do not extract information about inequality from the income distribution. Park et al. [11], [12] raised the information mixture problem.
2.2. Variation within distribution
According to the definition mentioned above, income inequality is about the discrepancy between y and . However, most conventional indexes measure the variation within y. Such indexes involve the comparison of y with such as , , and (equivalently ). For example, the most popular Gini coefficient
involves . The coefficient of variation (CV)
and the Pietra index, known as Hoover index, the Robin Hood index, and the Ricci-Schutz index,
involve [13]. The Atkinson index
the Theil index
the generalized entropy index
Herfindahl index
and the mean log deviation
involve [14], [15], [16], [17], [18].
Though is a representative value of , the representation of as accompanies dimension reduction. Due to the dimension reduction, the comparison between y and reflects the comparison between y and incompletely. Moreover, since is the mean of y, the comparison of y with results in measuring the dispersion of . We refer to the dispersion of as the variation within distribution y. The conventional indexes measure the variation within distribution y. Cowell (2011, p. 7) defined an income inequality index as a numerical representation of the interpersonal differences in income within a given population [4]. The conventional indexes are in line with this definition.
We can not measure a departure from equality without equality. We can not measure a departure of y from without . We can not measure the discrepancy between y and without . Inequality of y is a relationship between y and . The variation within y neither require nor describe a relationship with . The variation within y has nothing to do with . Therefore, the conventional indexes are not inequality measures. This variation within distribution problem has never been considered before in other literature.
The Gini coefficient intends to measure the discrepancy between y and . The Gini coefficient compares the Lorenz curves for y and . The Gini coefficient measures the discrepancy between y and by the area enclosed by the Lorenz curves for y and . Therefore, the Gini coefficient does not have the variation within distribution problem. However, the Gini coefficient has the information mixture problem and results in the variation within y.
2.3. Negative incomes
One fundamental assumption, which all the conventional indexes rely on, is that income is non-negative. Perfect inequality refers to an income distribution in which one individual takes all the income and each of the rest takes zero income. Perfect inequality depends on the non-negative income assumption. Perfect inequality is used for computing the upper bound of an income inequality index. For example, the Gini coefficient takes a value between zero and one. The upper bound is the Gini coefficient for perfect inequality. The list of conventional indexes in Cowell (2011, p. 155) shows the upper bounds [4].
However, we frequently encounter negative incomes in reality. Park et al. [12], [19] analyzed the LIS income datasets of forty-two countries. Negative incomes were observed in twenty-seven countries. Negative incomes are collected when the expense of the self-employed exceeds the revenue, and when the debt repayment of an employee is more than his earnings. Negative incomes can result from accounting conventions, tax laws, and data collection procedures that differ from country to country. A negative income in one country can be positive in another country. Conversely, a positive income in one country can be negative in another country. Therefore, negative incomes are valid values and should be dealt with as they are.
Negative incomes incur problems in computing the conventional indexes. The indexes based on information theory and income shares are neither computable nor interpretable [14], [15], [16]. The popular Gini coefficient requires the normalization proposed by Chen et al. [20] and Raffinetti [21]. Usually, the negative or non-positive incomes are adjusted to cope with the problems. Typical adjustments are the deletion of non-positive incomes [2], [4] and the replacement of negative incomes with either zero incomes [22] or arbitrarily small positive incomes [23].
The non-negative income assumption does not represent reality. The indexes developed under unrealistic assumptions can not assess income inequality correctly. Equally problematic is that the inconsistency between reality and the assumption is resolved by adjusting the data. Data adjustment is equivalent to fitting the data into a model. We should fit a model to the data.
3. UD income-based approach
Park et al. [11] introduced the UD income to solve the information mixture problem. Park et al. [12] proposed a UD income-based approach to income inequality measurement. The UD income-based approach allows negative income values and assumes that the total income (equivalently, the mean income) is positive. The approach solves the variation within distribution problem by assessing the discrepancy between either the CDFs, the QFs, or the unscaled Lorenz curves for the UD income distribution and perfect equality.
The UD income-based approach begins with the expression , . We can derive the following from this expression.
-
(i)
of is evenly distributed among the n individuals.
-
(ii)
is unequally distributed among the n individuals as , .
-
(iii)
The unequally distributed portions of the incomes, , , contains information about inequality.
-
(iv)
, , are non-negative, and is zero.
, are called the UD incomes. We denote the UD income distribution by . The total and mean of the UD incomes are
Similarly, the UD income distribution of is obtained as .
We can derive x and from y. We can restore y from x and . Therefore, y is equivalent to x and . Similarly, is equivalent to and . The UD income-based approach focuses on x, , and (equivalently, ) instead of y and . The UD income-based approach assesses the discrepancy between x and in three ways. The first is to evaluate the discrepancy between the CDFs. The CDFs for x and are
| (1) |
and
| (2) |
respectively. Mathematically, the CDFs are step functions depicted in Fig. 1. The departure of from is called the vertical departure. We can measure the magnitude of the vertical departure by and norms. Dorfman [24] and Yitzhaki [25] showed that
where is the Gini coefficient of x. Therefore, and norms of the vertical departure are
To make and unitless, we normalize and by and , respectively. The normalized and , denoted by and , are
| (3) |
Figure 1.
CDFs of UD income distributions x and xpe.
The second is to evaluate the discrepancy between the QFs. The QFs for x and are
| (4) |
and
| (5) |
The QFs are also step functions depicted in Fig. 2. The departure of from is called the horizontal departure. Similarly, we can measure the magnitude of the horizontal departure by normalized and norms. and norms for the horizontal departure are
where is the variance of the UD income. Normalization of and by results in
| (6) |
Figure 2.
QFs of UD income distributions x and xpe.
The third is to evaluate the discrepancy between the unscaled Lorenz curves. Park et al. [12] showed that the normalized norm of the discrepancy between the unscaled Lorenz curves of x and is equivalent to . It is because the Lorenz curve derives from the QF. and were proposed as income inequality index.
The noteworthy points of the UD income-based approach are as follows:
-
(i)
Income inequality breaks down into two kinds of departure from equality. The kind of departure depends on how to represent a distribution.
-
(ii)
and consist of two components. One is associated with the locations of y and x. The other is either or associated the variation within y. The variation within y is a part of income inequality.
-
(iii)
It is well-known that the progressive transfer preserving reduces both and . Thus, such a transfer has been considered an effective method to improve income inequality. The progressive transfer preserving is such a transfer. If the poorest is not the beneficiary of the transfer, does not change. Equations (3) and (6) show that the progressive transfer preserving reduces the horizontal departure, but increases the vertical departure. Consequently, the progressive transfer preserving does not guarantee the improvement of income inequality.
We will discuss the effect of progressive transfers further in Section 5.
4. RUD income-based approach
The UD income-based approach reviewed in the previous section solves the problems mentioned in Section 2. However, it provides two indexes for two kinds of departure from perfect equality. It is difficult to evaluate income inequality comprehensively. We need to integrate two kinds of departure. The vertical departure is unitless, while the horizontal departure is a monetary unit. Therefore, the integration of the two kinds of departure is not straightforward. We thus introduce the RUD income which is the UD income relative to the mean income and defined as . The RUD income distribution for y and the corresponding perfect equality are denoted by and . The CDFs and QFs for z and are defined as Equations (1), (2), (4), and (5) with x replaced with z. If we replace x by z in Figure 1, Figure 2, we have the graphs for the CDFs and QFs for z and .
To take both kinds of departure into income inequality measurement, we combine the CDF and QF. The combination of the CDF and QF is the superimposition of the CDF and the QF. We denote the combinations for z and by and , which are depicted in Fig. 3.
Figure 3.
Combinations of CDFs and QFs of RUD income distributions z and zpe.
The coordinates of an arbitrary point on are . Since provides the vertical and horizontal baselines, we can assess the vertical and horizontal departures of . The horizontal and vertical departures of are z and , respectively. Note that on corresponds to perfect equality. The horizontal and vertical departures of are integrated into that is the discrepancy of from . If is on the step height of , and squared norms of are and . If is on the step width of , and squared norms of are and . Therefore, and norms of the discrepancy between and are obtained as
| (7) |
and
| (8) |
where , , and are the mean, Gini coefficient, and CV of z, respectively. Next we compute and norms of the discrepancy between and . A point on the height of is for , while a point on the width of is for . Their discrepancies from are and , respectively. and squared norms of are and . and squared norms of are z and . Therefore, and norms of the discrepancy between and are obtained as
| (9) |
and
| (10) |
We derive from Equations (7)-(10) two indexes, and , which are
and
Since and , we can write as
and integrate two kinds of departure. and are greater than or equal to zero. and are zero for perfect equality. We can think of more unequal income distributions for any income distribution because negative income values are allowed. Therefore, the and indexes are not bounded above. is the total UD income ratio relative to the total income. We can interpret as the average distance between z and . The dispersion measures such as , , and assess the variation within z, that is, how unevenly the total RUD income is distributed. integrates the average distance between z and and the variation within z. We propose as an income inequality index.
5. Properties of index
We described in the previous section the basic properties and interpretation of the index. Many studies commonly mention that scale invariance, replication invariance, the anonymity axiom, and the principle of progressive transfers are the desirable properties of an income inequality index [2], [3], [4], [26], [27], [28]. As explained in Subsection 2.2, such studies consider the variation within y. Ebert [27, p. 366] mentions that these properties are about inequality within a population. In general, scale invariance, replication invariance, and the anonymity axiom are also desirable properties for the discrepancy between y and . However, we should investigate whether the principle of progressive transfers is a desirable property for an income inequality index. Because a decrease of the variation within y by progressive transfers does not mean a decrease of the discrepancy between y and . In this section, we examine the index for these properties.
Multiplication of y by some results in a new income distribution . Then and , where is the total income for . Therefore, , equals , and s for and y are the same. The index is scale-invariant.
Let be an income distribution induced by replication of y. Then y and have the same mean and minimum. The RUD income distribution for is simply replication of z. Therefore, equals , and s for and y are the same. The index is replication-invariant.
Suppose that is an income distribution obtained by permutating s in y. Permutation does not change the mean and minimum. Therefore, , and s for and y are the same. The index satisfies the anonymity axiom.
The principle of progressive transfers is that some income transfer from a high-income person to a low-income person reduces inequality, provided that the total income and the post-transfer ranking remain the same. Consider an income distribution y with . Then and . Suppose that is transferred from to and for some . Denote the resulting income distribution by . This progressive transfer preserves the mean, minimum, and ranking. The corresponding RUD income distribution for is . Then, and are different only for and as shown in Fig. 4. We can derive the following from Fig. 4.
Figure 4.
Difference between and .
-
(i)
The horizontal departure increases from to for , while it decreases from to for . The amount increased, ϵ, equals the amount decreased. Intervals, and , have the same length.
-
(ii)
The vertical departure increases from to for , while it decreases from to for . The amount increased, , equals the amount decreased. Intervals, and , have the same length.
The departure increase for an interval accompanies the departure decrease for another interval of the same length. The amount increased equals the amount decreased. Therefore, it is not evident that progressive transfers reduce the departure from perfect equality. The effect of progressive transfers depends on , , ϵ, and how to integrate the vertical and horizontal departures. If we integrate by the norm, and have the same departure from . It is because z and have the same mean. Consequently, y and have the same income inequality. If we integrate by the norm, the departure of from can be larger or smaller than the departure of from . We will present in the next section several progressive transfers showing different effects on the departure of from . Progressive transfers can fail to reduce the discrepancy between y and . The principle of progressive transfers does not apply to the RUD income-based approach.
6. Application of index
This section applies the proposed index to the example income distributions listed in Table 1. We present , , , and its components , , , and in Table 1. [12] used the first eight income distributions , to demonstrate the deficiencies of and and the applicability of and . We show the insufficiency of and and the applicability of . We include an additional income distribution . All the income distributions except have , , and . Each of the distributions derives from others by a series of transfers. Using these income distributions, we investigate the effect of progressive transfer. has a negative income value. Replacing the negative income with zero, we obtain from .
Table 1.
Example income distributions.
| y | |||||||
|---|---|---|---|---|---|---|---|
| 0.2933 | 0.5578 | 0.6667 | 1.3333 | 0.6110 | 0.8692 | 0.4408 | |
| 0.2933 | 0.5578 | 1.0000 | 1.6667 | 0.8406 | 1.1450 | 0.6035 | |
| 0.2133 | 0.4216 | 0.3333 | 1.0000 | 0.3464 | 0.5375 | 0.2206 | |
| 0.2133 | 0.4216 | 0.6667 | 1.3333 | 0.6733 | 0.7888 | 0.4230 | |
| 0.1867 | 0.3651 | 0.6667 | 1.0000 | 0.6928 | 0.7601 | 0.4967 | |
| 0.1067 | 0.2108 | 0.3333 | 0.6667 | 0.4761 | 0.3944 | 0.2451 | |
| 0.2667 | 0.6667 | 1.3333 | 1.6667 | 1.0328 | 1.4907 | 0.9029 | |
| 0.2000 | 0.5000 | 1.0000 | 1.2500 | 0.8944 | 1.1180 | 0.7498 | |
| 0.2667 | 0.4714 | 0.6667 | 1.3333 | 0.6325 | 0.8165 | 0.4200 |
According to and , the income inequality of is the same as . and also evaluate the income inequality of and the same. and fail to differentiate these two sets of income distributions concerning income inequality. also fails to differentiate and . Besides the three problems in Section 2, and are not sensitive to the change in distribution. By contrast, the index is sensitive to the change in distribution enough to successfully differentiate these income distributions. and measure the variation within income distribution. Two income distributions with the same within variation can have different distances from equality, consequently different income inequalities. Since and for are smaller than , we can say that is less unequal than . Considering and , we can differentiate between and , between and , and between and . Note that gives the same comparison result with the simultaneous use of and .
Next, we make an inequality comparison between and . says that is less unequal than , while says that is more unequal than . The comparison by conflicts with the comparison by . We observe similar conflicting results when comparing with , and with . Conflicting results can happen because and measure different kinds of departure from equality. and are not sufficient for the comprehensive income inequality comparison. According to the index, (, ) is less unequal than (, ).
We can compute and when there are negative income values. However, negative income values are usually adjusted to meet the non-negative income assumption. If we delete a negative value in , becomes perfect equality. says that is the most unequal among nine distributions in Table 1. If we replace a negative value in with zero, we have . Comparing and its components for and , we see that such adjustment incurs the underestimation of income inequality.
Finally, we investigate the effect of progressive transfer in the sense of . derives from by the transfer from the richest to the second richest. This transfer decreases and , increases , and does not change . says that the overall income inequality increases. It is noteworthy that the transfer between rich individuals can worsen income inequality. derives from by the transfer between the middle class. This transfer does not change and , increases , decreases , and decreases . derives from by the transfer between the middle class. These transfers do not change and , increases , decreases , and increases . derives from by the transfer from the richest to the poorest. derives from by the transfer from the middle class to the poorest. The transfer involving the poorest decreases and all of its components. In summary, progressive transfers can fail to improve income inequality. However, we should note that the progressive transfer is essential for improving income inequality. We need to examine whether the progressive transfer improves income inequality before application.
7. Conclusions
An important topic in economics is the measurement of income inequality. We showed that the conventional income inequality indexes assessed income inequality incorrectly because of three problems presented in Section 2. We raised the variation within distribution problem for the first time. The conventional indexes measure the variation within the given income distribution. By contrast, the UD income-based approaches intend to measure the discrepancy between two distributions. The UD income-based approach first extracts information about inequality by deriving UD income distributions for the given income distribution and perfect equality. Then it focuses on the discrepancy between two UD income distributions. The UD income-based approach compares the CDFs and QFs of the UD income distributions. It breaks down income inequality into two kinds of departure from equality and provides two indexes. It is unsuccessful in integrating two kinds of departure and assessing income inequality comprehensively. We proposed the RUD income-based approach, developed the index, and examined the properties of the index. We demonstrated the applicability of the index and the failure of progressive transfers to improve income inequality.
To apply the index in practice, we need an estimator of the index. This study does not provide an estimator of the index. Some components of the index such as the minimum and the range are not easy to estimate. The development of a good estimator can be a challenging task. In addition, we need to identify the characteristics of the progressive transfers improving income inequality. The characteristics will help formulate policies for improving inequality.
CRediT authorship contribution statement
Youngsoon Kim: Writing – original draft, Visualization, Methodology. Joongyang Park: Writing – review & editing, Writing – original draft, Validation, Methodology, Formal analysis, Conceptualization. Ae-Jin Ju: Visualization, Software.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Data availability
No datasets were created or analyzed in this study.
References
- 1.Lorenz O. Methods of measuring the concentration of wealth. Publ. Am. Stat. Assoc. 1905;9:209–2019. [Google Scholar]
- 2.Hao L., Naiman D.Q. Sage Publications, Inc.; 2010. Assessing Inequality. [Google Scholar]
- 3.Jenkins S.P., Kerm P.V. In: The Oxford Handbook of Economic Inequality. Nolan B., Salverda W., Smeeding T.M., editors. Oxford University Press; New York: 2011. The measurement of economic inequality; pp. 40–67. [Google Scholar]
- 4.Cowell F.A. 3rd ed. Oxford University Press; New York: 2011. Measuring Inequality. [Google Scholar]
- 5.Palma J.G. UN Department of Economic and Social Affairs; New York: 2006. Globalizing Inequality: Centrifugal and centripetal forces at work. DESA Working Paper 35. [Google Scholar]
- 6.Palma J.G. Dev. Change. 1. Vol. 42. University of Cambridge Department of Economics; Cambridge: 2011. 2011. Homogeneous middles vs. heterogeneous tails, and the end of the inverted-U: the share of the rich is what it's all about; pp. 87–153. Cambridge Working Papers in Economics 1111. later published in. [Google Scholar]
- 7.Cobham A., Sumner A. Center for Global Development (CGD); Washington DC: 2013. Is it all about the tails? The Palma measure of income inequality. Working Paper 343. [Google Scholar]
- 8.Gallegati M., Landini S., Stiglitz J.E. Columbia Business School; New York: 2016. The inequality multiplier. Research paper 16–29. [Google Scholar]
- 9.Clementi F., Gallegati M., Gianmoena L., Landini S., Stiglitz J.E. Mis-measurement of inequality: a critical reflection and new insights. J. Econ. Interact. Coord. 2019;14:891–921. [Google Scholar]
- 10.Zanardi G. Della asimmetria condizionata delle curve di concentrazione. Lo scentramento. Riv. Ital. Econ. Demogr. Stat. 1964;18:431–466. [Google Scholar]
- 11.Park J., Kim Y., Heo S. Dual-index measurement of income inequality. Bull. Econ. Res. 2018;70(3):277–284. [Google Scholar]
- 12.Park J., Kim Y., Ju A. Measuring income inequality based on unequally distributed income. J. Econ. Interact. Coord. 2021;16:309–322. [Google Scholar]
- 13.Schutz R.R. On the measurement of income inequality. Am. Econ. Rev. 1951;41:107–122. [Google Scholar]
- 14.Atkinson A.B. On the measurement of inequality. J. Econ. Theory. 1970;2:244–263. [Google Scholar]
- 15.Theil H. North-Holland; Amsterdam: 1967. Economics and Information Theory. [Google Scholar]
- 16.Shorrocks A.F. The class of additively decomposable inequality measures. Econometrica. 1980;48:613–625. [Google Scholar]
- 17.Herfindahl O. Columbia University; New York: 1950. Concentration in the steel industry. Dissertation. [Google Scholar]
- 18.Haughton J., Kandker S.R. The World Bank; Washington, DC: 2009. The Handbook on Poverty and Inequality. [Google Scholar]
- 19.http://www.lisdatacenter.org Luxembourg Income Study (LIS) Database, LIS.
- 20.Chen C.N., Tsau T.W., Rhab T.S. The Gini coefficient and negative income. Oxf. Econ. Pap. 1982;34:473–478. [Google Scholar]
- 21.Raffinetti E., Siletti E., Vernizzi A. On the Gini coefficient normalization when incomes with negative values are considered. Stat. Methods Appl. 2015;24(3):507–521. [Google Scholar]
- 22.OECD OECD framework for statistics on the distribution of household income, consumption and wealth. 2013. https://doi.org/10.1787/9789264194830-en available at.
- 23.Bellù L.G., Liberati P. Describing income inequality: Theil index and entropy class indices. 2006. http://www.fao.org/docs/up/easypol/445/theil_index_051en.pdf available at.
- 24.Dorfman R. A formula for Gini coefficient. Rev. Econ. Stat. 1979;61:146–149. [Google Scholar]
- 25.Yitzhaki S. More than a dozen alternative ways of spelling Gini. Res. Econ. Inequal. 1998;8:13–30. [Google Scholar]
- 26.Foster J.E. An axiomatic characterization of the Theil measure of income inequality. J. Econ. Theory. 1983;31:14–16. [Google Scholar]
- 27.Ebert U. A family of aggregative compromise inequality measures. Int. Econ. Rev. 1988;29(2):363–376. [Google Scholar]
- 28.Litchfield L.A. World Bank; Washington DC: 1999. Inequality: Methods and Tools.http://siteresources.worldbank.org/INTPGI/Resources/Inequality/litchfie.pdf available at. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
No datasets were created or analyzed in this study.




