An investigation of the performance of parametric functional forms for the Lorenz curve

Thitithep Sitthiyot; Kanyarat Holasut

doi:10.1371/journal.pone.0287546

. 2023 Jun 23;18(6):e0287546. doi: 10.1371/journal.pone.0287546

An investigation of the performance of parametric functional forms for the Lorenz curve

Thitithep Sitthiyot ^1,^*, Kanyarat Holasut ²

Editor: Ayesha Afzal³

PMCID: PMC10289384 PMID: 37352250

Abstract

Given that an excellent performance of any parametric functional form for the Lorenz curve that is based on a single country case study and a limited range of distribution must be treated with great caution, this study investigates the performance of a single-parameter functional form proposed by Paul and Shankar (2020) who use income data of Australia to show that their functional form is superior to the other existing widely used functional forms considered in their study. By using both mathematical proof and empirical data of 40 countries around the world, this study demonstrates that Paul and Shankar (2020)’s functional form not only fails to fit the actual observations well but also is generally outperformed by the other popular functional forms considered in their study. Moreover, to overcome the limitation of the performance of a single-parameter functional form on the criterion of the estimated Gini index, this study employs a functional form that has more than one parameter in order to show that, by and large, it performs better than all popular single-parameter functional forms considered in Paul and Shankar (2020)’s study. Thus, before applying any functional form to estimate the Lorenz curve, policymakers should check if it could describe the shape of income distributions of different countries through the changes in parameter values and yield the values of the estimated Gini index that are close to their observed data. Using a functional form that does not fit the actual observations could adversely affect inequality measures and income distribution policies.

Introduction

The Lorenz curve was originally developed by an American economist named Max O. Lorenz in 1905 as a method for measuring wealth concentration [1]. It represents a graphical relationship between the cumulative normalized rank of population from the poorest to the richest and the cumulative normalized wealth held by these population from the poorest to the richest. For more than a century, the Lorenz curve has been widely used for illustrating the distribution of income and for examining inequality in such distribution [2]. It has also performed an important role in gauging and comparing inequality in income distribution [3] as typically measured by the Gini index which can be computed from the Lorenz curve as shown in Fig 1.

The Gini index can be calculated as the ratio of the area between the perfect equality line and the Lorenz curve (A) divided by the total area under the perfect equality line (A + B) [4]. The Gini index takes the value between 0 and 1. The closer the index is to 0 (where the area A is small), the more equal the distribution of income. The closer the index is to 1 (where the area A is large), the more unequal the distribution of income.

According to Sitthiyot and Holasut [2], the Lorenz curve can be estimated 1) by using interpolation techniques, 2) by assuming a specified functional form for income distribution and deriving the respective Lorenz curve, and 3) by specifying a parametric functional form for the Lorenz curve. Given the interpolation techniques underestimate inequality unless the individual data are available and no existing statistical distribution has proved to be adequate for representing the entire income distribution, numerous studies have proposed a variety of parametric functional forms in order to directly estimate the Lorenz curve [3, 5–31].

While a number of studies propose a variety of parametric functional forms for estimating the Lorenz curve, the purpose of this study is to investigate an alternative single-parameter functional form for the Lorenz curve proposed by Paul and Shankar [3]. By utilizing the data on the Australian decile income shares between 2001 and 2010 that are constructed using the individual income data from the first 10 waves of Household, Income, and Labour Dynamics Australia Survey, Paul and Shankar [3] show that, on the basis of mean-squared error (MSE) and information inaccuracy measure (IIM), their functional form outperforms the other existing widely used functional forms, namely, Kakwani and Podder [5], Aggarwal [9], Chotikapanich [16], and a functional form implied by Pareto distribution. Moreover, the Gini index in all 10 years, calculated using the functional form that Paul and Shankar [3] propose, rank second closely behind those estimated using Aggarwal [9]’s functional form.

According to Jordá et al. [32], an excellent performance of any parametric functional form for the Lorenz curve that is relied on a single country case study and a limited range of distribution should be treated with extreme caution. Jordá et al. [32] also note that previous studies by [33, 34] also point out the similar caveat. Given the superiority of Paul and Shankar [3]’s functional form over the other existing widely used functional forms, it is therefore relevant and worthwhile to conduct an examination to find out if we use grouped income data of other countries, the performance of Paul and Shankar [3]’s functional form is still superior to the other existing widely used functional forms considered in their study. The findings from this investigation should also serve as a check-and-balance not only for economics but also for other scientific disciplines that use the Lorenz curve to analyze size distributions of non-negative quantities and inequalities. This is because a good functional form for the Lorenz curve must describe the shape of income distributions of different countries, regions, socioeconomic groups, and in different time periods through the changes in parameter values [35]. In addition, it should provide a good fit of the entire range of income distribution since all observations are relevant for an accurate measurement of income inequality, supporting social and income policies, as well as determining taxation structure [35]. Provided that various studies on the relationship between inequality measures and financial and/or socioeconomic variables, for example [36–39], rely on the accuracy of inequality measures that could possibly be derived from a parametric functional form for the Lorenz curve, if the choice of parametric functional form is not a valid candidate for representing the income distribution, the estimates on the income shares and inequality measures might be severely affected by misspecification bias [32].

Our findings indicate that, despite its parsimony, the single-parameter functional form for the Lorenz curve proposed by Paul and Shankar [3] has two serious limitations. First, it fails to fit the actual observations well when the inequality in income distribution as measured by the Gini index is lower than the critical threshold which is found to be 0.4180. We verify the first limitation by demonstrating both mathematically and empirically that when a country’s Gini index is lower than 0.4180, not only does the functional form proposed by Paul and Shankar [3] not perform well but also all other popular functional forms considered in their study, by and large, perform far better than Paul and Shankar [3]’s specified functional form on the basis of the coefficient of determination (R²), MSE, IIM, and the estimated Gini index. The numerical example and the income data of 20 Organization for Economic Co-operation and Development (OECD) countries are employed in order to illustrate the first limitation. Second, on the basis of R², MSE, IIM, and the estimated Gini index, the performance of Paul and Shankar [3]’s functional specification, compared to the other existing widely used functional forms considered in their study, is at best mixed when the upper part of a country’s income distribution has a long-tailed property which is defined as the income share of the top 20% being greater than 50% of total income share. The income data of the other 20 different countries around the world whose upper part of income distribution has a long-tailed property are employed in order to illustrate the second limitation.

While Paul and Shankar [3] show that the values of the Gini index estimated using the functional form that they propose rank second closely behind those estimated using Aggarwal [9]’s functional form, this study would like to note that, to evaluate the performance of a functional form for the Lorenz curve on the basis of the estimated Gini index, a functional form that contains more than one parameter is required since the curvature of the estimated Lorenz curve has to be adjustable so that it would fit the actual data as much as possible while keeping the value of the estimated Gini index the same. According to Dagum [35], this cannot be done by using a single-parameter functional form because the estimated Gini index would be a monotonic function of it. In order to illustrate that a functional form that has more than one parameter, by and large, gives the values of the estimated Gini index closer to the actual observations than single-parameter functional forms, a two-parameter functional form proposed by Sitthiyot and Holasut [2] is employed in order to estimate the Lorenz curve and calculate the value of the estimated Gini index using the same set of grouped income data of 40 countries.

Materials and methods

To examine the performance of Paul and Shankar [3]’s functional form, let p denote the cumulative normalized rank of income and L(p,γ) denote the cumulative normalized income for a given value of parameter γ. The functional form for the Lorenz curve proposed by Paul and Shankar [3] is as follows:

L (p, γ) = p * [\frac{e^{- γ (1 - e^{p})} - 1}{e^{- γ (1 - e)} - 1}], γ > 0

(1)

Given that the value of parameter γ must be greater than 0, the first limitation of this functional form could be demonstrated mathematically by setting the value of parameter γ to be 0 and then using Eq (1) to calculate the values of L(p,γ) for different values of p which are 0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and 1.0. All estimated values of L(p,γ) would be used to calculate the area under the Lorenz curve using the numerical integration since the above specified functional form does not have a closed-form expression for the Gini index. The next step is to compute the value of the Gini index that corresponds to the value of parameter γ = 0. The value of the Gini index, given γ = 0, would be the critical threshold where the specified functional form would not be able to fit the actual observations well below this critical threshold since the value of parameter γ cannot be 0 or negative.

To illustrate the first limitation of Paul and Shankar [3]’s functional form empirically as well as to evaluate its performance compared to those of the other popular functional specifications considered in their study, namely, Kakwani and Podder [5], Aggarwal [9], Chotikapanich [16], and a functional form implied by Pareto distribution, this study uses the data on the observed Gini index and the decile income shares of 20 OECD countries from the United Nations University-World Income Inequality Database (UNU-WIID) [40]. The reason that we choose the OECD countries is because the values of the observed Gini index of these countries are lower than the critical threshold. We would like to note that, based on the mathematical proof as described above, this critical threshold should apply to not only income distribution of any country and in any time period but also income distribution generated by using any simulation method as long as the value of the Gini index is less than 0.4180, not just to the income distributions of 20 OECD countries employed in this study.

For the second limitation of Paul and Shankar [3]’s functional form where it does not perform well compared to the other popular functional forms when a country’s income distribution has a long-tailed property, the data on the observed Gini index and the income shares by decile of the other 20 different countries around the world from the UNU-WIID [40], the Office of the National Economic and Social Development Council (NESDC), Thailand [41], and the World Bank [42] are used to illustrate this limitation. These countries are chosen because the upper part of income distribution is long-tailed where the top 20% of income earners receive more than 50% of the total income as defined in Introduction.

In this study, the performance of functional form proposed by Paul and Shankar [3] is compared to the other existing widely used functional specifications considered in their study which are Kakwani and Podder [5], Aggarwal [9], Chotikapanich [16], and a functional form implied by Pareto distribution. These functional forms are expressed as follows:

Kakwani and Podder [5]:

L (p, δ) = p e^{- δ (1 - p)}, δ > 0

(2)

Aggarwal [9]:

L (p, θ) = \frac{{(1 - θ)}^{2} p}{{(1 + θ)}^{2} - 4 θ p}, 0 < θ < 1

(3)

Chotikapanich [16]:

L (p, k) = \frac{e^{k p} - 1}{e^{k} - 1}, k > 0

(4)

Functional form implied by Pareto distribution:

L (p, α) = 1 - {(1 - p)}^{\frac{1}{α}}, α > 1

(5)

Paul and Shankar [3, p. 1396] note that their alternative functional form for the Lorenz curve, Kakwani and Podder [5]’s functional form, and a functional form implied by Pareto distribution do not have a closed-form expression for the Gini index, hence the estimated Gini index has to be calculated by using the numerical integration. Provided that a good functional form for the Lorenz curve should have a closed-form expression for the Gini index [35], this study finds out that only does Paul and Shankar [3]’s functional form not have an explicit mathematical solution for the Gini index. For the other existing widely used functional forms considered in their study, the Gini index can be directly computed since these functional forms have closed-form expressions for the Gini index which are as follows:

Kakwani and Podder [5]:

G i n i i n d e x = 1 - \frac{2 (δ - 1)}{δ^{2}} - \frac{2 e^{- δ}}{δ^{2}}

(6)

Aggarwal [9]:

G i n i i n d e x = \frac{{(1 + θ)}^{2}}{2 θ} [\frac{{(1 - θ)}^{2}}{4 θ} l n {(\frac{1 - θ}{1 + θ})}^{2} + 1] - 1

(7)

Chotikapanich [16]:

G i n i i n d e x = \frac{(k - 2) e^{k} + (k + 2)}{k (e^{k} - 1)}

(8)

Functional form implied by Pareto distribution:

G i n i i n d e x = \frac{α - 1}{α + 1}

(9)

This study uses the curve fitting technique based on minimizing error sum of squares to estimate the Lorenz curve. The statistical measures of goodness-of-fit, namely, MSE and IIM are employed for assessing the performance of all functional forms, both of which are the same criteria as used in Paul and Shankar [3]’s study. In addition, this study uses R² and the estimated Gini index to evaluate the performance of all functional forms. The closer the value of R² is to 1 and the closer the value of MSE is to zero, the better the estimated functional form. For the IIM criterion, the estimated functional specification that has a smaller absolute value of IIM is better than those with larger absolute values of IIM. In this study, the Microsoft Excel Data Analysis program and the Microsoft Excel Solver program are used for calculating the descriptive statistics on the Gini index and the income shares by decile as well as estimating the parameters and calculating the estimated Gini index. As suggested by Dagum [35], from a perspective of computational cost and the acceptance of the specified functional form in practice, a simple method of parameter estimation is always an advantage. Table 1 reports the descriptive statistics of the observed Gini index and the decile income shares of 20 OECD countries whereas those of the other 20 countries are reported in Table 2.

Table 1. The descriptive statistics of the observed Gini index and the decile income shares of 20 OECD countries whose values of the observed Gini index are below the critical threshold of 0.4180.

Indicator	Observed Gini index	Decile income shares
Indicator	Observed Gini index	D1	D2	D3	D4	D5	D6	D7	D8	D9	D10
Mean	0.2810	0.0326	0.0536	0.0646	0.0744	0.0843	0.0948	0.1065	0.1212	0.1434	0.2247
Median	0.2805	0.0320	0.0540	0.0645	0.0745	0.0840	0.0950	0.1065	0.1210	0.1420	0.2255
Mode	0.3090	0.0320	0.0580	0.0640	0.0760	0.0850	0.0950	0.1080	0.1210	0.1420	0.2240
Minimum	0.2320	0.0200	0.0460	0.0570	0.0680	0.0790	0.0910	0.1020	0.1160	0.1360	0.1910
Maximum	0.3270	0.6510	0.0620	0.0740	0.0820	0.0930	0.1000	0.1110	0.1300	0.1590	0.2450
Standard deviation	0.0268	0.0053	0.0046	0.0044	0.0037	0.0032	0.0024	0.0024	0.0033	0.0061	0.0152
Number of countries	20	20	20	20	20	20	20	20	20	20	20

Open in a new tab

Table 2. The descriptive statistics of the observed Gini index and the decile income shares of the other 20 countries whose upper part of the income distribution has a long-tailed property defined as the income share of the top 20% being greater than 50% of total income share.

Indicator	Income share of the top 20%	Observed Gini index	Decile income shares
Indicator	Income share of the top 20%	Observed Gini index	D1	D2	D3	D4	D5	D6	D7	D8	D9	D10
Mean	0.5576	0.5065	0.0140	0.0263	0.0363	0.0455	0.0561	0.0688	0.0852	0.1100	0.1545	0.4032
Median	0.5541	0.5070	0.0126	0.0257	0.0348	0.0452	0.0561	0.0703	0.0858	0.1110	0.1572	0.3989
Mode	N/A	0.5400	0.0100	0.0260	0.0342	N/A	0.0600	0.0680	0.0870	0.1150	0.1510	N/A
Minimum	0.5112	0.4470	0.0090	0.0179	0.0250	0.0331	0.0428	0.0556	0.0742	0.0979	0.1331	0.3529
Maximum	0.6374	0.5907	0.0264	0.0375	0.0452	0.0537	0.0646	0.0769	0.0933	0.1180	0.1690	0.4725
Standard deviation	0.0322	0.0359	0.0043	0.0043	0.0047	0.0049	0.0050	0.0051	0.0053	0.0060	0.0108	0.0328
Number of countries	20	20	20	20	20	20	20	20	20	20	20	20

Open in a new tab

Results

The mathematical proof of the critical threshold for the Gini index

This study first demonstrates the mathematical proof of the first limitation of Paul and Shankar [3]’s functional form. According to their specified functional form with the values of p being equal to 0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and 1.0, Table 3 reports different values of the Gini index (rounding up to 7 decimal places) for different values of parameter γ ranging between −0.000001 and 2.0.

Table 3. The values of the Gini index calculated using different values of parameter γ based on Paul and Shankar [3]’s functional form.

Parameter γ	Gini index
-0.000001	0.4179801
0	0.4179803
0.000001	0.4179804
0.00001	0.4179818
0.0001	0.4179955
0.001	0.4181326
0.01	0.4195044
0.1	0.4332468
0.5	0.4941095
1.0	0.5665420
1.25	0.6000009
1.5	0.6311545
2.0	0.6859776

Open in a new tab

The numerical results in Table 3 indicate that when the value of parameter γ is equal to 0 (the average value of parameter γ when γ = −0.000001 and 0.000001), the critical threshold for the Gini index is equal to 0.4179803 (the average value of the Gini index when the Gini index = 0.4179801 and 0.4179804). Let the value of the critical threshold for the Gini index = 0.4180 for the purpose of demonstration. This implies that when the level of inequality in income distribution as measured by the observed Gini index is less than the critical threshold, using Paul and Shankar [3]’s functional form for approximating the Lorenz curve and calculating the Gini index would result in the value of the estimated Gini index being equal to 0.4180. Provided that this is the mathematical proof, it is thus always true not only for income distribution of any country and in any time period but also income distribution generated by using any simulation technique so long as the value of the Gini index is less than 0.4180.

The performance comparison between Paul and Shankar [3]’s functional form and the other popular functional forms when the Gini index is below the critical threshold

In order to show the first limitation of Paul and Shankar [3]’s functional form empirically, this study employs the data on decile income shares of 20 OECD countries from the UNU-WIID [40] and estimate the Lorenz curve using their functional form and the other existing widely used functional specifications considered in their study, namely, Kakwani and Podder [5], Aggarwal [9], Chotikapanich [16], and a functional form implied by Pareto distribution. The estimated values of parameters for each functional form are reported in Table A1 in S1 Appendix. We then compare the performance of Paul and Shankar [3]’s functional form to those of the other existing widely used functional forms on the criteria of R², MSE, IIM, and the estimated Gini index. Tables 4–7 reports our results.

Table 4. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on R².

Country	Year	Observed Gini index	Coefficient of determination (R²)
Country	Year	Observed Gini index	PS	KP	A	C	Pareto
Slovakia	2017	0.2320	0.8760	0.9989	0.9986	0.9988	0.9918
Slovenia	2017	0.2370	0.8827	0.9990	0.9993	0.9992	0.9931
Czech Republic	2017	0.2450	0.8851	0.9974	0.9989	0.9978	0.9958
Finland	2017	0.2530	0.8958	0.9975	0.9990	0.9980	0.9951
Belgium	2017	0.2600	0.9094	0.9989	0.9992	0.9991	0.9915
Norway	2017	0.2610	0.9039	0.9975	0.9984	0.9978	0.9934
Netherlands	2017	0.2710	0.9151	0.9978	0.9989	0.9982	0.9933
Denmark	2017	0.2760	0.9151	0.9967	0.9983	0.9972	0.9940
Austria	2017	0.2790	0.9244	0.9978	0.9987	0.9981	0.9921
Sweden	2017	0.2800	0.9245	0.9979	0.9987	0.9982	0.9918
Hungary	2017	0.2810	0.9270	0.9979	0.9989	0.9983	0.9920
Germany	2017	0.2910	0.9346	0.9974	0.9989	0.9979	0.9921
Poland	2017	0.2920	0.9349	0.9971	0.9987	0.9977	0.9924
France	2017	0.2930	0.9295	0.9950	0.9978	0.9958	0.9948
Switzerland	2017	0.3010	0.9394	0.9959	0.9983	0.9966	0.9934
Ireland	2017	0.3060	0.9464	0.9966	0.9990	0.9974	0.9926
Canada	2017	0.3090	0.9532	0.9983	0.9988	0.9986	0.9885
Luxembourg	2017	0.3090	0.9524	0.9978	0.9991	0.9983	0.9899
Estonia	2017	0.3160	0.9618	0.9991	0.9989	0.9993	0.9857
Italy	2017	0.3270	0.9639	0.9978	0.9982	0.9981	0.9869

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The bold numbers (if any) indicate that Paul and Shankar [3]’s functional form is superior to the other existing widely used functional forms.

Table 7. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on the estimated Gini index.

Country	Year	Observed Gini index	Estimated Gini index
Country	Year	Observed Gini index	PS	KP	A	C	Pareto
Slovakia	2017	0.2320	0.4180	0.2242	0.2227	0.2239	0.2171
Slovenia	2017	0.2370	0.4180	0.2315	0.2305	0.2313	0.2250
Czech Republic	2017	0.2450	0.4180	0.2386	0.2389	0.2387	0.2351
Finland	2017	0.2530	0.4180	0.2470	0.2470	0.2470	0.2426
Belgium	2017	0.2600	0.4180	0.2540	0.2526	0.2538	0.2463
Norway	2017	0.2610	0.4180	0.2520	0.2514	0.2518	0.2467
Netherlands	2017	0.2710	0.4180	0.2628	0.2622	0.2627	0.2568
Denmark	2017	0.2760	0.4180	0.2646	0.2644	0.2646	0.2599
Austria	2017	0.2790	0.4180	0.2711	0.2701	0.2709	0.2644
Sweden	2017	0.2800	0.4180	0.2709	0.2697	0.2706	0.2639
Hungary	2017	0.2810	0.4180	0.2739	0.2728	0.2737	0.2669
Germany	2017	0.2910	0.4180	0.2833	0.2824	0.2831	0.2764
Poland	2017	0.2920	0.4180	0.2843	0.2835	0.2842	0.2777
France	2017	0.2930	0.4180	0.2834	0.2837	0.2834	0.2794
Switzerland	2017	0.3010	0.4180	0.2927	0.2924	0.2926	0.2871
Ireland	2017	0.3060	0.4180	0.3002	0.2995	0.3001	0.2933
Canada	2017	0.3090	0.4180	0.3027	0.3004	0.3022	0.2929
Luxembourg	2017	0.3090	0.4180	0.3037	0.3019	0.3034	0.2948
Estonia	2017	0.3160	0.4180	0.3120	0.3087	0.3114	0.2999
Italy	2017	0.3270	0.4180	0.3183	0.3156	0.3178	0.3079

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The bold numbers (if any) indicate that Paul and Shankar [3]’s functional form is superior to the other existing widely used functional forms.

Table 5. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on MSE.

Country	Year	Observed Gini index	Mean-squared error (MSE)
Country	Year	Observed Gini index	PS	KP	A	C	Pareto
Slovakia	2017	0.2320	0.0012	0.0001	0.0001	0.0001	0.0003
Slovenia	2017	0.2370	0.0011	0.0001	0.0001	0.0001	0.0003
Czech Republic	2017	0.2450	0.0011	0.0002	0.0001	0.0002	0.0002
Finland	2017	0.2530	0.0010	0.0002	0.0001	0.0002	0.0002
Belgium	2017	0.2600	0.0009	0.0001	0.0001	0.0001	0.0004
Norway	2017	0.2610	0.0010	0.0002	0.0002	0.0002	0.0003
Netherlands	2017	0.2710	0.0008	0.0002	0.0001	0.0002	0.0003
Denmark	2017	0.2760	0.0009	0.0003	0.0002	0.0003	0.0002
Austria	2017	0.2790	0.0008	0.0002	0.0001	0.0002	0.0003
Sweden	2017	0.2800	0.0008	0.0002	0.0001	0.0002	0.0003
Hungary	2017	0.2810	0.0008	0.0002	0.0001	0.0002	0.0004
Germany	2017	0.2910	0.0007	0.0002	0.0001	0.0002	0.0003
Poland	2017	0.2920	0.0007	0.0002	0.0001	0.0002	0.0003
France	2017	0.2930	0.0009	0.0004	0.0002	0.0004	0.0002
Switzerland	2017	0.3010	0.0008	0.0004	0.0002	0.0003	0.0003
Ireland	2017	0.3060	0.0006	0.0003	0.0001	0.0002	0.0004
Canada	2017	0.3090	0.0005	0.0002	0.0001	0.0002	0.0005
Luxembourg	2017	0.3090	0.0006	0.0002	0.0001	0.0002	0.0005
Estonia	2017	0.3160	0.0004	0.0001	0.0001	0.0001	0.0007
Italy	2017	0.3270	0.0005	0.0002	0.0001	0.0002	0.0006

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The bold numbers (if any) indicate that Paul and Shankar [3]’s functional form is superior to the other existing widely used functional forms.

The overall results in Tables 4–6 suggest that, on the criteria of R², MSE, and IIM, all other popular functional forms perform far better than the functional form proposed by Paul and Shankar [3]. With regard to the value of the estimated Gini index, all results in Table 7 clearly confirm the first limitation of Paul and Shankar [3]’s functional form in that when the observed Gini index is lower than the critical threshold, using Paul and Shankar [3]’s functional form to estimate the Lorenz curve and compute the Gini index would result in the estimated Gini index being equal to 0.4180. Provided that the parameter γ cannot take value below 0, this study sets the value of estimated parameter γ to be equal to 0.0000001 which would result in the estimated value of the Gini index to be 0.4180 for all 20 OECD countries. The results in Table 7 also show that the values of the estimated Gini index calculated using the other popular functional forms, namely, Kakwani and Podder [5], Aggarwal [9], Chotikapanich [16], and a functional form implied by Pareto distribution are much closer to the actual observations than those calculated using Paul and Shankar [3]’s functional specification. The performance comparison between Paul and Shankar [3]’s functional forms and the other existing widely used functional forms considered in their study when the observed Gini index is less than the critical threshold of 0.4180 is also illustrated in Fig 2.

Table 6. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on IIM.

Country	Year	Observed Gini index	Information inaccuracy measure (IIM)
Country	Year	Observed Gini index	PS	KP	A	C	Pareto
Slovakia	2017	0.2320	0.1014	0.0077	0.0087	0.0079	0.0201
Slovenia	2017	0.2370	0.1045	0.0046	0.0044	0.0045	0.0148
Czech Republic	2017	0.2450	0.1078	0.0050	0.0025	0.0043	0.0062
Finland	2017	0.2530	0.1016	0.0056	0.0032	0.0049	0.0083
Belgium	2017	0.2600	0.0850	0.0041	0.0040	0.0040	0.0168
Norway	2017	0.2610	0.0885	0.0095	0.0084	0.0092	0.0150
Netherlands	2017	0.2710	0.0845	0.0070	0.0052	0.0065	0.0139
Denmark	2017	0.2760	0.0854	0.0106	0.0082	0.0100	0.0131
Austria	2017	0.2790	0.0762	0.0089	0.0077	0.0086	0.0180
Sweden	2017	0.2800	0.0753	0.0076	0.0065	0.0073	0.0168
Hungary	2017	0.2810	0.0751	0.0084	0.0070	0.0080	0.0182
Germany	2017	0.2910	0.0710	0.0077	0.0056	0.0071	0.0162
Poland	2017	0.2920	0.0704	0.0085	0.0061	0.0078	0.0164
France	2017	0.2930	0.0825	0.0142	0.0095	0.0130	0.0119
Switzerland	2017	0.3010	0.0707	0.0102	0.0062	0.0091	0.0127
Ireland	2017	0.3060	0.0693	0.0093	0.0052	0.0081	0.0158
Canada	2017	0.3090	0.0518	0.0053	0.0049	0.0050	0.0229
Luxembourg	2017	0.3090	0.0546	0.0046	0.0029	0.0040	0.0188
Estonia	2017	0.3160	0.0429	0.0029	0.0040	0.0029	0.0297
Italy	2017	0.3270	0.0423	0.0099	0.0102	0.0098	0.0296

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The bold numbers (if any) indicate that Paul and Shankar [3]’s functional form is superior to the other existing widely used functional forms.

Fig 2 — (A) R². (B) MSE. (C) IIM. (D) Estimated Gini index.

The performance comparison between Paul and Shankar [3]’s functional form and the other popular functional forms when the upper part of income distribution has a long-tailed property

To demonstrate the second limitation of Paul and Shankar [3]’s functional specification, this study employs the data on income shares by decile of the other 20 different countries around the world from the UNU-WIID [40], the NESDC [41], and the World Bank [42] to estimate the Lorenz curve. Table A2 in S1 Appendix reports the estimated values of parameters for each functional form. We then evaluate the performance of Paul and Shankar [3]’s functional form as well as those of the other existing widely used functional forms considered in their study based on R², MSE, IIM, and the estimated Gini index. Our results are reported in Tables 8–11.

Table 8. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on R².

Country	Year	Income share of the top 20%	Coefficient of determination (R²)
Country	Year	Income share of the top 20%	PS	KP	A	C	Pareto
Thailand	2017	51.12	0.9805	0.9875	0.9978	0.9899	0.9888
Malawi	2017	51.67	0.9608	0.9724	0.9911	0.9762	0.9969
Ecuador	2015	51.90	0.9801	0.9866	0.9977	0.9891	0.9888
Benin	2015	52.13	0.9811	0.9858	0.9958	0.9880	0.9862
Philippines	2012	52.68	0.9788	0.9855	0.9980	0.9883	0.9896
Sri Lanka	2010	54.40	0.9743	0.9795	0.9953	0.9826	0.9909
Nicaragua	2014	54.50	0.9772	0.9817	0.9962	0.9847	0.9893
Bangladesh	2016	54.75	0.9682	0.9736	0.9926	0.9772	0.9928
Costa Rica	2016	54.90	0.9842	0.9878	0.9986	0.9902	0.9847
Saint Lucia	2016	55.39	0.9857	0.9883	0.9982	0.9906	0.9827
Paraguay	2016	55.41	0.9824	0.9855	0.9978	0.9882	0.9857
Mexico	2016	55.70	0.9738	0.9783	0.9958	0.9817	0.9912
India	2012	56.38	0.9810	0.9839	0.9979	0.9868	0.9866
Eswatini	2010	56.73	0.9813	0.9842	0.9984	0.9871	0.9865
Colombia	2015	56.80	0.9815	0.9838	0.9977	0.9866	0.9859
Brazil	2018	58.60	0.9784	0.9797	0.9969	0.9829	0.9876
Egypt	2013	58.78	0.9726	0.9742	0.9953	0.9779	0.9908
Mozambique	2015	59.53	0.9647	0.9664	0.9929	0.9708	0.9943
Honduras	2012	60.10	0.9728	0.9734	0.9957	0.9772	0.9908
Namibia	2016	63.74	0.9810	0.9790	0.9985	0.9822	0.9854

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The bold numbers (if any) indicate that Paul and Shankar [3]’s functional form is superior to the other existing widely used functional forms.

Table 11. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on the estimated Gini index.

Country	Year	Income share of the top 20%	Observed Gini index	Estimated Gini index
Country	Year	Income share of the top 20%	Observed Gini index	PS	KP	A	C	Pareto
Thailand	2017	51.12	0.4528	0.4669	0.4507	0.4469	0.4496	0.4390
Malawi	2017	51.67	0.4470	0.4600	0.4391	0.4404	0.4385	0.4345
Ecuador	2015	51.90	0.4620	0.4754	0.4594	0.4554	0.4582	0.4475
Benin	2015	52.13	0.4780	0.4842	0.4700	0.4656	0.4687	0.4585
Philippines	2012	52.68	0.4649	0.4812	0.4649	0.4607	0.4636	0.4526
Sri Lanka	2010	54.40	0.4900	0.5020	0.4865	0.4828	0.4850	0.4758
Nicaragua	2014	54.50	0.4950	0.5045	0.4898	0.4853	0.4883	0.4781
Bangladesh	2016	54.75	0.4980	0.5045	0.4891	0.4869	0.4877	0.4805
Costa Rica	2016	54.90	0.5000	0.5112	0.4982	0.4910	0.4965	0.4828
Saint Lucia	2016	55.39	0.5123	0.5191	0.5074	0.4994	0.5056	0.4915
Paraguay	2016	55.41	0.5100	0.5185	0.5058	0.4988	0.5040	0.4911
Mexico	2016	55.70	0.5040	0.5151	0.5001	0.4955	0.4984	0.4882
India	2012	56.38	0.5150	0.5262	0.5135	0.5063	0.5116	0.4986
Eswatini	2010	56.73	0.5145	0.5294	0.5167	0.5090	0.5147	0.5011
Colombia	2015	56.80	0.5240	0.5342	0.5222	0.5145	0.5202	0.5071
Brazil	2018	58.60	0.5240	0.5495	0.5380	0.5301	0.5358	0.5231
Egypt	2013	58.78	0.5400	0.5505	0.5378	0.5313	0.5356	0.5245
Mozambique	2015	59.53	0.5400	0.5545	0.5405	0.5352	0.5382	0.5283
Honduras	2012	60.10	0.5520	0.5633	0.5515	0.5437	0.5490	0.5369
Namibia	2016	63.74	0.5907	0.6052	0.5983	0.5846	0.5953	0.5791

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The bold numbers (if any) indicate that Paul and Shankar [3]’s functional form is superior to the other existing widely used functional forms.

The overall results in Tables 8–11 show that Paul and Shankar [3]’s functional form is, by and large, outperformed by the other existing widely used functional forms, namely, Kakwani and Podder [5], Aggarwal [9], Chotikapanich [16], and a functional form implied by Pareto distribution. The results in Table 8 show that, on the basis of R², there are only 2 out of 100 cases where Paul and Shankar [3]’s functional form performs better than the other existing widely used functional forms. Recall that Paul and Shankar [3] show that, on the criteria of MSE and IIM, their functional form fits the income data of Australia better than the other existing widely used functional forms considered in their study. However, using the income data of the other 20 countries and the same performance assessment criteria as those used in Paul and Shankar [3]’s study, the results in Tables 9 and 10 indicate that their functional form is superior to the other existing widely used functional forms in only 9 out of 100 cases based on MSE and only 2 out of 100 cases based on IIM, respectively. Lastly, on the criterion of the estimated Gini index as measured by the absolute value of the difference between the estimated Gini index and the observed Gini index, the results in Table 11 show that the functional form proposed by Paul and Shankar [3] outperforms the other popular functional specifications in 22 out of 100 cases. Fig 3 illustrates the performance comparison between Paul and Shankar [3]’s functional form and the other existing widely used functional forms considered in their study when the income data has a long-tailed property.

Table 9. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on MSE.

Country	Year	Income share of the top 20%	Mean-squared error (MSE)
Country	Year	Income share of the top 20%	PS	KP	A	C	Pareto
Thailand	2017	51.12	0.0013	0.0011	0.0003	0.0009	0.0006
Malawi	2017	51.67	0.0024	0.0021	0.0009	0.0019	0.0001
Ecuador	2015	51.90	0.0013	0.0011	0.0003	0.0010	0.0006
Benin	2015	52.13	0.0016	0.0015	0.0005	0.0013	0.0005
Philippines	2012	52.68	0.0013	0.0011	0.0002	0.0010	0.0006
Sri Lanka	2010	54.40	0.0019	0.0018	0.0006	0.0017	0.0004
Nicaragua	2014	54.50	0.0017	0.0017	0.0005	0.0015	0.0005
Bangladesh	2016	54.75	0.0025	0.0024	0.0009	0.0022	0.0003
Costa Rica	2016	54.90	0.0011	0.0010	0.0002	0.0009	0.0009
Saint Lucia	2016	55.39	0.0011	0.0011	0.0002	0.0009	0.0010
Paraguay	2016	55.41	0.0013	0.0013	0.0003	0.0012	0.0008
Mexico	2016	55.70	0.0019	0.0018	0.0005	0.0016	0.0004
India	2012	56.38	0.00137	0.00138	0.0003	0.0012	0.0008
Eswatini	2010	56.73	0.00127	0.00129	0.0002	0.0011	0.0008
Colombia	2015	56.80	0.00145	0.00149	0.0003	0.0013	0.0008
Brazil	2018	58.60	0.0017	0.0018	0.0004	0.0017	0.0006
Egypt	2013	58.78	0.0021	0.0023	0.0006	0.0021	0.0004
Mozambique	2015	59.53	0.0026	0.0028	0.0008	0.0025	0.0003
Honduras	2012	60.10	0.0021	0.0023	0.0005	0.0021	0.0005
Namibia	2016	63.74	0.0015	0.0018	0.0002	0.0016	0.0010

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The bold numbers (if any) indicate that Paul and Shankar [3]’s functional form is superior to the other existing widely used functional forms.

Table 10. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on IIM.

Country	Year	Income share of the top 20%	Information inaccuracy measure (IIM)
Country	Year	Income share of the top 20%	PS	KP	A	C	Pareto
Thailand	2017	51.12	0.0494	0.0280	0.0085	0.0235	0.0237
Malawi	2017	51.67	0.0930	0.0577	0.0226	0.0505	0.0064
Ecuador	2015	51.90	0.0475	0.0278	0.0068	0.0230	0.0225
Benin	2015	52.13	0.0464	0.0377	0.0210	0.0346	0.0338
Philippines	2012	52.68	0.0556	0.0312	0.0064	0.0254	0.0223
Sri Lanka	2010	54.40	0.0631	0.0466	0.0162	0.0406	0.0202
Nicaragua	2014	54.50	0.0545	0.0411	0.0139	0.0357	0.0237
Bangladesh	2016	54.75	0.0750	0.0617	0.0266	0.0554	0.0190
Costa Rica	2016	54.90	0.0416	0.0291	0.0079	0.0241	0.0360
Saint Lucia	2016	55.39	0.0352	0.0284	0.0107	0.0244	0.0418
Paraguay	2016	55.41	0.0448	0.0340	0.0101	0.0288	0.0325
Mexico	2016	55.70	0.0648	0.0482	0.0134	0.0411	0.0195
India	2012	56.38	0.0475	0.0368	0.0092	0.0309	0.0312
Eswatini	2010	56.73	0.0491	0.0356	0.0061	0.0290	0.0304
Colombia	2015	56.80	0.0458	0.0372	0.0097	0.0314	0.0317
Brazil	2018	58.60	0.0551	0.0486	0.0141	0.0419	0.0309
Egypt	2013	58.78	0.0675	0.0593	0.0165	0.0514	0.0220
Mozambique	2015	59.53	0.0871	0.0747	0.0195	0.0645	0.0130
Honduras	2012	60.10	0.0697	0.0623	0.0153	0.0535	0.0231
Namibia	2016	63.74	0.0522	0.0524	0.0063	0.0432	0.0365

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The bold numbers (if any) indicate that Paul and Shankar [3]’s functional form is superior to the other existing widely used functional forms.

Fig 3 — (A) R². (B) MSE. (C) IIM. (D) Estimated Gini index.

Although one of the aims to develop a functional form for the Lorenz curve is to compute inequality measures such as the Gini index that would be close to the actual observation [3], we would like to note that the shape of income distribution is also important and should be taken into consideration when evaluating the performance of different functional forms. This is because it is possible that different functional forms could have the estimated Gini index that is identical to its observed value but the shapes of distributions are different. By using the income data of Thailand, this point could be demonstrated by finding the value of the estimated parameter of functional form proposed by Paul and Shankar [3] and those of the other popular functional forms that result in the same value of the estimated Gini index. The results are reported in Table 12.

Table 12. The income data of Thailand are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms all of which share the same value of the estimated Gini index which is equal to 0.4528.

	PS	KP	A	C	Pareto
Estimated parameter	γ = 0.2273	δ = 3.5401	θ = 0.3482	k = 3.1328	α = 2.6547
R ²	0.9797	0.9875	0.9977	0.9899	0.9882
MSE	0.0014	0.0010	0.0003	0.0009	0.0008
IIM	0.0498	0.0281	0.0083	0.0236	0.0260
Estimated Gini index	0.4528	0.4528	0.4528	0.4528	0.4528
Observed Gini index	0.4528

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution.

The results in Table 12 show that the estimated parameters of all functional forms are different but all of them result in the same estimated Gini index which is 0.4528. Fig 4 illustrates the actual Lorenz plot and the estimated Lorenz curves using Paul and Shankar [3]’s functional form and the other existing widely used functional forms all of which share the same estimated Gini index.

Given that there are infinite number of the Lorenz curves that could yield the same value of the Gini index, a good functional form should be able to describe the shape of income distributions through the changes in the value of parameters and the fact that it fits the actual data would be the main reason for its choice [35]. Therefore, the shape of income distribution is relatively more important than how close the estimated Gini index is to its actual observation and the priority should be given to the values of goodness-of-fit statistics when assessing the performance of different functional forms. Viewed this way, the performance of Paul and Shankar [3]’s functional form compared to the other existing widely used functional forms considered in their study on the basis of the estimated Gini index, is at best mixed. It does not rank second closely behind that of Aggarwal [9] as for the case when using the income data of Australia reported in their study.

The performance comparison on the basis of the estimated Gini index between single-parameter functional forms and a functional form that contains more than one parameter

Although the key advantage of any single-parameter functional form is its parsimony, we would like to point out that, in order to evaluate the performance of a functional form for the Lorenz curve on the criteria of the estimated Gini index, besides the issues of the shape of income distribution and the goodness-of-fit statistics as discussed above, a functional form that contains more than one parameter is required since the curvature of the estimated Lorenz curve has to be adjustable so that it would fit the actual observations as much as possible while keeping the value of the estimated Gini index constant. As noted by Dagum [35], this cannot be done by using a single-parameter functional form since the estimated Gini index would be a monotonic function of it.

In order to illustrate that a functional form that contains more than one parameter could generally be used to compute the value of the estimated Gini index that would be closer to the actual observations than single-parameter functional forms, this study employs a two-parameter functional form proposed by Sitthiyot and Holasut [2] to estimate the Lorenz curve using the same set of data on decile income shares of 40 countries. The reason that we choose Sitthiyot and Holasut [2]’s functional specification is not only because their specified functional form has several advantages over the functional form proposed by Kakwani [7] which, according to Cheong [43] and Tanak et al. [31], has the best overall performance among all popular functional forms for the Lorenz curves but also has an explicit mathematical solution for the Gini index. By using the same notations for the cumulative normalized rank of income (p) and the cumulative normalized income L(p) as denoted in Materials and Methods and also let a and b be parameters, the specified functional form for the Lorenz curve proposed by Sitthiyot and Holasut [2] is as follows:

L (p) = {(1 - a) * p}^{b} + a * (1 - {(1 - p)}^{\frac{1}{b}}),

(10)

0 \leq a \leq 1,

1 \leq b

This specified functional form satisfies all necessary and sufficient conditions for the Lorenz curve which are: L (0) = 0, L (1) = 1, L(p) is convex, $\frac{d L}{d p} \geq 0$ , and $\frac{d^{2} L}{{d p}^{2}} \geq 0$ . According to Sitthiyot and Holasut [2], the parameter b represents the degree of inequality in income distribution as measured by the Gini index. The parameter a is the weight that controls the curvature of the Lorenz curve such that the Gini index remains constant since, for a given value of parameter b, there are infinite values of parameter a that could give an identical value of the Gini index. The parameter a thus provides the information about countries’ income shares in case their Lorenz curves intersect. In addition, the shape of the estimated Lorenz curve could be conveniently adjusted through the change in parameter a while the value of the Gini index is kept constant. The values of parameters a and b estimated using Sitthiyot and Holasut [2]’s functional form for the Lorenz curve are reported in Tables A1 and A2 in S1 Appendix. The estimated Lorenz curves of 40 countries are then used to calculate the values of the estimated Gini index which in turn would be compared to those computed using single-parameter functional forms considered in Paul and Shankar [3]’s study. Tables 13 and 14 report our results.

Table 13. The income data of 20 OECD countries are used to demonstrate the performance of single-parameter functional forms and that of a two-parameter functional form proposed by Sitthiyot and Holasut [2] on the criterion of estimated Gini index.

Country	Year	Observed Gini index	Estimated Gini index
Country	Year	Observed Gini index	PS	KP	A	C	Pareto	SH
Slovakia	2017	0.2320	0.4180	0.2242	0.2227	0.2239	0.2171	0.2300
Slovenia	2017	0.2370	0.4180	0.2315	0.2305	0.2313	0.2250	0.2372
Czech Republic	2017	0.2450	0.4180	0.2386	0.2389	0.2387	0.2351	0.2445
Finland	2017	0.2530	0.4180	0.2470	0.2470	0.2470	0.2426	0.2530
Belgium	2017	0.2600	0.4180	0.2540	0.2526	0.2538	0.2463	0.2602
Norway	2017	0.2610	0.4180	0.2520	0.2514	0.2518	0.2467	0.2583
Netherlands	2017	0.2710	0.4180	0.2628	0.2622	0.2627	0.2568	0.2690
Denmark	2017	0.2760	0.4180	0.2646	0.2644	0.2646	0.2599	0.2711
Austria	2017	0.2790	0.4180	0.2711	0.2701	0.2709	0.2644	0.2776
Sweden	2017	0.2800	0.4180	0.2709	0.2697	0.2706	0.2639	0.2773
Hungary	2017	0.2810	0.4180	0.2739	0.2728	0.2737	0.2669	0.2802
Germany	2017	0.2910	0.4180	0.2833	0.2824	0.2831	0.2764	0.2898
Poland	2017	0.2920	0.4180	0.2843	0.2835	0.2842	0.2777	0.2907
France	2017	0.2930	0.4180	0.2834	0.2837	0.2834	0.2794	0.2899
Switzerland	2017	0.3010	0.4180	0.2927	0.2924	0.2926	0.2871	0.2992
Ireland	2017	0.3060	0.4180	0.3002	0.2995	0.3001	0.2933	0.3065
Canada	2017	0.3090	0.4180	0.3027	0.3004	0.3022	0.2929	0.3093
Luxembourg	2017	0.3090	0.4180	0.3037	0.3019	0.3034	0.2948	0.3102
Estonia	2017	0.3160	0.4180	0.3120	0.3087	0.3114	0.2999	0.3187
Italy	2017	0.3270	0.4180	0.3183	0.3156	0.3178	0.3079	0.3251

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution; SH, Sitthiyot and Holasut [2].

The bold numbers (if any) indicate that the single-parameter functional forms are superior to the two-parameter function form.

Table 14. The income data of the other 20 countries are used to demonstrate the performance of single-parameter functional forms and that of a two-parameter functional form proposed by Sitthiyot and Holasut [2] on the criterion of estimated Gini index.

Country	Year	Observed Gini index	Estimated Gini index
Country	Year	Observed Gini index	PS	KP	A	C	Pareto	SH
Thailand	2017	0.4528	0.4669	0.4507	0.4469	0.4496	0.4390	0.4548
Malawi	2017	0.4470	0.4600	0.4391	0.4404	0.4385	0.4345	0.4425
Ecuador	2015	0.4620	0.4754	0.4594	0.4554	0.4582	0.4475	0.4632
Benin	2015	0.4780	0.4842	0.4700	0.4656	0.4687	0.4585	0.4742
Philippines	2012	0.4649	0.4812	0.4649	0.4607	0.4636	0.4526	0.4680
Sri Lanka	2010	0.4900	0.5020	0.4865	0.4828	0.4850	0.4758	0.4889
Nicaragua	2014	0.4950	0.5045	0.4898	0.4853	0.4883	0.4781	0.4924
Bangladesh	2016	0.4980	0.5045	0.4891	0.4869	0.4877	0.4805	0.4913
Costa Rica	2016	0.5000	0.5112	0.4982	0.4910	0.4965	0.4828	0.5008
Saint Lucia	2016	0.5123	0.5191	0.5074	0.4994	0.5056	0.4915	0.5100
Paraguay	2016	0.5100	0.5185	0.5058	0.4988	0.5040	0.4911	0.5080
Mexico	2016	0.5040	0.5151	0.5001	0.4955	0.4984	0.4882	0.5014
India	2012	0.5150	0.5262	0.5135	0.5063	0.5116	0.4986	0.5149
Eswatini	2010	0.5145	0.5294	0.5167	0.5090	0.5147	0.5011	0.5178
Colombia	2015	0.5240	0.5342	0.5222	0.5145	0.5202	0.5071	0.5235
Brazil	2018	0.5240	0.5495	0.5380	0.5301	0.5358	0.5231	0.5379
Egypt	2013	0.5400	0.5505	0.5378	0.5313	0.5356	0.5245	0.5370
Mozambique	2015	0.5400	0.5545	0.5405	0.5352	0.5382	0.5283	0.5383
Honduras	2012	0.5520	0.5633	0.5515	0.5437	0.5490	0.5369	0.5495
Namibia	2016	0.5907	0.6052	0.5983	0.5846	0.5953	0.5791	0.5943

Open in a new tab

PS, Paul and Shankar [3]; KP, Kakwani and Podder [5]; A, Aggarwal [9]; C, Chotikapanich [16]; Pareto, Functional form implied by Pareto distribution; SH, Sitthiyot and Holasut [2].

The bold numbers (if any) indicate that the single-parameter functional forms are superior to the two-parameter function form.

The results on the estimated Gini index shown in Tables 13 and 14 indicate that the two-parameter functional form proposed by Sitthiyot and Holasut [2], by and large, outperforms single-parameter functional forms used in Paul and Shankar [3]’s study. As shown in Tables 13 and 14, there are only 12 cases out of 200 cases where the single-parameter functional forms give the values of the estimated Gini index that are closer to their actual observations than the two-parameter functional form proposed by Sitthiyot and Holasut [2], all of which are from countries whose upper part of the income distribution has a long-tailed property. These results confirm that the two-parameter functional form, in general, performs better than one-parameter functional forms on the criterion of the estimated Gini index.

Conclusions

Paul and Shankar [3] propose an alternative single-parameter functional form for the Lorenz curve and use the Australian income data between 2001 and 2010 that have the Gini index between 0.4442 and 0.4633 to show that their functional form is superior to the other existing widely used functional forms. Given that previous studies pointed out that an excellent performance of any parametric functional form for the Lorenz curve that is based on a single country case study and a limited range of distribution must be treated with extreme caution [32–34], this study demonstrates that when the observed Gini index is lower than the critical threshold which is found to be 0.4180 as for the cases of 20 OECD countries, the functional form proposed by Paul and Shankar [3] not only has a serious limitation in that it fails to fit the actual observations well but also is outperformed by the other existing widely used functional forms. This study also shows that when the upper part of income distribution has a long-tailed property as for the cases of the other 20 different countries, the functional form proposed by Paul and Shankar [3] is, by and large, outperformed by the other existing widely used functional forms considered in their study.

This study would like to note that, for future evaluation of any single parametric functional form for the Lorenz curve, the goodness-of-fit statistics should be given the priority since the shape of income distribution is more important than the estimated Gini index. As noted by Dagum [35], whenever a functional form contains a single inequality parameter, the estimated Gini index would be a monotonic function of it, and hence, it would not be able to detect the intersecting Lorenz curves. To evaluate the performance on the criterion of the estimated Gini index, a functional form that contains more than one parameter is required since the curvature of the estimated Lorenz curve has to be adjustable so that it would fit the actual observations as much as possible while keeping the value of the estimated Gini index the same. To illustrate this point, this study employs the two-parameter functional form developed by Sitthiyot and Holasut [2] to estimate the Lorenz curve and compute the value of the estimated Gini index using the grouped data on income shares of 40 countries. The overall results indicate that the two-parameter functional form, by and large, performs better than the single-parameter functional forms considered in Paul and Shankar [3]’s study on the criterion of the value of the estimated Gini index.

Last but not least, before using any parametric functional form to estimate the Lorenz curve and calculate the Gini index, policymakers should be aware that the goodness-of-fit statistical measures, the shape of the estimated Lorenz curve, and the estimated Gini index should be taken into consideration altogether. The key lesson learned from Paul and Shankar [3]’s study is that their results should be treated with great caution since all of them rely on a single country case study and a narrow range of distribution [32–34]. As noted by Dagum [35], a good functional form for estimating the Lorenz curve must describe the shape of income distributions of different countries, regions, socioeconomic groups, and in different time periods through the changes in parameter values. Given that existing studies on the relationship between inequality measures and financial and/or socioeconomic variables rely on the accuracy of inequality measures as discussed in Introduction and many more to come in the future, if the choice of parametric functional form for the Lorenz curve is not a valid candidate for representing the income distribution, the estimates on the income shares and inequality measures could be severely affected by misspecification bias [32]. Thus, before applying any functional form for estimating the Lorenz curve, policymakers should carefully check whether or not it satisfies the aforementioned criteria suggested by Dagum [35]. This is because using a functional form that does not fit actual observations could adversely affect inequality measures and income distribution policies.

Supporting information

S1 Appendix

(DOCX)

Click here for additional data file.^{(24.2KB, docx)}

Acknowledgments

The authors are grateful to Dr. Suradit Holasut for guidance and comments.

Data Availability

All data analyzed during this study are included in this article and can be accessed from the Office of the National Economic and Social Development Council (NESDC), Thailand, the United Nations University World Institute for Development Economics Research (UNU-WIDER), and the World Bank all of which are listed in References The links to the data sources are as follows: 1. https://www.nesdc.go.th/main.php?filename=social 2. https://www4.wider.unu.edu/ 3. https://data.worldbank.org/.

Funding Statement

The authors received no specific funding for this work.

References

1.Lorenz MO. Methods of measuring the concentration of wealth. Pub Am Stat Assoc. 1905;9: 209–219. [Google Scholar]
2.Sitthiyot T, Holasut K. A simple method for estimating the Lorenz curve. Humanit Soc Sci Commun. 2021;8(268): 1–9. Available from: 10.1057/s41599-021-00948-x [DOI] [Google Scholar]
3.Paul S, Shankar S. An alternative single parameter functional form for Lorenz curve. Empir Econ. 2020;59: 1393–1402. 10.1007/s00181-019-01715-3 [DOI] [Google Scholar]
4.Gini C. On the measurement of concentration and variability of characters (English translation from Italian by de Santis F.). Metron. 2005;63: 3–38. [Google Scholar]
5.Kakwani NC, Podder N. On the estimation of Lorenz curves from grouped observations. Int Econ Rev. 1973;14: 278–292. [Google Scholar]
6.Kakwani NC, Nripesh P. Efficient estimation of the Lorenz curve and associated inequality measures from grouped observations. Econometrica. 1976;44: 137–148. [Google Scholar]
7.Kakwani NC. On a class of poverty measures. Econometrica. 1980;48: 437–446. [Google Scholar]
8.Rasche RH, Gaffney JM, Koo AYC, Obst N. Functional forms for estimating the Lorenz curve. Econometrica. 1980;48: 1061–1062. [Google Scholar]
9.Aggarwal V. On optimum aggregation of income distribution data. Sankhyā B. 1984;46: 343–355. [Google Scholar]
10.Gupta MR. Functional form for estimating the Lorenz curve. Econometrica. 1984;52: 1313–1314. [Google Scholar]
11.Arnold BC. A class of hyperbolic Lorenz curves. Sankhyā B. 1986;48: 427–436. [Google Scholar]
12.Rao ULG, Tam AY-P. An empirical study of selection and estimation of alternative models of the Lorenz curve. J Appl Stat. 1987;14: 275–280. 10.1080/02664768700000032 [DOI] [Google Scholar]
13.Villaseñor JA, Arnold BC. Elliptical Lorenz curves. J Econom. 1989;40: 327–338. [Google Scholar]
14.Basmann RL, Hayes K, Slottje D, Johnson J. A general functional form for approximating the Lorenz curve. J Econom. 1990;92: 727–744. [Google Scholar]
15.Ortega P, Martín G, Fernández A, Ladoux M, García A. A new functional form for estimating Lorenz curves. Rev Income Wealth. 1991;37: 447–452. [Google Scholar]
16.Chotikapanich D. A comparison of alternative functional forms for the Lorenz curve. Econ Lett. 1993;41: 129–138. [Google Scholar]
17.Ogwang T, Rao ULG. A new functional form for approximating the Lorenz curve. Econ Lett. 1996;52: 21–29. [Google Scholar]
18.Ogwang T, Rao ULG. Hybrid models of the Lorenz curve. Econ Lett. 2000;69: 39–44. [Google Scholar]
19.Ryu H, Slottje D. Two flexible functional forms for approximating the Lorenz curve. J Econom. 1996;72: 251–274. [Google Scholar]
20.Sarabia JM. A hierarchy of Lorenz curves based on the generalized Tukey’s lambda distribution. Econom Rev. 1997;16: 305–320. [Google Scholar]
21.Sarabia JM, Castillo E, Slottje D. An ordered family of Lorenz curves. J Econom. 1999;91: 43–60. [Google Scholar]
22.Sarabia JM, Castillo E, Slottje D. An exponential family of Lorenz curves. South Econ J. 2001;67: 748–756. [Google Scholar]
23.Sarabia JM, Pascual M. A class of Lorenz curves based on linear exponential loss functions. Commun Stat–Theory Methods. 2002;31: 925–942. [Google Scholar]
24.Rohde N. An alternative functional form for estimating the Lorenz curve. Econ Lett. 2009;105: 61–63. [Google Scholar]
25.Helene O. Fitting Lorenz curves. Econ Lett. 2010;108: 153–155. [Google Scholar]
26.Sarabia JM, Prieto F, Sarabia M. Revisiting a functional form for the Lorenz curve. Econ Lett. 2010;107: 249–252. [Google Scholar]
27.Sarabia JM, Prieto F, Jordá V. About the hyperbolic Lorenz curve. Econ Lett. 2015;136: 42–45. [Google Scholar]
28.Wang Z, Smyth R. A hybrid method for creating Lorenz curves. Econ Lett. 2015;133: 59–63. [Google Scholar]
29.Sarabia JM, Jordá V, Trueba C. The Lamé class of Lorenz curves. Commun in Stat–Theory Methods. 2017;46: 5311–5326. [Google Scholar]
30.Fellman J. Regression analyses of income inequality indices. Theor Econ Lett. 2018;8: 1793–1802. 10.4236/tel.2018.810117 [DOI] [Google Scholar]
31.Tanak AK, Mohtashami Borzadaran GR, Ahmadi J. New functional forms of Lorenz curves by maximizing Tsallis entropy of income share function under the constraint on generalized Gini index. Physica A. 2018;511: 280–288. [Google Scholar]
32.Jordá V, Sarabia JM, Jäntti M. Inequality measurement with grouped data: Parametric and non-parametric methods. J R Stat Soc Series A. 2021;184: 964–984. doi: 10.1111/rssa.12702 [DOI] [Google Scholar]
33.Cowell FA, Mehta F. The estimation and interpolation of inequality measures. Rev Econ Stud. 1982;49: 273–290. [Google Scholar]
34.Shorrocks AF, Wan G. Ungrouping income distributions: Synthesizing samples for inequality and poverty analysis. Research Paper No. 2008/16. United Nations University World Institute for Development Economics Research (UNU-WIDER). 2008. Available from: https://www.wider.unu.edu/publication/ungrouping-income-distributions [Google Scholar]
35.Dagum C. A new model of personal income distribution: Specification and estimation. In: Chotikapanich D, editor. Modeling income distributions and Lorenz curves. Economic studies in equality, social exclusion and well-being, vol 5. New York: Springer; 1977. pp. 3–25. 10.1007/978-0-387-72796-7_1 [DOI] [Google Scholar]
36.Pisula T, Mentel G, Brożyna J. Predicting bankruptcy of companies from the logistics sector operating in the Podkarpacie region. Mod Manag Rev. 2013;18: 113:133. [Google Scholar]
37.Kharlamova G, Stavytskyy A, Zarotiadis G. (2018). The impact of technological changes on income inequality: the EU states case study. J Int Stud. 2018; 11: 76–94. doi: 10.14254/2071-8330.2018/11-2/6 [DOI] [Google Scholar]
38.Bilan Y, Mishchuk H, Samoliuk N, Yurchyk H. Impact of income distribution on social and economic well-being of the state. Sustainability. 2020;12: 1–15. Available from https://www.mdpi.com/2071-1050/12/1/42935136666 [Google Scholar]
39.Tung LT. Can public debt harm social development? Evidence from the Asian-Pacific region. J Intl Stud. 2020;13: 48–61. doi: 10.14254/2071-8330.2020/13-2/4 [DOI] [Google Scholar]
40.The United Nations University World Institute for Development Economics Research (UNU-WIDER). World Income Inequality Database (WIID), Version date: 6 May 2020; 2020. [cited 2021 Jul. 25]. Available from: https://www4.wider.unu.edu/ [Google Scholar]
41.The Office of the National Economic and Social Development Council. Poverty and Income Distribution Statistics; 2021. [cited 2021 Jul. 18]. Available from: https://www.nesdc.go.th/main.php?filename=social [Google Scholar]
42.The World Bank. World Bank Open Data; 2020. [cited 2021. Jul. 22]. Available from: https://data.worldbank.org/ [Google Scholar]
43.Cheong KS. An empirical comparison of alternative functional forms for the Lorenz curve. Appl Econ Lett. 2002;9: 171–176. [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0287546.r001

Decision Letter 0

Yuriy Bilan

18 Nov 2022

PONE-D-22-23738An investigation of the performance of parametric functional forms for the Lorenz curvePLOS ONE

Dear Dr. Authors

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

After considering the reviewer's comments, the article will be suitable for publication, but major revision is needed

Please submit your revised manuscript by Jan 01 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Yuriy Bilan

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following in the Competing Interests section:

I have read the journal's policy and the authors of this manuscript have the following competing interests:

The findings reported in this manuscript contradict the results previously published in Paul and Shankar (2020).

Please confirm that this does not alter your adherence to all PLOS ONE policies on sharing data and materials, by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests). If there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include your updated Competing Interests statement in your cover letter; we will change the online submission form on your behalf.

Additional Editor Comments:

After considering the reviewer's comments, the article will be suitable for publication

but major revision is needed

Note from Staff Editor Hanna Landenmark (hlandenmark@plos.org): 1) Please note that there is no requirement to cite the specific articles suggested by the reviewers, but please take into account their request for further context for this study. 2) Please note that it is policy of PLOS ONE to invite a signed review from the authors of the disputed work; this review will be considered by the Academic Editor in the decision process, but the editor will be made aware of the competing interests (http://journals.plos.org/plosone/s/submission-guidelines#loc-manuscripts-disputing-published-work). This invitation will take place following the first round of independent peer review.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: There is actually no "Introduction" section. Currently, a review of the literature is carried out here, which should be updated according to the latest scientific results.

It is worth adding the relevance and purpose of the research.

Other comments are given in the article.

Reviewer #2: The article is interesting in terms of topics, but not entirely correctly implemented in terms of structure. It is noticeable at the beginning that there is no review of world research in this area, i.e. the so-called Review of the literature. Throughout the article, the authors refer to only 15 items !!! This needs to be completely rewritten and corrected. Since you analyze, for example, the Gini index, it is worth referring to such items as:

1. Statistical methods of the bankruptcy prediction in the logistics sector in Poland and Slovakia, Transformations in Business and Economics Vol. 15 (No 1 (37)): 93-114;

2. PREDICTING BANKRUPTCY OF COMPANIES FROM THE LOGISTICS SECTOR OPERATING IN THE PODKARPACIE REGION; doi: 10.7862 / rz.2013.mmr.33

3. Evaluating the impact of GINI index and information gain on classification using decision tree classifier algorithm; doi: 10.14569 / ijacsa.2020.0110277

4. The gini index: A proposal for revision; doi: 10.24309 / recta.2020.21.1.01

or others.

Authors must also emphasize the so-called added value of research. What's new in these findings?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: PONE-D-22-23738.docx

Click here for additional data file.^{(803.3KB, docx)}

PLoS One. 2023 Jun 23;18(6):e0287546. doi: 10.1371/journal.pone.0287546.r002

Author response to Decision Letter 0

19 Dec 2022

Dear Reviewers #1 and #2,

We sincerely thank both Reviewers for pointing out several important concerns and providing useful suggestions. We have made a major revision by addressing all comments and suggestions made by both Reviewers and tried our best to incorporate them into our revised manuscript. We hope that our revised manuscript is clearer in all aspects that the Reviewers have concerned and/or suggested. Let us respond to the Reviewers’ concerns and suggestions as follows.

Reviewer #1:

- There is actually no "Introduction" section. Currently, a review of the literature is carried out here, which should be updated according to the latest scientific results.

We sincerely thank Reviewer #1 for comment on this very important point. We would like to inform Reviewer #1 that we have made a substantial revision in Introduction by providing definition of the Lorenz curve in paragraph 1 and discussing how the Gini index can be calculated from the Lorenz curve in paragraph 2 in our revised manuscript. We also explain in Introduction, paragraph 3 in our revised manuscript how the Lorenz curve can be estimated as well as provide the list of up-to-date studies that propose different parametric functional forms for estimating the Lorenz curve.

- It is worth adding the relevance and purpose of the research.

When justifying its relevance, it is worth pointing out which economic and social processes are affected by income inequality, why is it important to study it?

For example: Impact of Income Distribution on Social and Economic Well-Being of the State https://doi.org/10.3390/su12010429

In response to Reviewer #1’s suggestion to add the relevance and purpose of our study, we state the purpose of our study in Introduction, paragraph 4 in our revised manuscript which is to investigate an alternative single-parameter functional form for the Lorenz curve proposed by Paul and Shankar (2020) who show that their functional form outperforms the other 4 popular parametric functional forms for the Lorenz curve, namely, Kakwani and Podder (1973), Aggarwal (1984), Chotikapanich (1993), and a functional form implied by Pareto distribution.

In addition, we state the relevance and contribution of our study in Introduction, paragraph 5 in our revised manuscript that it is relevant and worthwhile to conduct an investigation to find out if we use grouped income data of other countries, the performance of Paul and Shankar (2020)’s functional form is still superior to the other existing widely used functional forms considered in their study. The findings from this investigation should also contribute as a check-and-balance not only for economics but also for other scientific disciplines that use the Lorenz curve to analyze size distributions of non-negative quantities and inequalities.

Since the main focus of our study is to investigate the performance of parametric functional forms for estimating the Lorenz curve and to examine whether or not those parametric functional forms have an explicit mathematical solution for the Gini index, we provide an additional justification for our study as noted by Reviewer #1 and also by PLOS ONE Staff Editor Dr. Hanna Landenmark by noting in Introduction, paragraph 5 and in Conclusions, paragraph 3 in our revised manuscript that various studies on the relationship between inequality measures and financial and/or socioeconomic variables such as Kharlamova et al. (2018), Bilan et al. (2020), and Tung (2020) as suggested by Reviewer #1 and Pisular et al. (2013) as suggested by Reviewer #2, rely on the accuracy of inequality measures that could possibly be derived from a parametric functional form for the Lorenz curve. If the choice of parametric functional form is not a valid candidate for representing the income distribution, the estimates on the income shares and inequality measures might be severely affected by misspecification bias. This justification is reiterated in Conclusions, paragraph 3 in our revised manuscript.

- In the literature review, it is worth investigating the relationship between income inequality and other parameters (social, economic, etc.).

For example: ANALYSIS OF THE RELATIONSHIP BETWEEN INCOME INEQUALITY AND SOCIAL VARIABLES: EVIDENCE FROM INDONESIA DOI: 10.14254/2071-789X.2021/14-1/7

THE RELATIONSHIP BETWEEN INCOME GROWTH AND INEQUALITY: EVIDENCE FROM AN ASIAN EMERGING ECONOMY DOI: 10.14254/2071-789X.2022/15-2/6

THE IMPACT OF TECHNOLOGICAL CHANGES ON INCOME INEQUALITY: THE EU STATES CASE STUDY DOI: 10.14254/2071-8330.2018/11-2/6

In response to the issue of investigating studies on the relationship between income inequality and other parameters as suggested by Reviewer #1, we went over those studies and find them very useful. However, given that the main focus of our study is to investigate the performance of parametric functional forms for estimating the Lorenz curve and to examine whether or not those parametric functional forms considered in Paul and Shankar (2020) study have a closed-form expression for the Gini index, we therefore include studies by Kharlamova et al. (2018), Bilan et al. (2020), Tung (2020) as suggested by Reviewer #1 and Pisular et al. (2013) as suggested by Reviewer #2 in Introduction, paragraph 5 in our revised manuscript in order to illustrate the importance of using the appropriate parametric functional form for estimating the Lorenz curve. We also mentioned the importance of this issue in Conclusions, paragraph 3 in our revised manuscript.

- Discussion should be separated into a separate section.

We follow Reviewer #1’s suggestion by deleting “discussion” in our revised manuscript.

- It is worth noting other publications in which the Gini index is considered.

For example: CAN PUBLIC DEBT HARM SOCIAL DEVELOPMENT? EVIDENCE FROM THE ASIAN-PACIFIC REGION doi:10.14254/2071-8330.2020/13-2/4

We follow Reviewer #1’s suggestion by including study by Tung (2020) in Introduction, paragraph 5 and reiterating its relevance to our study in Conclusions, paragraph 3 in our revised manuscript.

- It is necessary to update according to the latest research, to issue according to the requirements.

We sincerely thank Reviewer #1 for pointing this out. We cite the up-to-date studies that propose different parametric functional forms for estimating the Lorenz curve in Introduction, paragraph 3 and include them in References in our revised manuscript.

Reviewer #2:

- The article is interesting in terms of topics, but not entirely correctly implemented in terms of structure. It is noticeable at the beginning that there is no review of world research in this area, i.e. the so-called Review of the literature. Throughout the article, the authors refer to only 15 items !!! This needs to be completely rewritten and corrected. Since you analyze, for example, the Gini index, it is worth referring to such items as:

1. Statistical methods of the bankruptcy prediction in the logistics sector in Poland and Slovakia, Transformations in Business and Economics Vol. 15 (No 1 (37)): 93-114;

2. PREDICTING BANKRUPTCY OF COMPANIES FROM THE LOGISTICS SECTOR OPERATING IN THE PODKARPACIE REGION; doi: 10.7862 / rz.2013.mmr.33

3. Evaluating the impact of GINI index and information gain on classification using decision tree classifier algorithm; doi: 10.14569 / ijacsa.2020.0110277

4. The gini index: A proposal for revision; doi: 10.24309/recta.2020.21.1.01

or others.

We sincerely thanks Reviewer #2 for comment and suggestion regarding the review of world research in the area of parametric functional forms for estimating the Lorenz curve. In our revised manuscript, we have made a major revision by citing the up-to-date studies in Introduction, paragraph 3 and including those studies in References. We also rewrite Introduction by providing the definition of the Lorenz curve and its applications in paragraph 1. In addition, we explain how the Gini index can be derived from the Lorenz curve in Introduction, paragraph 2. In Introduction, paragraph 5 and Conclusions, paragraph 3, studies by Pisula et al. (2013) as suggested by Reviewer #2 and Kharlamova et al. (2018), Bilan et al. (2020), and Tung (2020) as suggested by Reviewer #1, all of which employ the Gini index in their analyses, are included as a part of our discussion on the importance of using an appropriate parametric functional form for estimating the Lorenz curve and calculating the Gini index.

- Authors must also emphasize the so-called added value of research. What's new in these findings?

In response to Reviewer #2’ s concern on the issue of the added value of our study, we state in Introduction, paragraph 5 in our revised manuscript that, given the superiority of Paul and Shankar (2020)’s functional form over the other existing widely used functional forms, it is therefore relevant and worthwhile to conduct an examination to find out if we use grouped income data of other countries, the performance of Paul and Shankar (2020)’s functional form is still superior to the other existing widely used functional forms considered in their study. The findings from our investigation should also contribute as a check-and-balance not only for economics but also for other scientific disciplines that use the Lorenz curve to analyze size distributions of non-negative quantities and inequalities. This is because using a functional form that does not fit the actual observations could adversely affect inequality measures and income distribution policies.

In addition to new findings reported in Results, we also summarize our main findings in Introduction, paragraphs 6 and 7 and in Conclusions, paragraphs 1 and 2 in our revised manuscript as suggested by Reviewer #2.

Attachment

Submitted filename: Response to Reviewers_R1.pdf

Click here for additional data file.^{(101.8KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0287546.r003

Decision Letter 1

Ayesha Afzal

8 Jun 2023

An investigation of the performance of parametric functional forms for the Lorenz curve

PONE-D-22-23738R1

Dear Dr. Sitthiyot,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Dr Ayesha Afzal

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #1: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Reviewer #1: The authors of this scientific article made appropriate corrections according to the comments.

Reviewer #3: This is an interesting study; however, it is limited in terms of the authors discussing the practical implications of their findings. Therefore, they should address this concern before I can recommend the article for publication.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: No

**********

PLoS One. doi: 10.1371/journal.pone.0287546.r004

Acceptance letter

Ayesha Afzal

14 Jun 2023

PONE-D-22-23738R1

An investigation of the performance of parametric functional forms for the Lorenz curve

Dear Dr. Sitthiyot:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Ayesha Afzal

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Appendix

(DOCX)

Click here for additional data file.^{(24.2KB, docx)}

Attachment

Submitted filename: PONE-D-22-23738.docx

Click here for additional data file.^{(803.3KB, docx)}

Attachment

Submitted filename: Response to Reviewers_R1.pdf

Click here for additional data file.^{(101.8KB, pdf)}

Data Availability Statement

[pone.0287546.ref001] 1.Lorenz MO. Methods of measuring the concentration of wealth. Pub Am Stat Assoc. 1905;9: 209–219. [Google Scholar]

[pone.0287546.ref002] 2.Sitthiyot T, Holasut K. A simple method for estimating the Lorenz curve. Humanit Soc Sci Commun. 2021;8(268): 1–9. Available from: 10.1057/s41599-021-00948-x [DOI] [Google Scholar]

[pone.0287546.ref003] 3.Paul S, Shankar S. An alternative single parameter functional form for Lorenz curve. Empir Econ. 2020;59: 1393–1402. 10.1007/s00181-019-01715-3 [DOI] [Google Scholar]

[pone.0287546.ref004] 4.Gini C. On the measurement of concentration and variability of characters (English translation from Italian by de Santis F.). Metron. 2005;63: 3–38. [Google Scholar]

[pone.0287546.ref005] 5.Kakwani NC, Podder N. On the estimation of Lorenz curves from grouped observations. Int Econ Rev. 1973;14: 278–292. [Google Scholar]

[pone.0287546.ref006] 6.Kakwani NC, Nripesh P. Efficient estimation of the Lorenz curve and associated inequality measures from grouped observations. Econometrica. 1976;44: 137–148. [Google Scholar]

[pone.0287546.ref007] 7.Kakwani NC. On a class of poverty measures. Econometrica. 1980;48: 437–446. [Google Scholar]

[pone.0287546.ref008] 8.Rasche RH, Gaffney JM, Koo AYC, Obst N. Functional forms for estimating the Lorenz curve. Econometrica. 1980;48: 1061–1062. [Google Scholar]

[pone.0287546.ref009] 9.Aggarwal V. On optimum aggregation of income distribution data. Sankhyā B. 1984;46: 343–355. [Google Scholar]

[pone.0287546.ref010] 10.Gupta MR. Functional form for estimating the Lorenz curve. Econometrica. 1984;52: 1313–1314. [Google Scholar]

[pone.0287546.ref011] 11.Arnold BC. A class of hyperbolic Lorenz curves. Sankhyā B. 1986;48: 427–436. [Google Scholar]

[pone.0287546.ref012] 12.Rao ULG, Tam AY-P. An empirical study of selection and estimation of alternative models of the Lorenz curve. J Appl Stat. 1987;14: 275–280. 10.1080/02664768700000032 [DOI] [Google Scholar]

[pone.0287546.ref013] 13.Villaseñor JA, Arnold BC. Elliptical Lorenz curves. J Econom. 1989;40: 327–338. [Google Scholar]

[pone.0287546.ref014] 14.Basmann RL, Hayes K, Slottje D, Johnson J. A general functional form for approximating the Lorenz curve. J Econom. 1990;92: 727–744. [Google Scholar]

[pone.0287546.ref015] 15.Ortega P, Martín G, Fernández A, Ladoux M, García A. A new functional form for estimating Lorenz curves. Rev Income Wealth. 1991;37: 447–452. [Google Scholar]

[pone.0287546.ref016] 16.Chotikapanich D. A comparison of alternative functional forms for the Lorenz curve. Econ Lett. 1993;41: 129–138. [Google Scholar]

[pone.0287546.ref017] 17.Ogwang T, Rao ULG. A new functional form for approximating the Lorenz curve. Econ Lett. 1996;52: 21–29. [Google Scholar]

[pone.0287546.ref018] 18.Ogwang T, Rao ULG. Hybrid models of the Lorenz curve. Econ Lett. 2000;69: 39–44. [Google Scholar]

[pone.0287546.ref019] 19.Ryu H, Slottje D. Two flexible functional forms for approximating the Lorenz curve. J Econom. 1996;72: 251–274. [Google Scholar]

[pone.0287546.ref020] 20.Sarabia JM. A hierarchy of Lorenz curves based on the generalized Tukey’s lambda distribution. Econom Rev. 1997;16: 305–320. [Google Scholar]

[pone.0287546.ref021] 21.Sarabia JM, Castillo E, Slottje D. An ordered family of Lorenz curves. J Econom. 1999;91: 43–60. [Google Scholar]

[pone.0287546.ref022] 22.Sarabia JM, Castillo E, Slottje D. An exponential family of Lorenz curves. South Econ J. 2001;67: 748–756. [Google Scholar]

[pone.0287546.ref023] 23.Sarabia JM, Pascual M. A class of Lorenz curves based on linear exponential loss functions. Commun Stat–Theory Methods. 2002;31: 925–942. [Google Scholar]

[pone.0287546.ref024] 24.Rohde N. An alternative functional form for estimating the Lorenz curve. Econ Lett. 2009;105: 61–63. [Google Scholar]

[pone.0287546.ref025] 25.Helene O. Fitting Lorenz curves. Econ Lett. 2010;108: 153–155. [Google Scholar]

[pone.0287546.ref026] 26.Sarabia JM, Prieto F, Sarabia M. Revisiting a functional form for the Lorenz curve. Econ Lett. 2010;107: 249–252. [Google Scholar]

[pone.0287546.ref027] 27.Sarabia JM, Prieto F, Jordá V. About the hyperbolic Lorenz curve. Econ Lett. 2015;136: 42–45. [Google Scholar]

[pone.0287546.ref028] 28.Wang Z, Smyth R. A hybrid method for creating Lorenz curves. Econ Lett. 2015;133: 59–63. [Google Scholar]

[pone.0287546.ref029] 29.Sarabia JM, Jordá V, Trueba C. The Lamé class of Lorenz curves. Commun in Stat–Theory Methods. 2017;46: 5311–5326. [Google Scholar]

[pone.0287546.ref030] 30.Fellman J. Regression analyses of income inequality indices. Theor Econ Lett. 2018;8: 1793–1802. 10.4236/tel.2018.810117 [DOI] [Google Scholar]

[pone.0287546.ref031] 31.Tanak AK, Mohtashami Borzadaran GR, Ahmadi J. New functional forms of Lorenz curves by maximizing Tsallis entropy of income share function under the constraint on generalized Gini index. Physica A. 2018;511: 280–288. [Google Scholar]

[pone.0287546.ref032] 32.Jordá V, Sarabia JM, Jäntti M. Inequality measurement with grouped data: Parametric and non-parametric methods. J R Stat Soc Series A. 2021;184: 964–984. doi: 10.1111/rssa.12702 [DOI] [Google Scholar]

[pone.0287546.ref033] 33.Cowell FA, Mehta F. The estimation and interpolation of inequality measures. Rev Econ Stud. 1982;49: 273–290. [Google Scholar]

[pone.0287546.ref034] 34.Shorrocks AF, Wan G. Ungrouping income distributions: Synthesizing samples for inequality and poverty analysis. Research Paper No. 2008/16. United Nations University World Institute for Development Economics Research (UNU-WIDER). 2008. Available from: https://www.wider.unu.edu/publication/ungrouping-income-distributions [Google Scholar]

[pone.0287546.ref035] 35.Dagum C. A new model of personal income distribution: Specification and estimation. In: Chotikapanich D, editor. Modeling income distributions and Lorenz curves. Economic studies in equality, social exclusion and well-being, vol 5. New York: Springer; 1977. pp. 3–25. 10.1007/978-0-387-72796-7_1 [DOI] [Google Scholar]

[pone.0287546.ref036] 36.Pisula T, Mentel G, Brożyna J. Predicting bankruptcy of companies from the logistics sector operating in the Podkarpacie region. Mod Manag Rev. 2013;18: 113:133. [Google Scholar]

[pone.0287546.ref037] 37.Kharlamova G, Stavytskyy A, Zarotiadis G. (2018). The impact of technological changes on income inequality: the EU states case study. J Int Stud. 2018; 11: 76–94. doi: 10.14254/2071-8330.2018/11-2/6 [DOI] [Google Scholar]

[pone.0287546.ref038] 38.Bilan Y, Mishchuk H, Samoliuk N, Yurchyk H. Impact of income distribution on social and economic well-being of the state. Sustainability. 2020;12: 1–15. Available from https://www.mdpi.com/2071-1050/12/1/42935136666 [Google Scholar]

[pone.0287546.ref039] 39.Tung LT. Can public debt harm social development? Evidence from the Asian-Pacific region. J Intl Stud. 2020;13: 48–61. doi: 10.14254/2071-8330.2020/13-2/4 [DOI] [Google Scholar]

[pone.0287546.ref040] 40.The United Nations University World Institute for Development Economics Research (UNU-WIDER). World Income Inequality Database (WIID), Version date: 6 May 2020; 2020. [cited 2021 Jul. 25]. Available from: https://www4.wider.unu.edu/ [Google Scholar]

[pone.0287546.ref041] 41.The Office of the National Economic and Social Development Council. Poverty and Income Distribution Statistics; 2021. [cited 2021 Jul. 18]. Available from: https://www.nesdc.go.th/main.php?filename=social [Google Scholar]

[pone.0287546.ref042] 42.The World Bank. World Bank Open Data; 2020. [cited 2021. Jul. 22]. Available from: https://data.worldbank.org/ [Google Scholar]

[pone.0287546.ref043] 43.Cheong KS. An empirical comparison of alternative functional forms for the Lorenz curve. Appl Econ Lett. 2002;9: 171–176. [Google Scholar]

PERMALINK

An investigation of the performance of parametric functional forms for the Lorenz curve

Thitithep Sitthiyot

Kanyarat Holasut

Roles

Abstract

Introduction

Fig 1. The Lorenz curve and the Gini index.

Materials and methods

Table 1. The descriptive statistics of the observed Gini index and the decile income shares of 20 OECD countries whose values of the observed Gini index are below the critical threshold of 0.4180.

Table 2. The descriptive statistics of the observed Gini index and the decile income shares of the other 20 countries whose upper part of the income distribution has a long-tailed property defined as the income share of the top 20% being greater than 50% of total income share.

Results

The mathematical proof of the critical threshold for the Gini index

Table 3. The values of the Gini index calculated using different values of parameter γ based on Paul and Shankar [3]’s functional form.

The performance comparison between Paul and Shankar [3]’s functional form and the other popular functional forms when the Gini index is below the critical threshold

Table 4. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on R2.

Table 7. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on the estimated Gini index.

Table 5. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on MSE.

Table 6. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on IIM.

The performance comparison between Paul and Shankar [3]’s functional form and the other popular functional forms when the upper part of income distribution has a long-tailed property

Table 8. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on R2.

Table 11. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on the estimated Gini index.

Table 9. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on MSE.

Table 10. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on IIM.

Table 12. The income data of Thailand are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms all of which share the same value of the estimated Gini index which is equal to 0.4528.

Fig 4. The actual Lorenz plot and the estimated Lorenz curves using different functional forms all of which share the same value of the estimated Gini index which is equal to 0.4528.

The performance comparison on the basis of the estimated Gini index between single-parameter functional forms and a functional form that contains more than one parameter

Table 13. The income data of 20 OECD countries are used to demonstrate the performance of single-parameter functional forms and that of a two-parameter functional form proposed by Sitthiyot and Holasut [2] on the criterion of estimated Gini index.

Table 14. The income data of the other 20 countries are used to demonstrate the performance of single-parameter functional forms and that of a two-parameter functional form proposed by Sitthiyot and Holasut [2] on the criterion of estimated Gini index.

Conclusions

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Yuriy Bilan

Roles

Author response to Decision Letter 0

Decision Letter 1

Ayesha Afzal

Roles

Acceptance letter

Ayesha Afzal

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Table 4. The income data of 20 OECD countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on R².

Table 8. The income data of the other 20 countries are used to demonstrate the performance of Paul and Shankar [3]’s functional form and those of the other popular functional forms based on R².