Summary Indicators of Income and Wealth Distribution

Latest release

Survey of Income and Housing, User Guide, Australia

Reference period

2019-20 financial year

Released

28/04/2022

Next release Unknown

Introduction

There are many ways to illustrate aspects of the distribution of income and wealth, and to measure the extent of inequality. In the Survey of Income and Housing (SIH), five main types of indicators are used - means and medians, frequency distributions, percentile ratios, income and wealth shares, and Gini coefficients. This part of the publication describes how these indicators are derived.

Analysis of both income and wealth provides the most complete understanding of how economic resources are distributed across the population.

Analysis of households and persons

There are two common ways of presenting analysis of households:

number of households, or
number of people in households.

In the former, each household contributes the same regardless of its size e.g. a four person household would have the same representation as a person living alone. These are called household weighted estimates.

To provide a better understanding of the circumstances of people it is often preferable to study people in households e.g. the number of people in Australian households experiencing economic hardship. In this analysis, each person is attributed with the characteristics of the household to which they belong e.g. household income is used to determine whether it is a low or high income household but analysis is about numbers of people experiencing hardship. This approach keeps the focus on individual circumstances while recognising that people share household resources. The main income measure used in SIH publications is equivalised disposable household income, while the main wealth measure is net wealth of household. When data is equivalised, the means and medians are person weighted. Most estimates that are not equivalised, are household weighted. The exception is in tables that refer to 'household characteristics of persons' or 'persons in households'. These estimates are person weighted.

Summary measures

Counts

Counts provide an estimate of the total number of people or households with a particular characteristic and are derived by summing the survey weights of each observation of interest. In sample surveys the weights enable extrapolation of the survey responses to official population estimates.

Means and medians

Mean (average) and median (the midpoint when all persons or households are ranked in ascending order) are simple indicators that can be used to show income and wealth differences between subgroups of the population.

Mean

The mean, or average, value of a data item is calculated by multiplying the value of the data item for the population of interest in each record by the weight of the record and summing the resultant products, and then dividing the total by the sum of the weights of the records. For example, the mean gross income of Queensland households is the weighted sum of the gross income of each such household divided by the sum of the weights relating to each such household.

Advantages of the mean are that it is easy to calculate and the means of all subcomponents sum to the mean of all observations. Its drawbacks are the effect of extreme values and asymmetry of the distribution, both of which are relevant for income and wealth data. For example, a small number of very wealthy and a large number of relatively poor households may have the same average income or wealth as a population where there is equal distribution of resources.

Median

Medians divide the population of interest into halves. To identify the median record, the population is first ranked in ascending order according to the data item of interest. Except for person weighted measures of household variables, the weights of the records are then accumulated until half the population is accrued. The record at which this occurs is the median record, and its value for the data item of interest is the median value. For person weighted measures of household variables, the household weights are multiplied by the number of persons in the household before accumulation.

Compared to the mean, the median is a more stable measure and is less affected by extreme values and sample fluctuations. However, median values of subcomponents do not sum to the median of all observations.

Frequency distribution

A frequency distribution illustrates the location and spread of income and wealth within a population. It groups the population into classes by size of household income or wealth, and gives the number or proportion of people in each income or wealth range. A graph of the frequency distribution is a good way to portray the essence of the income or wealth distribution. Graph 1 shows the proportion of people within $50 household income ranges.

Graph 1 - Distribution of household income(a), 2017-18 to 2019-20
$	2017-18 (%)	2019-20 (%)
0	-	-
50	0.5	0.8
100	0.2	0.4
150	0.5	0.3
200	0.5	0.4
250	0.6	0.5
300	1.0	0.8
350	1.5	1.2
400	1.8	1.9
450	2.3	1.8
500	5.3	4.3
550	4.2	4.3
600	4.1	4.0
650	4.1	3.8
700	4.4	4.1
750	4.5	4.2
800	4.4	4.4
850	4.0	3.9
900	3.8	3.9
950	4.0	3.8
1000	3.5	3.7
1050	3.6	3.1
1100	3.0	3.0
1150	3.4	3.4
1200	3.0	3.1
1250	2.6	2.8
1300	2.9	2.4
1350	2.2	2.8
1400	1.9	2.2
1450	2.0	2.1
1500	1.5	1.8
1550	2.0	1.6
1600	1.7	1.5
1650	1.3	1.5
1700	1.1	1.4
1750	1.0	1.2
1800	0.9	1.4
1850	1.0	0.9
1900	0.8	0.9
1950	0.8	1.0
2000	0.6	0.7
2050	0.6	0.8
2100	0.4	0.5
2150	0.5	0.5
2200	0.4	0.4
2250	0.3	0.3
2300	0.3	0.5
2350	0.3	0.4
2400	0.3	0.3
2450	0.5	0.2
2500	0.2	0.3
2550	0.2	0.3
2600	0.1	0.2
2650	0.2	0.2
2700	0.2	0.2
2750	0.2	0.1
2800	0.1	0.2

Graph 1 - Distribution of household income(a), 2017-18 to 2019-20

["$","2017-18","2019-20"]

[["0","50","100","150","200","250","300","350","400","450","500","550","600","650","700","750","800","850","900","950","1000","1050","1100","1150","1200","1250","1300","1350","1400","1450","1500","1550","1600","1650","1700","1750","1800","1850","1900","1950","2000","2050","2100","2150","2200","2250","2300","2350","2400","2450","2500","2550","2600","2650","2700","2750","2800"],[[null],[0.5],[0.2],[0.5],[0.5],[0.6],[1],[1.5],[1.8],[2.3],[5.3],[4.2],[4.1],[4.1],[4.4],[4.5],[4.4],[4],[3.8],[4],[3.5],[3.6],[3],[3.4],[3],[2.6],[2.9],[2.2],[1.9],[2],[1.5],[2],[1.7],[1.3],[1.1],[1],[0.9],[1],[0.8],[0.8],[0.6],[0.6],[0.4],[0.5],[0.4],[0.3],[0.3],[0.3],[0.3],[0.5],[0.2],[0.2],[0.1],[0.2],[0.2],[0.2],[0.1]],[[null],[0.8],[0.4],[0.3],[0.4],[0.5],[0.8],[1.2],[1.9],[1.8],[4.3],[4.3],[4],[3.8],[4.1],[4.2],[4.4],[3.9],[3.9],[3.8],[3.7],[3.1],[3],[3.4],[3.1],[2.8],[2.4],[2.8],[2.2],[2.1],[1.8],[1.6],[1.5],[1.5],[1.4],[1.2],[1.4],[0.9],[0.9],[1],[0.7],[0.8],[0.5],[0.5],[0.4],[0.3],[0.5],[0.4],[0.3],[0.2],[0.3],[0.3],[0.2],[0.2],[0.2],[0.1],[0.2]]]

[]

[{"axis_id":"0","tick_interval":"","axis_min":"","axis_max":"","axis_title":"","precision":-1,"axis_units":"","tooltip_units":"","table_units":"","data_unit_prefix":"","data_unit_suffix":"","reverse_axis":false}]

[{"value":"0","axis_id":"0","axis_title":"%","axis_units":"","tooltip_units":"","table_units":"(%)","axis_min":"0","axis_max":"6","tick_interval":null,"precision":"-1","data_unit_prefix":"","data_unit_suffix":"","reverse_axis":false}]

Equivalised Disposable Household Income, weekly

Annotation: Persons with an income between $50 and $2,800 are shown in $50 ranges on the graph

Sources: ABS Survey of Income and Housing, 2017–18, 2019–20

Frequency distributions can provide considerable detail about variations in the income or wealth of the population being described, but it is difficult to describe the differences between two frequency distributions. They are therefore often accompanied by other summary statistics, such as the mean and median. Taken together, the mean and median can provide an indication of the shape of the frequency distribution. As can be seen in the Graph 1, above, the distribution of income tends to be asymmetrical, with a small number of people having relatively high household incomes and a larger number of people having relatively lower household incomes. The greater the asymmetry, the greater will be the difference between the mean and the median. The small number of very high values raises the mean, while the median is not impacted by extreme values.

Quantile measures

When persons (or any other units) are ranked from the lowest to the highest on the basis of some characteristic such as their household income or wealth, they can then be divided into equally sized groups. The generic term for such groups is quantiles.

Quintiles, deciles and percentiles

When the population is divided into five equally sized groups, the quantiles are called quintiles. If there are 10 groups, they are deciles, and division into 100 groups gives percentiles. Thus the first quintile will comprise the first two deciles and the first 20 percentiles.

SIH publications frequently present data classified into income or wealth quintiles, supplemented by data relating to those with incomes in the 3rd to 20th percentiles of equivalised disposable household income, i.e. the lowest income quintile excluding the bottom two percentiles. The latter is included to enable quintile-style analysis to be carried out without undue impact from very low incomes which may not accurately reflect levels of economic wellbeing. Estimates for this population in the relevant data cubes are labelled 'Adjusted lowest income quintile'.

Equivalised disposable household income and equivalised net wealth of household are some of the measures used to define the income and wealth quantiles shown in SIH publications, and the quantiles each comprise the same number of persons, that is, they are person weighted.

Gross household income and net worth of household are other measures used to define the income and wealth quantiles in these publications, and the quantiles each comprise the same number of households, that is, they are household weighted.Gross household income and net worth of household are other measures used to define the income and wealth quantiles in these publications, and the quantiles each comprise the same number of households, that is, they are household weighted.

Upper values, medians and percentile ratios

In some analyses, the statistic of interest is the boundary between quantiles. This is usually expressed in terms of the upper value of a particular percentile. For example, the upper value of the first quintile is also the upper value of the twentieth percentile and is described as P20. The upper value of the ninth decile is P90. The median of a whole population is P50, the median of the third quintile is also P50, the median of the first quintile is P10, etc.

Percentile ratios summarise the relative distance between two points on the income or wealth distribution. To illustrate the full spread of the distribution, the percentile ratio needs to refer to points near the extremes of the distribution, for example, the P90/P10 ratio. The P80/P20 ratio better illustrates the magnitude of the range within which the income or wealth levels of the majority of the population fall. The P80/P50 and P50/P20 ratios focus on comparing the ends of the distribution with the midpoint (the median).

Income or wealth shares

Income or wealth shares can be calculated and compared for each income or wealth quintile (or any other subgrouping) of a population. The aggregate income of the units in each quintile is divided by the overall aggregate income of the entire population to derive income or wealth shares.

Gini coefficient

Taken together, the simple measures of income or wealth distribution such as mean, median, percentile ratios and income shares can provide an indication of changes in the income or wealth distribution of a population over time, or differences in the income or wealth distributions of two separate populations. However, none of the simple measures comprise a single statistic that summarises the whole income or wealth distribution in a way that directly considers and compares the individual income or wealth levels of all members of the population. In SIH publications, the Gini coefficient is used to compile a single statistic of inequality by summarising the distribution of income or wealth across the population.

Concept of inequality

It is generally agreed that perfect equality in the distribution of income or wealth can be defined as the situation in which everyone in the population lives in a household with the same equivalised disposable household income or net worth. If any person has lower or higher equivalised disposable household income than any other person, there is inequality in the income distribution, and the same definition applies to wealth inequality. However, there is no unique, generally accepted way of summarising the degree to which a population does not have perfect equality, or, more practically, summarising the difference in inequality between two populations.

Unequal distributions of income can occur in many different ways. The majority of people may have very similar incomes with pockets of very high or very low income. Wealth, due to the effect of accrual over the life course, is generally more unequally distributed, that is, more concentrated among older persons than younger persons. Or entire populations may be heavily clustered at the top and the bottom of the income distribution with few people receiving incomes in between these extremes. To evaluate one distribution as having greater or lesser inequality than another, it is necessary to compare the distributions in terms of which segments of the population have a greater share of income and which segments have a lower share. It is then necessary to at least implicitly judge whether the relative gains by some people is more than offset or less than offset by the relative losses of other people. Different observers may make different judgments about the same situation, depending on factors including personal preferences.

For example, consider the equivalised disposable household income of the two populations A and B depicted in Graph 2, 'Frequency Distributions'. Population A is derived from the 2000–01 SIH population, while population B covers the same people as in population A, but everyone's income is transformed to reduce the proportional differences in income across the population while retaining the same mean income for the population. Therefore fewer people are on very low or very high incomes and more people are between these extremes, with the median for population B closer to the mean, and less spread between P10 and P90.

Graph 2 - Frequency distributions

Image

Description

Example of graph showing frequency distributions of income comparing P10s, medians, means and P90s for population A and population B.

The extent to which the income distributions for populations A and B vary from equality, and from each other, can be illustrated graphically another way, using Lorenz curves.

Lorenz curves

The Lorenz curve is a graph with the horizontal axis showing the cumulative proportion of the persons in the population ranked according to their income and with the vertical axis showing the corresponding cumulative proportion of equivalised disposable household income. The graph then shows the income share of any selected cumulative proportion of the population. The diagonal line represents a situation of perfect equality, i.e., where all people have the same equivalised disposable household income. Graph 3 'Lorenz Curves' shows the Lorenz curves for the two populations described above.

Graph 3 - Lorenz curves

Image

Description

Example of graph showing the Lorenz curve and the line of perfect equality for population A and population B.

Since the distribution of population B's income is uniformly less widely spread than for population A, all points of the Lorenz curve for population B are closer to the line of perfect equality than the corresponding points of the Lorenz curve for population A. In this situation, population B is said to be in a position of Lorenz dominance and can be regarded as having a more equal income distribution than population A. However, if the Lorenz curves of two populations cross over there is no Lorenz dominance and there is no generally accepted way of defining which of the two populations has the more equal income distribution.

Gini coefficient

The Gini coefficient can best be described by reference to the Lorenz curve. It is defined as the ratio of the area between the actual Lorenz curve and the diagonal (or line of equality) and the total area under the diagonal. The Gini coefficient ranges between zero when all incomes are equal and one when one unit receives all the income, that is, the smaller the Gini coefficient the more even the distribution of income.

Normally the degree of inequality is greater for the whole population than for a subgroup within the population because subpopulations are usually more homogeneous than full populations. This is illustrated in Graph 4 below, which shows two Lorenz curves from the 2019–20 SIH. The Lorenz curve for the whole population of the SIH is further from the diagonal than the curve for persons living in one parent, one family households, with at least one dependent child. Correspondingly, the calculated Gini coefficient for all persons was 0.324 while the coefficient for the persons in the one parent households was 0.311.

Graph 4 - Lorenz curves 2019-20 SIH
	EDHI (%)	Perfect Equality (%)	Lone Parent EDHI (%)
0	0	0	0
1	0.003	1	0.024
2	0.082	2	0.115
3	0.278	3	0.348
4	0.536	4	0.660
5	0.835	5	0.999
6	1.163	6	1.359
7	1.513	7	1.751
8	1.887	8	2.170
9	2.285	9	2.597
10	2.701	10	3.050
11	3.127	11	3.514
12	3.557	12	3.991
13	3.998	13	4.472
14	4.450	14	4.963
15	4.913	15	5.461
16	5.384	16	5.965
17	5.866	17	6.473
18	6.359	18	6.989
19	6.862	19	7.516
20	7.375	20	8.053
21	7.900	21	8.596
22	8.437	22	9.149
23	8.988	23	9.717
24	9.551	24	10.295
25	10.124	25	10.881
26	10.709	26	11.486
27	11.305	27	12.103
28	11.911	28	12.726
29	12.529	29	13.367
30	13.157	30	14.022
31	13.795	31	14.692
32	14.444	32	15.388
33	15.104	33	16.097
34	15.775	34	16.810
35	16.454	35	17.531
36	17.143	36	18.262
37	17.842	37	18.997
38	18.553	38	19.740
39	19.277	39	20.492
40	20.011	40	21.253
41	20.756	41	22.039
42	21.513	42	22.838
43	22.282	43	23.644
44	23.063	44	24.453
45	23.855	45	25.270
46	24.658	46	26.101
47	25.472	47	26.945
48	26.297	48	27.800
49	27.135	49	28.662
50	27.982	50	29.529
51	28.841	51	30.403
52	29.713	52	31.289
53	30.597	53	32.190
54	31.494	54	33.097
55	32.404	55	34.019
56	33.331	56	34.958
57	34.273	57	35.909
58	35.231	58	36.872
59	36.202	59	37.842
60	37.189	60	38.820
61	38.190	61	39.810
62	39.205	62	40.802
63	40.230	63	41.800
64	41.268	64	42.812
65	42.321	65	43.842
66	43.391	66	44.884
67	44.477	67	45.940
68	45.578	68	47.008
69	46.694	69	48.086
70	47.829	70	49.180
71	48.983	71	50.293
72	50.156	72	51.424
73	51.342	73	52.571
74	52.545	74	53.730
75	53.768	75	54.918
76	55.012	76	56.122
77	56.276	77	57.336
78	57.561	78	58.565
79	58.873	79	59.806
80	60.208	80	61.060
81	61.570	81	62.321
82	62.955	82	63.592
83	64.370	83	64.883
84	65.816	84	66.194
85	67.292	85	67.529
86	68.800	86	68.890
87	70.347	87	70.274
88	71.930	88	71.682
89	73.547	89	73.169
90	75.212	90	74.706
91	76.927	91	76.291
92	78.700	92	77.925
93	80.538	93	79.641
94	82.460	94	81.441
95	84.490	95	83.358
96	86.659	96	85.424
97	89.021	97	87.675
98	91.639	98	90.199
99	94.753	99	92.974
100	100.000	100	100.000

Graph 4 - Lorenz curves 2019-20 SIH

["","EDHI","Perfect Equality","Lone Parent EDHI"]

[["0","1","2","3","4","5","6","7","8","9","10","11","12","13","14","15","16","17","18","19","20","21","22","23","24","25","26","27","28","29","30","31","32","33","34","35","36","37","38","39","40","41","42","43","44","45","46","47","48","49","50","51","52","53","54","55","56","57","58","59","60","61","62","63","64","65","66","67","68","69","70","71","72","73","74","75","76","77","78","79","80","81","82","83","84","85","86","87","88","89","90","91","92","93","94","95","96","97","98","99","100"],[[0],[0.003],[0.082],[0.278],[0.536],[0.835],[1.163],[1.513],[1.887],[2.285],[2.701],[3.127],[3.557],[3.998],[4.45],[4.913],[5.384],[5.866],[6.359],[6.862],[7.375],[7.9],[8.437],[8.988],[9.551],[10.124],[10.709],[11.305],[11.911],[12.529],[13.157],[13.795],[14.444],[15.104],[15.775],[16.454],[17.143],[17.842],[18.553],[19.277],[20.011],[20.756],[21.513],[22.282],[23.063],[23.855],[24.658],[25.472],[26.297],[27.135],[27.982],[28.841],[29.713],[30.597],[31.494],[32.404],[33.331],[34.273],[35.231],[36.202],[37.189],[38.19],[39.205],[40.23],[41.268],[42.321],[43.391],[44.477],[45.578],[46.694],[47.829],[48.983],[50.156],[51.342],[52.545],[53.768],[55.012],[56.276],[57.561],[58.873],[60.208],[61.57],[62.955],[64.37],[65.816],[67.292],[68.8],[70.347],[71.93],[73.547],[75.212],[76.927],[78.7],[80.538],[82.46],[84.49],[86.659],[89.021],[91.639],[94.753],[100]],[[0],[1],[2],[3],[4],[5],[6],[7],[8],[9],[10],[11],[12],[13],[14],[15],[16],[17],[18],[19],[20],[21],[22],[23],[24],[25],[26],[27],[28],[29],[30],[31],[32],[33],[34],[35],[36],[37],[38],[39],[40],[41],[42],[43],[44],[45],[46],[47],[48],[49],[50],[51],[52],[53],[54],[55],[56],[57],[58],[59],[60],[61],[62],[63],[64],[65],[66],[67],[68],[69],[70],[71],[72],[73],[74],[75],[76],[77],[78],[79],[80],[81],[82],[83],[84],[85],[86],[87],[88],[89],[90],[91],[92],[93],[94],[95],[96],[97],[98],[99],[100]],[[0],[0.024],[0.115],[0.348],[0.66],[0.999],[1.359],[1.751],[2.17],[2.597],[3.05],[3.514],[3.991],[4.472],[4.963],[5.461],[5.965],[6.473],[6.989],[7.516],[8.053],[8.596],[9.149],[9.717],[10.295],[10.881],[11.486],[12.103],[12.726],[13.367],[14.022],[14.692],[15.388],[16.097],[16.81],[17.531],[18.262],[18.997],[19.74],[20.492],[21.253],[22.039],[22.838],[23.644],[24.453],[25.27],[26.101],[26.945],[27.8],[28.662],[29.529],[30.403],[31.289],[32.19],[33.097],[34.019],[34.958],[35.909],[36.872],[37.842],[38.82],[39.81],[40.802],[41.8],[42.812],[43.842],[44.884],[45.94],[47.008],[48.086],[49.18],[50.293],[51.424],[52.571],[53.73],[54.918],[56.122],[57.336],[58.565],[59.806],[61.06],[62.321],[63.592],[64.883],[66.194],[67.529],[68.89],[70.274],[71.682],[73.169],[74.706],[76.291],[77.925],[79.641],[81.441],[83.358],[85.424],[87.675],[90.199],[92.974],[100]]]

[]

[{"value":"0","axis_id":"0","axis_title":"Cumulative proportion of persons ranked according to (%):","axis_units":"","tooltip_units":"","table_units":"","axis_min":null,"axis_max":"100","tick_interval":"20","precision":"-1","data_unit_prefix":"","data_unit_suffix":"","reverse_axis":false}]

[{"value":"0","axis_id":"0","axis_title":"Cumulative proportion of income (%)","axis_units":"","tooltip_units":"(%)","table_units":"(%)","axis_min":null,"axis_max":"100","tick_interval":"20","precision":"-1","data_unit_prefix":"","data_unit_suffix":"","reverse_axis":false}]

Equivalised Disposable Household Income

Source(s): Survey of Income and Housing 2019–20

Mathematically, the Gini coefficient can be expressed as:

$G=\left(\frac{1}{2 n^{2} \mu}\right)\displaystyle\sum_{i=1}^{n}\sum_{j=1}^{n}\left|y_{i}-y_{j}\right|$

where:

n is the number of people in the population

u is the mean equivalised disposable household income of all people in the population

and yi and yj are the equivalised disposable household income of the ith and jth persons in the population.

The Gini coefficient is a summary of the differences between each person in the population and every other person in the population. The differences are the absolute arithmetic differences, and therefore a difference of $x between two relatively high income people contributes as much to the index as a difference of $x between two relatively low income people.

An increase in the income of a person with income greater than median income will always lead to an increase in the coefficient, and a decrease in the income of a person with income lower than median income will also always lead to an increase in the coefficient. The extent of the increase will depend on the proportion of people that have income in the range between median income and the income of the person with the changed income, both before and after the change in income. At the extremes, increasing the income of the person with the lowest income by $x – or increasing the income of the person with the highest income by $x – will respectively decrease and increase the Gini coefficient by the same amount (assuming the lowest income person remains the lowest income person after the change).

The Gini coefficient is sometimes criticised as being too sensitive to relative changes around the middle of the income distribution. This sensitivity arises because the derivation of the Gini coefficient reflects the ranking of the population, and ranking is most likely to change at the densest part of the income distribution, which is likely to be around the middle of the distribution.

The Gini coefficient is the only single statistic summary of income distribution included in the SIH publications. The Gini is preferred over other summary measures because it is not overly sensitive to the very low income or wealth values that can be reported, and it is relatively simple to interpret.

APA

Citation

Summary Indicators of Income and Wealth Distribution

APA

Citation

Introduction

Analysis of households and persons

Summary measures

Counts

Means and medians

Mean

Median

Frequency distribution

Quantile measures

Quintiles, deciles and percentiles

Upper values, medians and percentile ratios

Income or wealth shares

Gini coefficient

Concept of inequality

Graph 2 - Frequency distributions

Lorenz curves

Graph 3 - Lorenz curves

Gini coefficient

Provide feedback