5204.0.55.011 - Australian National Accounts: Distribution of Household Income, Consumption and Wealth, 2003-04 to 2011-12

ARCHIVED ISSUE Released at 11:30 AM (CANBERRA TIME) 03/10/2014 First Issue

Page tools: Print

Print Page Print all pages in this product

Print All

OVERVIEW OF METHODOLOGY

The methodology used to compile the data in this publication is based on the methodology (summarised below) used in the Information Paper: Australian National Accounts, Distribution of Household Income, Consumption and Wealth, 2009-10 (cat. no. 5204.0.55.009). However, minor improvements made to the original methodology as well as the challenges involved in adapting the methodology and data sources for use in the construction of a time series will be described in detail in the chapter.

A summary of the methodology used in this release from the Information Paper: Australian National Accounts, Distribution of Household Income, Consumption and Wealth, 2009-10 (cat. no. 5204.0.55.009) is as follows:
Current price household estimates for income, consumption and wealth from the Australian System of National Accounts, 2012-13 (cat. no. 5204.0), for the years 2003-04, 2005-06, 2007-08, 2009-10 and 2011-12 were distributed for five household distributional indicators based on data from the ABS Survey of Income and Housing (2003-04, 2005-06, 2007-08, 2009-10 and 2011-12 releases) and ABS Household Expenditure Survey (2003-04 and 2009-10 releases). Estimates for non-profit institutions serving households (NPISH) included in the household sector in the 5204.0 estimates were removed from the household national accounts in this information paper.

The household national accounts estimates for a particular year (macro) and the corresponding ABS household survey estimates (micro) were compared, and coverage ratios (micro/macro) calculated. For some items, the macro and or micro estimates were adjusted to derive the most relevant common scope for comparison. The corresponding micro household items were sub sectored into the following household groups: main source of income; equivalised income quintiles; household composition; age of household reference person; and equivalised net worth quintiles.

The Australian System of National Accounts (ASNA) household components and aggregates were distributed to the five household groups:

directly using the distribution of the equivalent micro component when the coverage ratio was considered adequate, for example, social assistance benefits;
indirectly by a related micro distribution when there was no direct micro distribution information for the national accounts item, for example the national accounts item non-life insurance claims were distributed using the micro distribution for total insurance premiums paid;
indirectly by creating a micro distribution ('synthesised') based on related micro distribution, for example, synthesised micro distribution was created for the national accounts item financial intermediation services indirectly measured (FISIM) for consumer loans; and
by the corresponding aggregate distribution for income (disposable income), consumption (final consumption expenditure), assets (total assets) and liabilities (total liabilities), when micro distributions either directly or indirectly are not available. For these national accounts items, the inclusion or exclusion of the item did not impact on the distribution of the national accounts aggregates.

The very remote communities and people living in non-private dwellings, populations that were out of scope of the micro surveys, were excluded from the ASNA estimates and distributed separately using data from the 2006 and 2011 ABS Census of Population and Housing. These distributions were then added to the ASNA distributions based on the micro surveys to obtain the final distribution of the ASNA household income, consumption and wealth estimates.

For detail information regarding the methodology described above, please refer to Information Paper: Australian National Accounts, Distribution of Household Income, Consumption and Wealth, 2009-10 (cat. no. 5204.0.55.009), Chapter 4 - Data Sources and Methodology: Distribution of the Household National Accounts Estimates.

IMPROVEMENTS TO THE ORIGINAL METHODOLOGY

The following improvements were made to the original methodology described above:

Household Quintiles

There are two options for quintile boundaries when sorting households into equivalised income and net worth quintiles. Either, an equal number of people are allocated to each quintile (person weighted quintiles) or equal number of households are allocated to each quintile (household weighted quintiles). For national accounts purposes, the preference is to use household weighted quintiles as the preferred unit of analysis is the household. However, due to resource and time limitations, person based quintiles were used in the compilation of Information Paper: Australian National Accounts, Distribution of Household Income, Consumption and Wealth, 2009-10 (cat. no. 5204.0.55.009).

Due to the addition of the out of scope population to the micro defined equivalised income and net worth quintiles, the final income and net worth quintiles published in this release are not equal in proportion. From the lowest to the highest quintile, the following proportion of households: 20.9; 20.1; 19.7; 19.6; and 19.7 respectively are in each quintile. This is still an improvement on the quintiles generated for the original methodology.

Distribution of non-indigenous population living in very remote communities

Previously, due to access limitation of the 2006 Census data set, the non-indigenous population living in very remote communities were distributed based on the distribution of the in scope micro population. For this release, we were able access a larger scope of the 2006 Census data set, therefore non-indigenous households living in very remote communities were distributed using the demographic information from the Census data set, this has led to greater conceptual consistency across the distributions of the out of scope population.

IMPLEMENTATION OF THE TIME SERIES

Periodicity

The time series presented in this release for distribution of the household income, consumption and wealth is biannual from 2003-04 to 2010 -12. The decision to start the time series in 2003-04, and compile the estimates biennially was based on:

the availability of income and wealth modules from the ABS Survey of Income and Housing (SIH);
the availability of the improved income data collected through the ABS Survey of Income and Housing, which was 2003-04;
distributed data points not being more than two years apart from the nearest ABS Census data and ABS Household Expenditure Survey (HES); and
availability of the micro Social Transfer in Kind (STiK) data.

Model used to estimate for data gaps in distributional source data

Two options were investigated to model for distributional household indicators for the years that the source micro (SIH, HES, Census and STiK) data was not available. The first option was to use the nearest available source data for the missing years (nearest year method); and the second option was to linearly interpolate (or extrapolate) the data for the missing years. The second option was chosen.

Option one: nearest year method

The Table 3.1 below provides the source data that would be applied for the ASNA time series points. For example, to compile the estimates for 2007-08, the HES data from the 2009-10 survey would have been used as no HES data exists for 2007-08.

Table 3.1: Year of source data used to compile estimates for each year using the ‘nearest year’ method

Data Set	ASNA	Census	SIH-Income	SIH-Wealth	HES	STiK


2003-04	2003-04	2006	2003-04	2003-04	2003-04	2003-04
2005-06	2005-06	2006	2005-06	2005-06	2003-04	2003-04
2007-08	2007-08	2006	2007-08	2009-10	2009-10	2009-10
2009-10	2009-10	2011	2009-10	2009-10	2009-10	2009-10
2011-12	2011-12	2011	2011-12	2011-12	2009-10	2011-12

When this model was applied to distribute the ASNA data for the time series, and growth patterns for components were analysed some major flaws were revealed, due to the following assumption of this model:

(1) for any two years that share the same distributional source data, the distribution pattern does not change. For example, 2003-04 HES was used for years 2003-04 and 2005-06, and therefore the assumption is that the household consumption pattern between these two years would not have changed.

(2) any changes to the distribution of household income, consumption and wealth between two points where no source data is available, and the nearest data used is two different time points of a survey, the changes in data between the two iterations of survey are captured over the two years, regardless of the distance between the source data estimates. For example, the time point 2005-06, the 2003-4 HES data was applied; for the time point 2007-08, the 2009-10 HES data applied. Despite the fact that these two HES surveys are six years apart, the changes in consumption pattern between these two surveys will be captured in the two year period between 2005-06 and 2007-08, and therefore the compression of the change makes the change between the data points bigger than it is in reality.

The aim of implementing a time series of the household distribution ASNA income, consumption and wealth data is to capture changes overtime of these household aggregates and the components, the nearest year method did not adequately do this.

Option two: linear interpolation and extrapolation

As mentioned above, linear interpolation (and extrapolation) was applied to generate ASNA distributional household groups for the years that the source micro (SIH, HES, Census and STiK) data was not available. A detail description is provided of this model below.

Table 3.2: Household final consumption expenditure, Equivalised income quintiles, clothing and footwear, 2003-04 and 2009-10

Financial Year	Numeric Year	Q1	Q2	Q3	Q4	Q5


2003-04	2004.5	1333.43	1949.69	2527.73	3304.56	4423.15
2005-06	2010.5	1870.97	2366.12	3152.12	4277.48	6868.69

Table 3.2 provides the ASNA household final consumption expenditure (HFCE) distributed to equivalised income quintiles using available source data from 2003-04 and 2009-10 HES. Linear interpolation and extrapolation methodology is applied to generate 2005-06, 2007-08 and 2011-12 ASNA HFCE equivalised income quintiles using the 2003-04 and 2009-10 HFCE quintile estimates from Table 3.2.

The following example is used to explain the methodology used to linearly interpolate and extrapolate missing values. Let Point A be a known value (v_A) at a known point in time (t_A) and let point B be another known value (v_B) at a known point in time (t_B) where t_B > t_A. Let point X be an unknown value (v_X) at a known point in time (t_X) that we want to estimate. This information is summarised in Figure 3.1.

Figure 3.1: Graphical display of the variables used for Formulas 3.1 and 3.2

As triangles APX and AQB are similar triangles, Formula 3.1 can be constructed in which the only unknown variable is v_X.

Formula 3.1 (to find the value of v_X)

Initial. Isolating v_X and simplifying then gives Formula 3.2.

Formula 3.2 (to find the value of v_X, simplified)

Using the data from 2003-04 for quintile 1 Point A is (2004.5, 1333.43), Point B is (2010.5, 1870.97) and Point X is (2008.5, v_X). Substituting these numbers into formula 3.2 give us formula 3.3 which gives us a value for v_x.

Formula 3.3 Formula to find the value of v_X in the given example.

The above process is repeated to generate data for all quintiles and for all missing years. Table 3.5 contains the time series , equivalised income quintiles, ASNA HFCE, clothing and footwear, 2003-04 to 2011-12.

Table 3.3: Household final consumption expenditure, Equivalised income quintiles, ASNA, HFCE, clothing and footwear,$m, 2003-04 to 2011-12.

Financial Year	Numeric Year	Q1	Q2	Q3	Q4	Q5


2003-04	2004.5	1333.43	1949.69	2527.73	3304.56	4423.15
2005-06	2006.5	1512.61	2088.50	2735.86	3628.87	5238.33
2007-08	2008.5	1691.79	2227.31	2943.99	3953.18	6053.51
2009-10	2010.5	1870.97	2366.12	3152.12	4277.48	6868.69
2011-12	2012.5	2050.15	2504.93	3360.25	4601.79	7683.86

The data in Table 3.3 , shows the interpolated equivalised income quintile data, linearly interpolated for years 2005-06 and 2007- 08 and extrapolated for 2011-12. The simulated data does have some shortcomings that it derives clearly linear changes in distribution, however, it does lead to a more realistic growth pattern than the nearest year model.

This ‘linear interpolation’ method was used to simulate Census, SIH and HES values using the available source data as the two nearest data points as the known points. The following should be noted in applying this method across all missing values:

2006 Census was assigned a numeric year of 2006.5 (i.e. 30th June 2006) and 2011 Census was assigned a numeric year of 2011.5 (i.e. 30th June 2011) even though the Censuses took place in August 2006 and August 2011 respectively. This was done to simplify the compilation of the data in the publication. However, it was assumed that any effect of the change in the economic and demographic structure of the Australian population over this 1.5 months would be insignificant, especially given the smaller effect of Census data on the final figures.
When using this process for extrapolation, the two nearest data points were still used. However, these were either both below or above the points being simulated. For example, when simulating 2011-12 HES, data items from the 2003-04 HES and the 2009-10 HES were used (as ‘Point A’ and ‘Point B’ respectively in the earlier example) as these were the two closest points. The same formula still holds in this case.
The one exception to using this process was in calculating the number of households in each of the net worth quintiles in 2007-08. This process wasn’t used as it was important to ensure that total number of households was the same regardless of which groups the items were divided into. In this case Formula 3.4 was used (see below) where Q_nt is the number in the number of households in Equivalised Net Worth Quintile n for year t and _Total is the total number of households for year t. Specifically, Formula 3.4 is distributing the difference in total households between 2005-06 and 2007-08 between each quintile proportional to the amount that each quintile contributed to the growth in total number of households from 2005-06 to 2009-10. It then adds that distributed difference to the 2005-06 count for each quintile to get the total number of households in each quintile for 2007-08.

Formula 3.4: Calculation of number of households in each Equivalised Net Worth Quintile for 2007-08.

METHODOLOGY FOR ESTIMATING MISSING SURVEY OF INCOME AND HOUSING DATA ITEMS

The 2003-04 and 2005-05 did not include some income and wealth data items included in 2009-10 SIH. When these missing data item form part of aggregates in the SIH, it meant the aggregates were underestimated in 2003-04 and 2005-06 compared 2009-10 SIH. This was particularly problematic for some of the income items as these items affected household’s main source of income (MSI) and the derivation of household equivalised income quintiles. Also, if SIH data set was not made consistent with the 2007-08 data, large series break between 2005-06 and 2007-08 would have appeared in ASNA distributional household data set.

The missing SIH data items for 2003-04 and 2005-06 were estimated by applying factors to the 2003-04 and 2005-06 SIH data, the factors were calculated from the items (now being reported) in 2007-08, 2009-10 and 2011-12 SIH. The missing SIH income items impacted only on MSI Property Income and Superannuation and Other categories . Once these MSI categories were estimated, new equivalised income quintiles were derived for 2003-04 and 2005-06 SIH data

CHANGING DEMOGRAPHICS OVER TIME

Once the time series for the distribution of household income, consumption and wealth was produced, the ABS reflected on the challenge of how this data set could be analysed as time series.

When distributional data across different years was compared, it is important to note that the change in the estimate is impacted by demographic changes over time such as increase in the number of households in a particular household distributional group. For example, when analysing final consumption expenditure on food, distributed by age of household reference person over 65 years , a change in the estimate from 2009-10 to 2011-12, need to be divided into (a) change due changes in consumption habits and (b) change due to more households in 2011-12 where the reference person was over 65?”

In order to separate changes due to (a) and (b) as described above, we applied a number of different methods to control for the demographic shift , that is (b).

(i) dollars per household

The data was analysed in ‘dollars per household’ terms. For this measure, the total level of an item in each group was divided by the number of households in each group to give the amount of income, consumption or wealth for that item per household in that group. This analysis while removing the effect of the change in demographic shift, it also removed the sense of some groups simply being bigger than others by virtue of having a greater number of households in that group.

For example, in 2011-12 consumption of food by households with a reference person aged 15-24 was $2,483m while the consumption of food by households with a reference person aged between 25 and 34 years was $11,418m. When, expressed in dollar per household terms the annual consumption per household was $6,784 and $7,995 respectively. The similarity of these two numbers hid the fact that less money was being spent on food by households with a reference person of age 15-24 years.

(ii) standardisation to a reference year

The idea of the standardisation method was to remove demographic shift without removing a sense of total spending. The approach standardise the household distribution to a reference year, in effect answering the question of "What would income/consumption/wealth distributions look like if there were the same number of households as in the reference year and they were distributed the same way?". This was achieved by converting the distributed ASNA items into dollars per household terms and multiplying these numbers by the number of households in that group in the reference year. An issue with this analysis was that while it included a sense of the different sizes of the groups, it removed the fact that the total number of households between all the groups was increasing and, as such, items on this basis understated total growth.

(iii) standardisation to the current year

This analysis was undertaken to try and get a sense of total growth back into the data. This was achieved by dividing the standardised items found in (ii) by the total number of households in the reference year and then multiplying the results by the total number of households in the actual year. This was like answering the question of "What would the income, consumption and wealth distributions look like this year if the households this year were distributed in the same way as the reference year?" This analysis returned a sense of total growth back into the data and maintained the sense of difference in the size of groups. However, the total figures for each item were no longer the total figures from the ASNA. In fact, the total figures were different depending upon what groups you were distributing the totals into.

After weighing up the advantages and disadvantages of the three methods described above to account for demographic shifts, method (i) dollars per household was applied to the data. While it may be useful or interesting to further explore other methods of adjusting for demographic shift, they are beyond the scope of this publication.

TIME SERIES ANALYSIS INCLUDED IN THE RELEASE

The release includes the following tables to:

analyse how household distributional groups have contributed to total growth of income, consumption and wealth - electronic table 9. This table includes demographic shifts such increase in the number of households in a particular household group as they are a important driver to total growth;
remove demographic shifts, the method used in this release is dollars per household - electronic table 3 and 4;
analyse household groups per household growth in income, consumption and wealth - electronic table 10;
analyse contribution by component (income, consumption and net worth) to a household group's growth per household of GDI, HFCE, Net Worth and Actual Final Consumption - electronic table 5 to 8; and
analyse the impacts of redistribution policies such as income tax, social assistance benefits and social transfers in kind; and the effectiveness of the policies over time- electronic table 11.

STRENGTHS AND WEAKNESS OF TIME SERIES

To enable users to interpret the data from the time series, the strengths and limitation of the time series are presented below.

Strengths

The time series of the distributed household income, consumption and wealth data:

is benchmarked to the aggregates published Australian System of National Accounts, 2012-2013, (cat. no. 5204.0), which enables users to interpret household distributional data within the broader context of published estimates on the Australian economy such as gross domestic product (GDP).
is complete and consistent for the years presented, where data was missing linear interpolation and other modelling techniques have been implemented.
is based on and expands upon work undertaken in an Organisation for Economic Cooperation and Development (OECD) and Eurostat (European Union statistical commission) expert group for measuring disparities in a national accounts framework. As a result, the data set produced for this publication should be comparable with any time series analysis on the distribution of household income, consumption and wealth in a national accounts framework performed by members of this expert group in the future.
may be extended with future data points and revised with new source data, enabling a more accurate and longer time series. The time series is based on robust methodology formulated through an international expert group (see above), with the availability of new source data from ABS micro surveys (Survey of Income and Housing and Household Expenditure Survey) and revised aggregate data from the ASNA , the data presented in this release may be easily revised and updated with future data points.

Weakness

The major weakness of this data set are the relatively long intervals between collection years for the Household Expenditure Survey (HES) and, to a lesser extent, the Census of Population and Housing (Census). As a result:

due to the availability of only two time periods for the HES (2003-04 and 2009-10) and the unavailability of proxies for the time periods in between, the missing source data had to be interpolated or extrapolated. While linear movements were assumed for the purpose of interpolation and extrapolation, care should be taken when looking at the distributions of household consumption items for 2005-06, 2007-08 and 2011-12, as this assumption may not be accurate. An additional limitation on extrapolation, is the estimates are based on past observations, increasing the possibility of error caused by the actual movement deviating from the past movement.
due to the unavailability of data on the distribution of household income, consumption and wealth items for populations living in Very Remote Communities and in Non-Private Dwellings, estimates had to be made for these items using ABS Census data. However, as this issue only affected about 2% of households the impact of the missing source data is assumed to be very small.

While improvements were made to equate the size of the equalised income and net worth quintiles, the number of households in each of the quintiles still ranges from 19.6% to 20.9%. This should be taken into account when comparing the quintiles.

Finally, users are encouraged to read Chapter 4, Methodological Issues, in the Information Paper: Australian National Accounts, Distribution of Household Income, Consumption and Wealth, 2009-10 (cat. no. 5204.0.55.009), to understand the issues in constructing a household distributional data set for a single time point.