|Page tools: Print Page Print All|
18 The SDB date of arrival on which the scope is based reflects an individual's latest arrival pertaining to their latest permanent visa. For visas that were granted onshore (i.e. in Australia), the Arrival Date refers to the latest date of arrival prior to the grant of that visa. For Settlement visas that were granted offshore (i.e. outside of Australia), the Arrival Date refers to the first date of arrival after the grant of that visa.
19 Of the 1,773,333 SDB records, 951,234 records (54%) were linked to a PIT record. After obtaining the PIT records for the 2009-10 dataset, there were 812,506 records on the 2009-10 PITMID.
20 The main reasons for not linking an SDB record to a PIT record are:
21 Analysis of the SDB and the PIT datasets prior to linking indicated that they were both of a sufficiently high quality to link if a corresponding PIT record existed. However, there are a number of valid reasons why an SDB record would not have a corresponding PIT record. There are many legitimate reasons for there not being a corresponding PIT record as not every person would be required to submit a tax return. These include but are not restricted to a person:
22 As such, it was not expected that the linkage rate for this feasibility study would reach 100% for the entire population on the SDB. There has been no weighting or adjustment applied to the dataset but due to the high match rate, bias is anticipated to be small. For more information, see the "Reliability of estimates" section below.
23 Employee income includes the main forms of payments made to employees for their work or services. Employee income, as reported on the income tax return, includes:
24 Employee income includes employer social contributions. However, income for many components of social contributions are not reported on the PIT dataset and hence are not included in the calculation of Employee income for this publication.
25 Reportable employer superannuation contributions include contributions for both employees and own unincorporated business owners. However, the proportion of people reporting as own unincorporated business owners is considered to be sufficiently small for the gross amount to be included as employee income.
26 For the first time, other net foreign employment income was separately identified by the ATO on the 2009-10 tax form for individuals (Item Q20-T). Items Q20-T and Q20-N on the tax return are included as foreign employment income. Amounts in T are for foreign employment income where an Individual PAYG form has not been issued by the foreign employer.
27 Amounts in Q20-N are for foreign employment income where tax exemptions apply (e.g. employment income for some defence and police work). Where the foreign employment income is 100% tax exempt, it will not appear on the tax return or the supplementary form. Where the foreign employment income is not 100% tax exempt, the amounts reported in Q20-N are taxable amounts, that is gross amounts less deductions.
28 There is currently no way to separately identify income from wage and salary from foreign sources in the item "Total income from wage and salary (before tax and application of Medicare levy) as shown on the 'PAYG payment summary - individual non-business'".
Own unincorporated business income
29 Own unincorporated business income includes the following data items on the individual income tax return:
30 "Net personal services income" does not include income a person received as an employee, making it different from "Attributed personal services income".
31 Investment income includes the following data items on the individual income tax returns:
Superannuation and annuity income (included in Other income)
32 Superannuation and annuity income includes the following data items on the individual income tax returns:
33 A change to legislation relating to superannuation, taking effect from 1 July 2007, means that people aged 60 years and over who receive superannuation income in the form of a lump sum or income stream (such as a pension) from a taxed source, will receive that income tax free. Therefore, if a person has no other income, or their total income is below the tax-free threshold, or any tax payable is mitigated by a tax offset (such as Senior Australian Tax Offset), then this person will not be required to lodge a tax return. Therefore, there is a known undercount for this data item.
34 Methods for including more complete Superannuation and annuities income from additional datasets are being considered for future iterations of this project.
Other income (excluding Government pensions and allowances)
35 Other income (excluding Government pensions and allowances) includes the following data items on the individual income tax returns:
36 Due to the tax exempt status of many Government pensions and allowances, information on these income sources are under-reported on the PIT dataset. For this reason, Government pensions and allowances are not included in the calculation of Total income in this publication. Methods for including more complete Government pensions and allowances information sourced from different DSS datasets are being considered for future iterations of this project.
Total income (excluding Government pensions and allowances)
37 Total income (excluding Government pensions and allowances) is calculated by summing the values of Employee income, Own unincorporated business income, Investment income and Other income (including superannuation and annuities).
Non-standard income sources
Income from foreign sources
38 Given the migrant focus of this publication, a non-standard income source for income from foreign sources was calculated.
39 Income from foreign sources include the following data items on the individual income tax return:
Taxable income or loss deciles
40 The Australian taxpayer population is ranked in ascending order according to the Taxable income or loss reported on their 2009-10 income tax return (ITR), and then divided into ten equal groups (i.e. deciles), each comprising 10% of the population. Extreme values have not been excluded from the individual income tax data and therefore contributes to medians and decile cut offs . Individual taxation return lodgers whose age
Note that the deciles are based on all 2009-10 ITR lodgments and differ from the ATO Taxation statistics percentile table which is based on Taxable individuals only. Taxable individuals only include those with a net tax amount greater than zero.
Counts of individuals
41 Individuals may receive income from a number of sources. Net income from a specific source may be positive or negative. For example, an individual may have positive net income from Employee income but negative net income from Investment. The number of individuals for each income source includes all persons with either positive or negative net income from that source.
42 Readers should note that individuals can receive income from more than one source. The total number of persons receiving total income cannot be calculated as the sum of the individual income source components. For example, an individual could derive income from Employee income, Investment and their Own unincorporated business and therefore contribute to the person count in all three income categories. However, they would only be counted once in Total income.
43 There are a few data consideration that users should be aware of when interpreting or analysing the statistics.
ATO definition of Australian resident for taxation purposes
44 Generally, the ATO considers someone to be an Australian resident for tax purposes if they:
Definition of 'Job'
45 A job is determined by an Individual Pay-As-You-Go (PAYG) record linked to an ABN record. A person with more than one PAYG link to an ABN is considered as having more than one job in the financial year. This is not the same as being considered a "multiple job holder" as it cannot be determined whether a person held each job concurrently, consecutively, or both. In addition, where there is a job splitting arrangement in place for two employees performing the one job, this would be counted as two jobs based on the Individual PAYG rather than one. Given this definition and the available tax data, there will be an over-count for the number of jobs.
46 The ABS uses an economic statistics model on the Australian Business Register (ABR) to describe the characteristics of a business and the structural relationship between related businesses. Within large and diverse business groups, the units model is used to define reporting units that can provide data to the ABS at suitable levels of detail.
47 In mid 2002, the ABS commenced sourcing its register information from the ABR and at that time changed its business register to a two population model. The two populations comprise what is called the Profiled Population and the Non-Profiled Population. The main distinction between businesses in the two populations relates to the complexity of the business structure and the degree of intervention required to reflect the business structure for statistical purposes.
48 The majority of businesses included on the ABR are in the Non-Profiled Population. Most of these businesses are understood to have simple structures. For these businesses, the ABS is able to use the Australian Business Number (ABN) as the basis for a statistical unit. One ABN equates to one statistical unit.
49 For a small number of businesses, the ABN unit is not suitable for ABS economic statistics purposes and the ABS maintains its own units structure through direct contact with businesses. These businesses constitute the Profiled Population. This population consists typically of large or complex groups of businesses. For more information on how the ABS codes these businesses, refer to the Explanatory Notes of publication Australian Industry, 2013-14 (cat. no. 8155.0).
Processing of tax returns
50 The data presented in this publication were compiled before the processing of all income tax returns for any given year may have been completed. Data provided to the ABS by the ATO are from returns processed up to 31 October, 16 months after the end of the financial year. Any returns lodged after this date are not included. Therefore for 2009-10, returns processed after 31 October 2011 are not included. This enables a consistent basis for comparison across financial years.
51 Due to the late lodgement of tax returns in any tax reference year, the data provided in this report under-estimates the total taxable income for any given financial year.
52 Annual revised data is published by the ATO in the Taxation Statistics publication (Personal Tax, Table 7 in 2009-10 and Individuals' Tax from 2010-11) for selected income items. One of these items is 'Wage and Salary'. Although this data item is different to the data contained in this release (as it does not include all the items for Employee income listed in paragraph 23), it can be used to give an indication of the likely direction of change in the number of 'Wage and salary' earners and total 'Wage and salary' income as more tax returns are lodged.
53 As an example, Table 1 below shows that for the 2009-10 income year, an additional 4.7% of taxpayers earning income from 'Wages and salaries' (included in Employee income) lodged their income tax returns in the two years after the initial processing cut off of 31 October 2011. This translated to a further 4.9% of 'Wages and salaries' income being reported, two years onwards.
Changes in taxation policy
54 The ATO provides information annually in Taxation Statistics on their website about changes that may affect taxation statistics. Changes relating to personal income tax are in each edition of Taxation Statistics.
55 For the income year 2009-10, a number of changes were noted in Chapter 2 of Taxation Statistics, including:
56 In accordance with the Census and Statistics Act 1905, all published estimates are subjected to a confidentiality process before release. This process is undertaken to minimise the risk of identifying particular individuals, families, households or dwellings in aggregate statistics, through analysis of published data.
57 To minimise the risk of identifying individuals in aggregate statistics, a technique is used to randomly adjust cell values. This technique is called perturbation. Perturbation involves small random adjustment of the statistics and is considered the most satisfactory technique for avoiding the release of identifiable statistics while maximising the range of information that can be released. These adjustments have a negligible impact on the underlying pattern of the statistics. After perturbation, a given published cell value will be consistent across all tables. However, adding up cell values to derive a total will not necessarily give the same result as published totals.
58 The introduction of perturbation in publications ensures that these statistics are consistent with statistics released via services such as TableBuilder. Caution should be exercised by users when deducing that there are nil people in an area with certain types of income. In general, no reliance should be placed on table cells with small values. Due to limitations of systems used to produce the publication, the median and average values in the data cubes have not been perturbed. These values will differ to those produced in services such as TableBuilder.
Reliability of estimates
59 Error in estimates produced using the Personal Income Tax and Migrants Integrated Datasets may occur due to false links, missed links and the non-random distribution of unlinked records.
60 In order to assess the success of the linking, the following subpopulation of linked records was identified as those most likely to have submitted a tax return:
61 This subpopulation of linked records were assessed against secondary sources such as the Estimated Resident Population (ERP), the 2011 Census and the 2011 Australian Census and Migrants Integrated Dataset (ACMID) in order to estimate the match rate. The match rate was estimated to be almost 90%. For this reason, the linked dataset was not weighted. It was believed that with a match rate of 90%, the linked dataset was sufficient to portray the population of migrants who submitted a tax return in the financial year and weighting was not required.
62 See Research Paper: Feasibility Study of Linking Migrant Settlement Records to Personal Income Tax Data, Aug 2014 (cat. no. 1351.0.55.051) for more information.
63 Given the high quality of the variables used (name and address) for the deterministic and probabilistic linking and the strict acceptance conditions used, the expected number of false links in 2009-10 PITMID is not expected to be significant. However, the number of false links in the dataset has not been calculated or estimated.
64 If the linking process is conducted in the future, SDB records linked in previous years will not be reprocessed. They will remain linked to the original PIT record. Due to the proposed enduring nature of the links, it was required that strict acceptance conditions be imposed on the linkage process to minimise the potential number of false links.
65 Due to the strict acceptance conditions imposed during the linkage process, it is possible that some legitimate records could have been missed. However, given the high quality of the linking variables used, it is likely that a high proportion of the unlinked records are simply non-tax lodgers and not missed links. However, the number of missed links in the dataset has not been calculated or estimated though the match rate of 90% indicates that the number of missed links may be about 10%.
66 Error introduced by under or over representation of characteristic based groups in unlinked records has not been mitigated by a calibration process. Due to the high quality of the linking variables, it is likely that a high proportion of the unlinked records are simply persons who did not lodge a taxation return.
Comparability with other data
67 Estimates from the 2013 Migrants PIT Project will differ from the estimates produced from other ABS collections and estimates produced from the SDB for several reasons. The estimates are a result of integrating data from two administrative data sources. The linked records are uncalibrated as there were no known population totals to benchmark to, and the resulting dataset is unique from both the PIT data and the SDB. Due to the quality issues mentioned in the Reliability of Estimates section, estimates should generally be treated with caution. Further information about the data and the linking methodology used is available in the Research Paper: Feasibility Study of Linking Migrant Settlement Records to Personal Income Tax Data, Aug 2014 (cat. no. 1351.0.55.051). This research paper provides a summary of the Migrant PIT Feasibility Study.
Australian Census and Migrant Integrated Dataset (ACMID)
68 The Australian Census and Migrants Integrated Dataset (ACMID) was created by integrating records from the SDB who had been granted a permanent visa between 1 January 2000 and Census night (9 August 2011) with the 2011 Census of Population of Housing. Approximately 90% of the records on the PITMID files could also be present on the ACMID.
69 The ACMID contains information such as Occupation of main job and Personal weekly income. However, this information is limited to persons aged 15 years and over on Census night. The information on Occupation and income on the PITMID includes all persons who linked to a PIT record, regardless of age. The ACMID also includes all persons aged 15 years and over on Census night, while the PITMID may only include those who earned an income above the tax free threshold. Therefore, while similar statistics can be derived from both the ACMID and the PITMID, they are not directly comparable due to differing population scopes.
Other ABS publications utilising PIT data
70 Estimates from the PITMID are not comparable to the PIT data included in the regional statistics publications; Wage and Salary Earner Estimates for Small Areas (cat. no. 5673.0.55.003) and Estimates of Personal Income for Small Areas (cat. no. 6524.0.55.002). These publications include the PIT data for the whole population, not just migrants. Consequently, personal income data for permanent migrants cannot be identified in these publications' data.
71 The ABS acknowledges the continuing support provided by the Department of Immigration and Border Protection (DIBP), the Department of Social Services (DSS) and the Australian Taxation Office (ATO) for the Migrant PIT Data Integration Project. The provision of data as well as ongoing assistance provided by our stakeholders is essential to enable this important work to be undertaken. The enhancing of migrant related statistics through data linkage by the ABS would not be possible without their cooperation and support.
72 For further information about these and other statistics, please contact the National Information and Referral Service on 1300 135 070.
These documents will be presented in a new window.