1900.0 - Data Integration Project Register, Australia  
Latest ISSUE Released at 11:30 AM (CANBERRA TIME) 04/03/2019  First Issue

This document was added or updated on 16/08/2021.

Projects:
The Multi-Agency Data Integration Project (MADIP)
Linked Employer-Employee Database (LEED)
Australian Census Longitudinal Dataset (ACLD)
Business Longitudinal Analysis Data Environment (BLADE)



THE MULTI-AGENCY DATA INTEGRATION PROJECT

Commencement date: 01/06/2015
Project status: Ongoing

Project Summary

The Multi-Agency Data Integration Project (MADIP) is a secure data asset combining information on healthcare, education, government payments, income and taxation, employment, and population demographics (including the Census) over time.

Underpinning the MADIP data asset is a ‘spine’, which is currently based on a reference period of June 2019. As such, people who have joined the population since June 2019 are not present in the asset. The spine and the MADIP data asset are updated approximately annually.
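As a rough illustration of why people who are not on the spine do not appear in the asset, the sketch below links a few made-up records to a made-up spine. It is a simplification under stated assumptions: the identifiers, field names and the `link_to_spine` helper are hypothetical, and linking on a shared key stands in for whatever linkage method is actually used, which this register does not describe.

```python
# Hypothetical spine: one entry per person known at the reference period.
spine = {
    "P001": {"on_spine_since": 2011},
    "P002": {"on_spine_since": 2018},
}

# Hypothetical source records keyed by the same made-up person identifier.
source_records = [
    {"person_id": "P001", "payment": "Age Pension"},
    {"person_id": "P002", "payment": "Newstart Allowance"},
    {"person_id": "P003", "payment": "Newstart Allowance"},  # not on the spine
]


def link_to_spine(records, spine):
    """Split records into those that match a spine entry and those that do not."""
    linked, unlinked = [], []
    for record in records:
        if record["person_id"] in spine:
            # Carry the spine attributes across to the linked record.
            linked.append({**record, **spine[record["person_id"]]})
        else:
            unlinked.append(record)
    return linked, unlinked


linked, unlinked = link_to_spine(source_records, spine)
print(len(linked), "linked;", len(unlinked), "not on the spine")
```

In this toy case the record for `P003` has no spine entry, so it is left out of the linked output until the spine is next updated.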

Benefits

MADIP shows how existing public data can be combined to:
    • Inform policy development and better targeting of services, such as health and early childhood services, to people and communities who need them, and
    • Provide insight into the effectiveness of government programs to ensure they are delivering value to the Australian public.

Enduring Linkages to MADIP
Only data that are reasonably necessary for an approved purpose are shared and used in MADIP. This table describes the range of datasets that now have an enduring link to the MADIP data asset, which allows them to be updated for ongoing research.

Dataset Description

Australian Bureau of Statistics
Census of Population and Housing 2011
Census of Population and Housing 2016
Census data provides a rich snapshot of the nation and informs government, community and businesses. Current ABS policy restricts the linking of data from more than one Census. This means data from multiple Censuses cannot be brought together for longitudinal MADIP analysis. Exceptions are the Australian Census Longitudinal Dataset which brings together a 5% sample of Census data, and in future, the Census Time Capsule, available from 2100. The ABS has committed to undertaking public consultation on any proposed changes to this policy.
National Health Survey (2014-15, 2017-18)
Information on Australians’ health and wellbeing, such as medical conditions, health and lifestyle risk factors, mental health and use of health services.
Survey of Disability, Ageing, and Carers (2018)
Information about people with a disability, older people (aged 65 and over) and their carers.
ABS Business Characteristics Survey (2005-06 to 2017-18) (see note 4)
An annual survey providing estimates of business use of information technology, innovation, and a broad range of other non-financial business characteristics.
BLADE Core Dataset (see note 3)
  • Indicative data items (2001-02 to 2018-19)
  • BLADE Locations data (2018-19)
  • Business Activity Statement (2001-02 to 2018-19)
  • Business Income Tax (2001-02 to 2017-18)
  • Pay As You Go (2001-02 to 2018-19)
An economic data tool combining tax, trade, and intellectual property information with ABS data to provide a better understanding of the Australian economy and business performance over time. A limited set of BLADE information is available in MADIP. For the full list of data in BLADE see the asset summary below.

Australian Taxation Office
Personal Income Tax data:
  • Client Register (2006-June 2020)
  • Payment Summary (2010-11 to 2019-20)
  • Income Tax Return (2010-11 to 2018-19)
Information about taxpayers’ occupation and income, employment payments and amounts withheld during a financial year, and all persons with a registered tax file number (TFN) for tax and superannuation purposes.
Single Touch Payroll (2020-2021)
Information about employees' salaries and wages, pay as you go (PAYG) withholding and superannuation as reported through the Single Touch Payroll system.
JobKeeper (2020-2021)
Information about JobKeeper subsidy payments to businesses affected by coronavirus (COVID-19).

Department of Health
Pharmaceutical Benefits Scheme (2006-June 2020)
Information about the use of prescription medications and services subsidised under the PBS.
Medicare Benefits Schedule (2006-June 2020)
Information on the usage of Medicare subsidised health care services.
Centralised Register of Medical Practitioners (Provider Directory) (2011-2016)
Information about registered medical practitioners, including specialties.
Australian Immunisation Register (AIR) (2020-2021)
Information about COVID-19 and other vaccination status, including core demographics.

Services Australia
Medicare Consumer Directory (2006-June 2020) (see note 1)
All persons with an active Medicare enrolment.

Department of Social Services
DOMINO Centrelink Administrative data (2009-June 2020) (see note 2)
Snapshots of characteristics of recipients of government payments such as Age Pension and Newstart Allowance.
Data Exchange (July 2015 - June 2018)
Program performance information that contains de-identified data on clients that receive social services, including their demographics and the services being delivered.

Department of Education, Skills and Employment
Australian Early Development Census (AEDC) (2009, 2012, 2015, 2018)
Measures how well children in their first year of full-time school are developing across five important domains.
Australian Apprenticeships Incentives Program (2006-2019)
Information on apprenticeships and traineeships, AAIP payments, and Trade Support Loan (TSL) payments.
Higher Education Information Management System Data (HEIMS) (2005-2019)
Information on higher education studies including course type, student status and study load.

National Disability Insurance Agency
National Disability Insurance Scheme (NDIS) data (June 2019)
Information on participants in the National Disability Insurance Scheme, including their primary disability and the services attained.

State and Territory Registrars of Births, Deaths, and Marriages
Death Registrations (2005-2019)
Information relating to registered death records from Australian States and Territories.

Department of Home Affairs
Migration data:
  • Visa information & Citizenship grants (2000-2018)
  • Client information (1984-2018)
  • NOM Traveller data (2004-2018)
  • Skilled Migration Points (2005-2019)
  • Adult Migrant English Program data (2003-2019)
Information about various migrant types, including permanent, skilled, temporary and other migrant programs, including their demographics and movement over time.




1. Previously known as the Medicare Enrolments Database
2. Previously known as Social Security and Related Information
3. The ABS has enabled some BLADE datasets, which hold information about employer characteristics, to be integrated with employee data in MADIP. BLADE Core data is available for request. Use of Business Income Tax data is limited to the creation of flags indicating types of businesses or for the creation of new categorical data items only.
4. Business Characteristics Survey data is available for request. Use of the data is limited to the creation of flags indicating types of businesses or for the creation of new categorical data items only.

Note: Other sources of data have been explored for inclusion in MADIP or for once-off linkage to MADIP data for specific projects. For a list of 2019 linkages to MADIP, please refer to MADIP data and legislation. For specific projects, please refer to the relevant categories in this publication.

Publications/Results

15/01/2020 - Microdata: Multi-Agency Data Integration Project Modular Product (cat. no. 1700.0)
The MADIP Modular Product, 2011-2016 relates to the resident Australian population between 1 January 2011 and 31 December 2016. It includes key demographic, social, healthcare, education, government payment and income information for this population over the period 2011-2016 and is organised into modules.

2019 - ‘Introducing the longitudinal MADIP and its role in understanding income dynamics in Australia’ (Biddle, N, Breunig, R, Markham, F & Wokker, C)

25/03/2019 - Microdata: Multi-Agency Data Integration Project, Australia (cat. no. 1700.0)
The MADIP Basic Longitudinal Extract, 2011-2016 (2011-2016 Cohorts) relates to the resident Australian population between 1 January 2011 and 31 December 2016. This microdata product includes key demographic, social, healthcare, education, government payment, and income information for this population over the period 2011-2016.

29/06/2018 - Microdata: Multi-Agency Data Integration Project, Australia (cat. no. 1700.0)
The MADIP Basic Longitudinal Extract, 2011-2016 (2011 Cohort) microdata product relates to the Australian population in 2011 and includes key demographic, social, healthcare, government payment, and income information for this population over the period 2011-2016.

12/12/2017 - MADIP Case Studies
This webpage presents four case studies highlighting the types of insights that can be gained by analysing MADIP data.

Privacy Impact Assessments

29/11/2019 – Update to 2018 Privacy Impact Assessment – 2019 MADIP PIA Update

04/04/2018 - Independent Privacy Impact Assessment – Multi-Agency Data Integration Project (MADIP)

28/08/2018 - Independent Privacy Impact Assessment - Linkage of National Health Survey 2014/15 with the Multi-Agency Data Integration Project (MADIP)




BUSINESS LONGITUDINAL ANALYSIS DATA ENVIRONMENT (BLADE)

Commencement date: 10/10/2013
Project status: Ongoing

Project Summary

The Business Longitudinal Analysis Data Environment (BLADE) combines tax data about businesses with information from ABS surveys and other administrative data sources to enable a better understanding of the Australian economy through statistical analysis.

Benefits

Authorised researchers can use BLADE to study how businesses fare over time and the factors that drive performance, innovation, job creation, competitiveness and productivity.


Data Custodian Dataset(s)

Australian Bureau of Statistics
Business Register extract (2001-02 to 2018-19)
Business Characteristics data (2005-06 to 2017-18)
Business Characteristics Survey: Management Capabilities Module variables (2015-16)
ABS Survey data: Business Expenditure on Research and Development (financial years 2004-05 to 2011-12, 2013-14, 2015-16, 2017-18)
ABS Survey data: Private Non-Profit Expenditure on Research and Development (financial years 2008-09, 2011-12, 2012-13, 2014-15, 2016-17)
ABS Survey of Research and Experimental Development, Government (financial years 2008-09, 2011-12, 2012-13, 2014-15, 2016-17, 2017-18)
ABS Survey Data: Economic Activity Survey (2006-07 to 2016-17)
Energy Water and Environment Survey (2008-09, 2011-12, 2017-18, 2018-19)

Australian Taxation Office
Business Activity Statement (2001-02 to 2018-19)
Business Income Tax (2001-02 to 2017-18)
Pay As You Go (2001-02 to 2018-19)

Intellectual Property Australia
Intellectual Property Longitudinal Research Data (2001-02 to 2017-18)

Department of Home Affairs
Merchandise Exports data (2003-04 to 2018-19)
Merchandise Imports data (2003-04 to 2018-19)


Publications/Results

30/04/2020 - Microdata: Business Longitudinal Analysis Data Environment, Businesses in Australia 2018-19 (cat. no. 8178.0)
TableBuilder product that enables counts, by various characteristics, of businesses that were active in the Australian economy in the 2018-19 financial year.

05/04/2019 - Research Paper: Approaches to Analysing Micro-Drivers of Aggregate Productivity, March 2019 (cat. no. 1351.0.55.164)
This research used BLADE data to explore methods of estimating firm contributions to aggregate productivity.

16/06/2016 - Data Integration Case Study - BLADE

10/07/2015 - Information Paper: Construction of the Expanded Analytical Business Longitudinal Database, 2001-02 to 2012-13 (cat. no. 8171.0)
This paper provides a summary of the methodology used to create the EABLD (now known as BLADE), the initial set of data included and a description of potential uses.




LINKED EMPLOYER-EMPLOYEE DATABASE (LEED)

Commencement date: 12/11/2013
Project status: Ongoing

Project Summary

The Linked Employer-Employee Dataset (LEED) combines employer information and employee information into an enduring cross-sectional linked dataset. LEED capitalises on data already combined through the Business Longitudinal Analysis Data Environment (BLADE).
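A minimal sketch of what combining employer and employee information can look like is given below. All records, identifiers and field names are invented for illustration, and the two helper functions are hypothetical rather than part of any ABS system; the point is only that attaching employer characteristics to each job record lets one table answer questions about jobs, people and businesses.

```python
# Hypothetical employer records, standing in for business-level data.
employers = {
    "B1": {"industry": "Retail Trade"},
    "B2": {"industry": "Construction"},
}

# Hypothetical job records for one financial year: one row per person-employer pair.
jobs = [
    {"person_id": "P001", "business_id": "B1", "income": 42000},
    {"person_id": "P001", "business_id": "B2", "income": 8000},
    {"person_id": "P002", "business_id": "B2", "income": 65000},
]


def build_linked_jobs(jobs, employers):
    """Attach employer characteristics to each job whose employer is known."""
    return [
        {**job, **employers[job["business_id"]]}
        for job in jobs
        if job["business_id"] in employers
    ]


def jobs_per_person(linked_jobs):
    """Count how many linked jobs each person holds."""
    counts = {}
    for job in linked_jobs:
        counts[job["person_id"]] = counts.get(job["person_id"], 0) + 1
    return counts


linked_jobs = build_linked_jobs(jobs, employers)
multiple_job_holders = sorted(
    person for person, n in jobs_per_person(linked_jobs).items() if n > 1
)
print(multiple_job_holders)
```

Here the same linked table supports a person-level question (who holds more than one job) and could equally be grouped by `business_id` or `industry` for an employer-level view.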

LEED can provide unique insights into labour market issues from both a supply and demand perspective and is used to inform analysis of labour productivity, firm profitability, job creation, and the effects of policies and business practices on both employees and employers.

This project builds on the outcomes of the LEED prototype (2013) and subsequent feasibility study in 2015 which demonstrated the value of the dataset. In 2018, the ABS developed the current LEED as an enduring data asset.

Benefits

The LEED enables more detailed analysis of the Australian labour market than was previously possible - across the three lenses of jobs, persons and businesses.
    • Jobs - provides information on the jobs held in Australia each year (more than 19 million in 2016-17), including new information about the composition of jobs and about jobs in small geographic areas.
    • Persons - supports detailed statistical analysis on an annual basis, as well as providing new insights on employment (such as multiple job-holding).
    • Businesses - provides further insights into topics such as job creation and destruction as industries change over time, and elements that assist in exploring the drivers of firm-level performance.

Due to the rich detail of the underlying data, the LEED enables analysis at small geographic areas, such as 2,288 Statistical Areas across Australia, as well as at Local Government Areas and Electoral Divisions. In the 2019 publication of Jobs in Australia, the ABS released Spotlight articles for each of Australia's Local Government Areas, which highlight key statistics about jobs.

The LEED underpins both the Jobs in Australia and Personal Income in Australia publications. Outputs from the LEED are an essential input into the quarterly labour accounts - a core ABS offering.


Data Custodian Dataset(s)

Australian Bureau of Statistics
ABS Business Register (2011-12 to 2016-17)

Australian Taxation Office
Personal Income Tax (2011-12 to 2016-17)
Business Activity Statement (2011-12 to 2016-17)
Business Income Tax (2001-02 to 2016-17)
Pay As You Go (2001-02 to 2016-17)


Publications/Results

17/12/2019 - Personal Income in Australia, 2011-12 to 2016-17 (cat. no. 6624.0.55.002)
Formerly Estimates of Personal Income for Small Areas, this release provides a comprehensive range of income indicators across small geographic areas. This publication is now based on the LEED, ensuring better consistency with Jobs in Australia.

27/09/2019 - Microdata: Jobs in Australia, 2011-12 to 2016-17 (cat. no. 6160.0.00.001)
Release of Jobs in Australia data through TableBuilder. This enables users to build their own customised tables from the Linked Employer-Employee Dataset microdata, including for State and Commonwealth Electoral Divisions.

01/08/2019 - Jobs in Australia, 2011-12 to 2016-17 (cat. no. 6160.0)
Jobs in Australia (JIA) provides aggregate statistics from the recently developed Linked Employer-Employee Dataset. It provides information about filled jobs in Australia, the people who hold them, and their employers. JIA provides data across 2,288 Statistical Areas as well as Local Government Areas.

04/04/2019 - Microdata: Jobs in Australia, 2011-12 to 2015-16 (cat. no. 6160.0.00.001)
Release of Jobs in Australia data through TableBuilder. This enables users to build their own customised tables from the Linked Employer-Employee Dataset microdata, including for State and Commonwealth Electoral Divisions. Update for 2016-17 data forthcoming.

18/01/2016 - Microdata: Employee Earnings and Jobs, Australia, 2011-12 (cat. no. 6311.0.55.001)
Employee Earnings and Jobs (EEJ) microdata is a 10% weighted sample of the linked employer-employee data that the ABS developed through the LEED Foundation projects and is available through the Data Lab.

11/12/2015 - Information Paper: Construction of Experimental Statistics on Employee Earnings and Jobs from Administrative Data, Australia, 2011-12 (cat. no. 6311.0)
This paper provides the background to the LEED Foundation projects, a description of the data sources, integration methodology and confidentiality process, summary of results, and limitations of the experimental statistics.

14/05/2015 - Research Paper: Use of a Prototype Linked Employer-Employee Database to Describe Characteristics of Productive Firms, May 2015 (cat. no. 1351.0.55.055)
This study uses a prototype linked employer-employee database (LEED) to analyse both employee and firm characteristics to identify factors that explain differences in labour productivity across firms and industries.



AUSTRALIAN CENSUS LONGITUDINAL DATASET (ACLD)

Commencement date: 11/10/2013
Project status: Ongoing

Project Summary

The Australian Census Longitudinal Dataset (ACLD) combines a 5% sample of data from the 2006, 2011, and 2016 Censuses.

Benefits

Viewing the population over time highlights the longer-term effects of policies. It allows the study of changing patterns in social and economic conditions at the individual level, and gives insight into the pathways that tend to lead to particular outcomes and how these pathways vary for different population groups.

The ACLD has been used to investigate such questions as:
    • Were Australians who were unemployed in 2011 and had moved regions by 2016 more likely to be employed than those who had remained in the same area?
    • What are the characteristics of persons transitioning into aged care facilities? How do they compare to those who don’t go into aged care facilities?
    • What are recent school leavers up to?
    • How is family structure changing over time? What are the characteristics of persons who are transitioning into single parenthood?


Data Custodian Dataset(s)

Australian Bureau of Statistics
2016 Census of Population and Housing
2011 Census of Population and Housing
2006 Census of Population and Housing


Publications/Results

18/12/2019 - Australians' journeys through life: Stories from the Australian Census Longitudinal Dataset (cat. no. 2081.0)
This analytical release highlights various journeys through life for Australians. Areas of analysis include employment, education and housing transitions. The most recent release explores the characteristics and transitions from 2006 to 2016 for Australians approaching retirement age.

20/03/2019 - Microdata: Australian Census Longitudinal Dataset, ACLD (cat. no. 2080.0)
This release provides information about the ACLD microdata available in TableBuilder and DataLab. It includes the original 2006-11 dataset published in 2013. The 2011-16 dataset was most recently updated in March 2019 and the three-wave (2006-11-16) data was also released in March 2019.

20/03/2019 - Information Paper: Australian Census Longitudinal Dataset, Methodology and Quality Assessment, 2006-2016 (cat. no. 2080.5)
This publication provides in-depth information about the sampling and linking methodologies, and linkage results. It includes information about the original 2006-11 dataset published in 2013, the 2011-16 dataset most recently updated in March 2019, and the three-wave (2006-11-16) data released in March 2019.

26/02/2019 - Australians' journeys through life: Stories from the Australian Census Longitudinal Dataset (cat. no. 2081.0)
This analytical release highlights various journeys through life for Australians. Areas of analysis include employment, education and housing transitions. The current release focuses on a series of interactive maps which enable users to choose a particular region and explore, in depth, the employment status of people who moved out of that region and of those who stayed.

28/11/2014 - Census Data Enhancement project: An update, 2011 (cat. no. 2062.0)
This publication summarises the outcomes of the 2011 Census Data Enhancement (CDE) project, including the creation of the ACLD.

