Microdata: Multi-Agency Data Integration Project, MADIP

The MADIP Modular Product is a longitudinal data asset combining social, health, education, income and taxation data for the Australian population

Overview

The Multi-Agency Data Integration Project (MADIP) is a partnership among seven Australian Government agencies to combine longitudinal information on healthcare, education, government payments and personal income tax with population demographics (including the Census) to create a comprehensive picture of Australia over time.

More information about the project can be found on the Multi-Agency Data Integration Project page.

Approved researchers are able to use deidentified MADIP data in the secure ABS DataLab , to look at patterns and trends in the Australian population and provide new insights into the development and evaluation of government policies, programs and services.

Available products

The MADIP Modular Product, 2011-2019 is a longitudinal data asset combining social, health, education, income and taxation data for the Australian population. This product can be used to study how socioeconomic characteristics predict government service usage and changes over time. This product also allows for analysis of changes in social, health and economic outcomes for sub-populations such as Aboriginal and Torres Strait Islander peoples, young people, older Australians, welfare recipients, and regional communities.

Applying for access

Researchers affiliated with Australian Government or academic research organisations can apply to use the MADIP Modular Product in DataLab for in-depth analysis using a range of statistical software packages.

Information about the DataLab can be found on the About DataLab page.

To find out how to apply for access to the MADIP Modular Product in the DataLab, contact data.services@abs.gov.au.

Further information to assist users in understanding and accessing microdata is available from the Microdata Entry Page.

Support

A User Guide for the MADIP Modular Product is provided for approved researchers in the DataLab.

For additional information about and support for using the MADIP Modular Product, contact data.services@abs.gov.au.

For technical support in using the DataLab, contact microdata.access@abs.gov.au.

Further information

The MADIP Privacy Impact Assessment page provides further information about the independently conducted assessment of MADIP’s compliance with the Australian Privacy Principles and targeted stakeholder consultation.

Further information about ABS statistical data integration is available on the ABS Data Integration page.

MADIP Modular Product, 2011-2019

The Multi-Agency Data Integration Project (MADIP) Modular Product, 2011-2019 is the third MADIP longitudinal microdata product made available in the DataLab for approved researchers. The MADIP Modular Product has replaced the previously available MADIP Basic Longitudinal Extracts. 

An overview of the MADIP Modular Product can be found in the Data File and Structure section.

Data File and Structure

The MADIP Modular Product (2011-2019)  provides MADIP data as a set of separate files or ‘modules’ (e.g. a Personal Income Tax module, a Medicare Benefits Schedule module, a Geography module) from which users select based on their research need. The modules contain relevant person level information from each source. Only the modules needed for an approved project are provided to the researcher.

Modules in the MADIP Modular Product include:

  • Core Module - Demographic information derived from Census, Death Registrations, DOMINO Centrelink Administrative Data, Medicare Consumer Directory, and Personal Income Tax data
  • Geography Module - Geographic information derived from Census, DOMINO Centrelink Administrative Data, Medicare Consumer Directory, and Personal Income Tax data
  • Medicare Benefits Schedule (MBS), 2011 to June 2020 data;
  • Pharmaceutical Benefits Scheme (PBS), 2011 to June 2020 data;
  • DOMINO Centrelink Administrative Data (DOMINO), 2011 to June 2020;
  • Personal Income Tax (PIT), 2010-11 to 2017-18 data;
  • Apprentice and Trainee (AT), 2011-2019 data; 
  • Census of Population and Housing, 2011 (Census 2011) data;
  • Census of Population and Housing, 2016 (Census 2016) data; 
  • Higher Education, 2011-2019 data;
  • National Health Survey (NHS 2014-15 and NHS 2017-18) data, and
  • Survey of Disability, Aging, and Carers 2018 (SDAC 2018) data.

A complete list of data items for the DataLab can be accessed from the Data downloads section. These contain details for each data item including the full output classification and any special codes used.

Test files containing synthetically generated randomised data have been created based on the Modular Product and are available from the Data downloads section. These test files allow researchers to become familiar with a modules structure and prepare code prior to accessing the product through the DataLab.

Users intending to apply for access to the MADIP Modular Product should review the data item list and test files to check that the level of detail in the MADIP Modular Product meets their project's data requirements. Customised MADIP data products can be considered when more detailed data is required. The ABS is also an Accredited Integrating Authority, and additional data sources can be integrated with MADIP for approved projects and subject to agreement from data custodians. To discuss your data needs or enquire about data integration contact data.services@abs.gov.au.

Data downloads

Data files

Quality declaration

Institutional environment

General information about the institutional environment of the ABS; including the legislative obligations, financing and governance arrangements, and mechanisms for scrutiny of operations, can be found on the ABS Institutional Environment page.

Information about the ABS and its role as an Accredited Integrating Authority under the Commonwealth Data Integration Interim Arrangements can be found on the ABS Integrating Authority Accreditation page.

Further information about MADIP can be found on the Multi-Agency Data Integration Project.

The MADIP Modular Product is released in the secure ABS DataLab, and in accordance with the conditions specified in the Census and Statistics (Information Access and Release) Determination 2018 made under the Census and Statistics Act 1905. This ensures that confidentiality is maintained whilst enabling micro level data to be released.

Relevance

MADIP is a partnership among seven Australian Government agencies to combine longitudinal information on healthcare, education, government payments and personal income tax with population demographics to create a comprehensive social picture of Australia over time.

The MADIP Modular Product can be used to study how socioeconomic characteristics predict government service usage and changes over time. These datasets also allow for analysis of changes in social, health and economic outcomes for sub-populations, such as Aboriginal and Torres Strait Islander peoples, young people, older Australians welfare recipients, and regional communities.

Timeliness

The MADIP Modular Product 2011-2019, released in May 2021, relates to the resident Australian population between 1 January 2011 and 31 December 2019, with selected datasets to 30 June 2020. The specific time periods to which the different source of information in the product are defined in the Data File and Structure section.

The MADIP Modular Product is currently updated annually. 

Accuracy

All reasonable attempts have been made to ensure the accuracy of the MADIP Modular Product. However, users should consider the following limitations when interpreting analytical results from this product:

  • Differences in the scope of the source datasets - each administrative dataset has a different eligible population;
  • Differences in the purpose of collection - administrative source datasets contain data collected by Australian Government agencies for administrative purposes; only Census and survey data were collected for statistical purposes;
  • Differences in the way similar concepts are measured - for example, income information collected through tax returns may be defined differently from income collected in the Census, and the questions, form types, and timing of data collection are different.
  • The MADIP Modular Product is underpinned by data linkage methods using the Person Linkage Spine.
  • No additional editing, cleaning or imputation was conducted on each dataset beyond what was conducted on the source datasets by the data custodian agencies.

Additional information can be found in the User Guides for the MADIP Modular Product which is provided for approved researchers in the DataLab.

Coherence

Estimates derived from the MADIP Modular Product may differ to those derived from MADIP source datasets, or other similar sources. This is due to factors including:

  • The data linkage using the Person Linkage Spine that underpins the MADIP Modular Product.
  • Variability in population scope and reference periods between source datasets.
  • Differences in the purposes of the source datasets and the way similar concepts are measured.

Interpretability

A User Guide for the MADIP Modular Product is provided for approved researchers in the DataLab. For more information about and support for using the MADIP Modular Product contact data.services@abs.gov.au.

Accessibility

Researchers affiliated with Australian Government or academic research organisations can apply for access to use the MADIP Modular Product in DataLab for in-depth analysis using a range of statistical software packages.

Information about the DataLab can be found on the About DataLab page. To find out how to apply for access to the MADIP Modular Product in the DataLab, visit the Microdata Entry Page or contact data.services@abs.gov.au.

Previous catalogue number

This release previously used catalogue number 1700.0.