Microdata: Person Level Integrated Data Asset (PLIDA)

PLIDA, formerly MADIP, is a longitudinal data asset combining social, health, education, income and taxation data for the Australian population.

Release date and time
29/06/2018 11:30am AEST

Overview

The Person Level Integrated Data Asset (PLIDA) is a secure data asset combining information on health, education, government payments, income and taxation, employment, and population demographics (including Census) over time.

The ABS is trusted as the accredited Integrating Authority for PLIDA. The project is enabled through a partnership of agencies, which includes:

  • Australian Bureau of Statistics
  • Australian Taxation Office
  • Department of Education
  • Department of Health, Disability and Ageing
  • Department of Social Services
  • Services Australia
  • Department of Home Affairs

Approved researchers can use de-identified PLIDA data to look at patterns and trends in the Australian population and provide new insights into the development and evaluation of government policies, programs, and services.

PLIDA Modular Product

All access to PLIDA data is via the PLIDA Modular Product (PMP), a detailed microdata product available to approved researchers in the ABS DataLab. Researchers can request access to the PMP by completing the ABS DataLab project proposal for detailed and integrated microdata. Access is subject to approval by PLIDA data custodians. 

Data Files and Structure

PLIDA comprises a collection of datasets that have been linked together using the Person Linkage Spine.

The PMP is a collection of discrete modules corresponding to different datasets (for example, modules relating to Personal Income Tax, the Medicare Benefits Schedule and Higher Education) from which users select according to their research needs.  Modules are requested and approved individually. Only the modules needed for an approved project are provided to the researchers. 
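
As a rough illustration of how separately approved modules might be brought together in analysis, the sketch below joins two small in-memory tables on a shared linkage key. Everything in it is an assumption made for illustration: the key name person_id, the field names, the values, and the helper link_modules are hypothetical and do not reflect actual PMP file layouts or DataLab tooling.

```python
# Hypothetical illustration only: identifiers, fields and values are invented
# and do not reflect real PLIDA Modular Product files.
income_tax_module = [
    {"person_id": "P001", "taxable_income": 52000},
    {"person_id": "P002", "taxable_income": 87000},
]
higher_education_module = [
    {"person_id": "P002", "course_level": "Bachelor"},
    {"person_id": "P003", "course_level": "Masters"},
]


def link_modules(left, right, key="person_id"):
    """Inner-join two lists of records on a shared linkage key.

    Assumes at most one record per key in `right`; a later duplicate
    would silently replace an earlier one in the lookup.
    """
    right_by_key = {row[key]: row for row in right}
    linked = []
    for row in left:
        match = right_by_key.get(row[key])
        if match is not None:
            # Merge the two records into one combined record.
            linked.append({**row, **match})
    return linked


linked_records = link_modules(income_tax_module, higher_education_module)
```

Only people present in both modules appear in the result, which is one reason a project's population can shrink as more modules are combined.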

Some PMP modules can be linked with the Business Longitudinal Analysis Data Environment (BLADE) using certain person-to-business relationships, such as employee to employer.  

A complete list of PMP modules and data items can be accessed from the Data downloads section below. Most PMP modules are updated on a regular basis. The update schedule is determined by the data supply agreements between the ABS and data custodians and the scheduling of ABS data integration activities. 

PLIDA Core modules

The ABS has developed Core modules as part of the PMP to make PLIDA easier to use and reduce duplication of effort among PLIDA users. 

Core modules provide a streamlined way to access commonly used data items in PLIDA. They are pre-built datasets that consolidate common data items from multiple existing sources into a single, consistent set of derived data items. They simplify access to essential information and reduce the need for researchers to create these derived data items themselves. This approach supports: 

  • Consistency of key data items across research projects.
  • Faster insights: because common data transformations are performed centrally, researchers have more time to focus on their areas of expertise.
  • Data minimisation by reducing the need to request and access multiple source datasets to achieve similar outcomes. 

Current Core modules include: 

  • Core Demographics: Includes month and year of birth, sex or gender, and country of birth.
  • Core Locations: Provides geocoded location data for individuals.
  • Core Scoping: Helps define populations of interest for a given time period.
  • Core Relationships: Captures partner and parent-child relationships.
  • Core Income: Consolidates weekly and annual income information for individuals.

What’s included in the Core modules?

Each Core module contains one or more files with derived data items (for example, month and year of birth). These derived data items are created using a rules-based approach that consolidates information from multiple source datasets. 
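
The specific rules used to build each Core module are set out in its explanatory material, not on this page. As a minimal sketch of what a rules-based consolidation can look like in general, the example below takes the first non-missing value from a ranked list of sources and records which source supplied it. The source names, the priority order, the values, and the function derive_item are all hypothetical.

```python
# Hypothetical precedence rule: sources are consulted in this order.
# The names and ordering are invented, not the ABS derivation rules.
SOURCE_PRIORITY = ["source_a", "source_b", "source_c"]


def derive_item(values_by_source, priority=SOURCE_PRIORITY):
    """Return (value, source) for the highest-priority non-missing value."""
    for source in priority:
        value = values_by_source.get(source)
        if value is not None:
            return value, source
    # No source reported a value for this person.
    return None, None


# Invented month-and-year-of-birth values reported by each source.
records = {
    "P001": {"source_a": None, "source_b": "1984-03", "source_c": "1984-04"},
    "P002": {"source_a": "1990-11", "source_b": "1990-11"},
    "P003": {},
}

derived = {pid: derive_item(vals) for pid, vals in records.items()}
```

Keeping the contributing source alongside the derived value makes it possible to check later how much each source drives the result.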

To support flexibility, single source files are also included in Core modules. This allows researchers to apply their own derivation methods if preferred. 
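
A researcher working from the single source files might prefer a different rule to the one used for the combined table. The sketch below shows one possible alternative, assumed purely for illustration: accept a value only when a strict majority of the sources that report anything agree on it. The function derive_by_agreement and the sample values are hypothetical.

```python
from collections import Counter


def derive_by_agreement(values):
    """Return the value reported by a strict majority of non-missing sources.

    Returns None when no source reports a value or when no single value
    is reported by more than half of the reporting sources.
    """
    observed = [v for v in values if v is not None]
    if not observed:
        return None
    value, count = Counter(observed).most_common(1)[0]
    return value if count > len(observed) / 2 else None


# Invented values for one person, as reported by three separate sources.
birth_month = derive_by_agreement(["1984-03", "1984-03", "1984-04"])
```

Compared with a fixed source ranking, a rule like this leaves more records unresolved but does not favour any one source.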

Every Core module is accompanied by comprehensive explanatory material, including: 

  • Detailed descriptions of the derivation process.
  • Guidance on how to use the module effectively.
  • Information about known limitations and caveats about the data.

Illustration of PLIDA Core module structure: multiple source tables feed into a combined table, which is paired with individual source tables.

The image is a diagram in the form of an equation enclosed in a grey box. Inside the box, on the left of the equal sign there are the words “Core Module”. On the right of the equal sign there are the words “Combined table(s)”, a plus sign and the words “Single source tables”. The words “Source datasets in PLIDA” sit outside of the grey box.

Above the words “1. Combined table(s)” on the right of the equal sign, there are 4 colour-coded data tables representing PLIDA data. Three smaller tables labelled "Source datasets in PLIDA" sit outside the grey box and are made up of yellow, blue and red coloured columns. Corresponding yellow, blue and red arrows point downward from these source tables to a larger combined table inside the grey box. This combined table has multiple columns in different colours (yellow, blue and red) flowing directly from the original source dataset tables above, and a green column indicating where a yellow and a blue column from the source datasets have been combined to create a new, green column in the combined table. This represents data from the 3 source datasets being merged to create a new combined table in the Core module.

To the right of the combined table is a plus sign, followed by a stack of three individual tables in the source dataset colours of yellow, blue and red. These are labelled "2. Single source tables" and represent the specific data items from the source datasets that were used to build the combined table.

Core modules are available as part of the PMP in the ABS DataLab environment. Researchers can request Core modules via myDATA as part of their PLIDA project proposal. 

Each DataLab project is granted access to the appropriate edition of Core modules based on the Census data included in the project. 

Privacy and Re-identification Risk Management 

Core modules are developed from existing PLIDA datasets that are already available to approved users in DataLab. The ABS applies the same rigorous privacy protections to Core modules as it does to all PLIDA datasets. This includes: 

  • Safe Data Risk Assessments for each dataset.
  • Review by the ABS Disclosure Review Committee, where appropriate.
  • Strict adherence to the Five Safes Framework and the Separation Principle. 

The 2024–25 PLIDA Privacy Impact Assessment update confirmed that Core modules do not increase re-identification risk when used within the existing PLIDA governance framework. 

Applying for access

Researchers affiliated with Australian Government or academic research organisations can apply to use the PLIDA Modular Product in the DataLab for in-depth analysis using a range of statistical software packages.

Information about the DataLab can be found on the About DataLab page.

To find out how to apply for access to the PLIDA Modular Product in the DataLab, email info@mydata.abs.gov.au.

Further information to assist users in understanding and accessing microdata is available from the Microdata Entry Page.

Support

A User Guide for the PLIDA Modular Product is provided for approved researchers in the DataLab.

For additional information about and support for using the PLIDA Modular Product, or for technical support using the DataLab, email info@mydata.abs.gov.au.

Further information

The PLIDA/MADIP Privacy Impact Assessments page provides further information about the independently conducted assessment of PLIDA, including compliance with the Australian Privacy Principles and targeted stakeholder consultation.

Further information about ABS statistical data integration is available on the ABS Data Integration page.

Data downloads

PLIDA: Data Item List

Previous catalogue number

This release previously used catalogue number 1700.0.
