4324.0.55.004 - Microdata: Smoker Status, Australia, 2017-18 Quality Declaration 
Latest ISSUE Released at 11:30 AM (CANBERRA TIME) 04/02/2020  First Issue
   Page tools: Print Print Page Print all pages in this productPrint All

Using DataLab

The DataLab environment allows real time access to detailed microdata files from the 2017–18 Smoker Status pooled dataset through a portal to a secure ABS environment. The level of detail provided within the DataLab exceeds that previously provided in Expanded CURFs and provides researchers greater access to the data.

The DataLab is an interactive data analysis solution available for users to run advanced statistical analyses, for example, multiple regression and structural equation modelling. The DataLab environment contains up-to-date versions of SPSS, Stata, SAS and R analytical languages. Controls in the DataLab have been put in place to protect the identification of individuals and organisations. All output from DataLab sessions is cleared by an ABS officer before it is released.

For more information about the DataLab, please see the About the DataLab page.

DataLab test file

Two test files have been created to enable researchers and analysts to become more familiar with the data structure and prepare code and programs before applying for or beginning a DataLab session.

The test files do not contain real data, and cannot be used for analysis. They mimic the structure of the detailed microdata in that they have the same data items and allowed values.

A Person level and Household level test file are available as free downloads from the Downloads section.

Counts and weights

The following table shows the number of records on each level and the weighted counts, where applicable. The data includes persons aged 15 years and over.

Record counts
Weighted counts
(if applicable)

1. Household
2. Person

There are two weight variables on the detailed microdata files:
  • Household weight (NHIHHWT) - benchmarked to produce household estimates.
  • Person weight (NHIFINWT) - benchmarked to produce the total population aged 15 years and over.


Every record on each level of the file is uniquely identified. The identifiers ABSHIDD and ABSPID appear on both levels of the file.

Each household has a unique thirteen digit random identifier, ABSHIDD. This identifier appears on the household level and is repeated on each level on each record pertaining to that household. The combination of identifiers uniquely identifies a record at a particular level as shown below:

1. Household = ABSHIDD
2. Selected person = ABSHIDD + ABSPID

The household record identifier, ABSHIDD, assists with linking people from the same household, and also with household characteristics such as geography (located on the household level) to the person records.

Back to top of the page