Australian Bureau of Statistics

2901.0 - Census Dictionary, 2011
Latest issue released at 11:30 AM (Canberra time) 23/05/2011


Data processing

Completed Census forms are delivered to the Data Processing Centre (DPC) as soon as possible after Census Night. They are then put through a number of processes.

The first processing stage in the 2011 Census is precapture. During precapture, forms are checked to ensure that key fields have been completed and that any extraneous material has been removed. The forms are then prepared for data capture.

The data capture stage is the second stage of input processing. This stage encompasses a number of processes, including:
    • Scanning, which captures an image of each page of each form;
    • Intelligent Character Recognition (ICR), which converts any mark box or hand-written responses found on an image into computer-processable information;
    • Repair, which is a mixture of automatic and clerical processes aimed at correcting any data not confidently captured by ICR; and
    • Data Load, where captured data is stored, ready for the coding processes.
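
The hand-off from ICR to Repair can be pictured with a minimal sketch. It assumes that ICR attaches a confidence score to each captured field and that fields below some threshold are sent to Repair; the source does not describe how confidence is measured, so the threshold, the class name CapturedField and the function split_for_repair are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical threshold: the source does not say how "confidently captured"
# is measured or what cut-off is applied.
CONFIDENCE_THRESHOLD = 0.9


@dataclass
class CapturedField:
    name: str          # field on the form, e.g. "age"
    value: str         # text produced by character recognition
    confidence: float  # assumed recognition score in the range 0 to 1


def split_for_repair(fields, threshold=CONFIDENCE_THRESHOLD):
    """Separate fields ready for Data Load from those needing Repair."""
    accepted, needs_repair = [], []
    for field in fields:
        if field.confidence >= threshold:
            accepted.append(field)
        else:
            needs_repair.append(field)
    return accepted, needs_repair


# Illustrative data only; these are not real Census responses.
fields = [
    CapturedField("age", "34", 0.98),
    CapturedField("occupation", "nvrse", 0.41),
]
accepted, needs_repair = split_for_repair(fields)
```

In this sketch the "age" field would go straight to Data Load, while the low-confidence "occupation" field would be queued for automatic or clerical repair.
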
The third stage includes reconciling all dwellings and all persons within these dwellings, as well as some basic coding, and ensures that the final counts produced are within established benchmarks.

The fourth stage includes Automatic Coding and Computer Assisted Coding (CAC). All hand-written textual responses are examined automatically to see if a classification code can be allocated based on the response provided. Where a classification code cannot be allocated automatically, CAC is used to allocate the classification code.
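
The automatic-then-assisted flow described above can be sketched as a lookup with a fallback queue. This is an illustration under stated assumptions, not the ABS implementation: the index CODE_INDEX, its codes ("OCC-001", "OCC-002") and the function names are invented for the example, and exact matching on normalised text stands in for whatever matching rules the real system uses.

```python
# Hypothetical coding index with made-up codes; real Census classifications
# are far larger and are not reproduced here.
CODE_INDEX = {
    "registered nurse": "OCC-001",
    "primary school teacher": "OCC-002",
}


def normalise(response):
    """Lower-case the response and collapse runs of whitespace."""
    return " ".join(response.lower().split())


def auto_code(response, index=CODE_INDEX):
    """Return a classification code, or None if there is no automatic match."""
    return index.get(normalise(response))


def code_responses(responses):
    """Code what can be coded automatically; queue the rest for CAC."""
    coded, cac_queue = {}, []
    for response in responses:
        code = auto_code(response)
        if code is None:
            cac_queue.append(response)  # an operator allocates the code
        else:
            coded[response] = code
    return coded, cac_queue


coded, cac_queue = code_responses(["Registered  Nurse", "nurse (aged care)"])
```

Here the first response matches the index after normalisation and is coded automatically, while the second has no exact match and is left for Computer Assisted Coding.
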

Coding accuracy is checked continually through quality control. The editing process is largely automatic, with some fields being reset based on other responses on the same Census form. All Census data are extensively validated before they are released.

See also Data quality, Data release, Derivations and imputations, Intelligent Character Recognition (ICR).






Commonwealth of Australia 2014

Unless otherwise noted, content on this website is licensed under a Creative Commons Attribution 2.5 Australia Licence together with any terms, conditions and exclusions as set out in the website Copyright notice. For permission to do anything beyond the scope of this licence and copyright terms contact us.