4602.0 - Environmental Issues: People's Views and Practices, Mar 2006  
ARCHIVED ISSUE Released at 11:30 AM (CANBERRA TIME) 21/11/2006   
   Page tools: Print Print Page Print all pages in this productPrint All



TECHNICAL NOTE DATA QUALITY


INTRODUCTION

1 The estimation process for this survey ensures that estimates of persons and households calibrate exactly to independently produced population totals at broad levels. The known population totals, commonly referred to as 'benchmarks', are produced according to the scope of the survey. The same is true for estimates of persons and households produced in this survey. However, in these cases the person and household benchmarks are actually estimates themselves and not strictly known population totals.


2 Since this survey was last conducted, the process for producing person and household benchmarks has been refined. Whilst this process is still under review, it represents a significant improvement to the previous method and person and household benchmarks produced using the new method are considered sufficient quality for use in household survey estimation. In addition, measures of the variability in person and household benchmarks have been incorporated into household estimates for the first time. These changes may result in unexpected movements in total households (at some broad levels) due to revised benchmark methodology.


3 A paper entitled A Revised Method for Estimating the Number of Households in Australia (cat. no. 3107.055.007), describing these issues in detail is currently being developed and is due for release in early 2007.



RELIABILITY OF ESTIMATES

4 Since the estimates in this publication are based on information obtained from occupants of a sample of dwellings, they are subject to sampling variability. That is, they may differ from those estimates that would have been produced if all dwellings had been included in the survey. One measure of the likely difference is given by the standard error (SE), which indicates the extent to which an estimate might have varied by chance because only a sample of dwellings was included. There are about 2 chances in 3 (67%) that a sample estimate will differ by less than one SE from the number that would have been obtained if all dwellings had been included, and about 19 chances in 20 (95%) that the difference will be less than two SEs. Another measure of the likely difference is the relative standard error (RSE), which is obtained by expressing the SE as a percentage of the estimate.


5 Due to space limitations, it is impractical to print the SE of each estimate in the publication. Instead, a table of SEs is provided to enable readers to determine the SE for an estimate from the size of that estimate (see table T1). The SE table is derived from a mathematical model, referred to as the 'SE model', which is created using the data collected in this survey. It should be noted that the SE model only gives an approximate value for the SE for any particular estimate, since there is some minor variation between SEs for different estimates of the same size.


6 This publication contains estimates for persons and households. Table T1 gives SEs for estimates of households, while SEs for estimates of persons are presented in T2. Tables containing estimates of households are found in Chapters 2 and 3, while Chapter 4 contains estimates of persons.



CALCULATION OF STANDARD ERROR

7 An example of the calculation and the use of SEs in relation to estimates of persons is as follows. Table 4.1 shows that the estimated number of persons aged 18 years and over in New South Wales who do not travel to work or study was 211,800. Since this estimate is between 200,000 and 300,000, table T2 shows that the SE for New South Wales will lie between 19,150 and 22,600 and can be approximated by interpolation using the following general formula:


Equation: RSE computation 06


8 Therefore, there are about 2 chances in 3 that the value that would have been produced if all persons in New South Wales had been included in the survey will fall within the range 192,200 to 231,400 and about 19 chances in 20 that the value will fall within the range 172,600 to 251,000. This example is illustrated in the diagram below.

Diagram: Calculation of Standard Error


9 Similarly, SEs are calculated for household level estimates using table T1 instead of table T2. For example, table 2.1 shows that the estimated number of households in Australia that neither recycle nor reuse any waste items included in the survey was 64,400. This estimate is between 50,000 and 100,000, so the SE for this estimate will be between 5,650 and 8,100. This can be approximated using the same interpolation formula as above, with the resulting SE being 6,400 (rounded to the nearest 100).

Diagram: Calculation of Standard Error


10 Therefore, there are about 2 chances in 3 that the value that would have been produced if all households in Australia had been included in the survey will fall within the range 58,000 and 70,800. And about 19 chances in 20 that the value will fall within 51,600 and 77,200.


11 In general, the size of the SE increases as the size of the estimate increases. Conversely, the RSE decreases as the size of the estimate increases. Very small estimates are thus subject to such high RSEs so that their value for most practical purposes is unreliable. In the tables in this publication, only estimates with RSEs of less than 25% are considered reliable for most purposes. Estimates with RSEs of 25% and greater are preceded by an asterisk (e.g. *2.1) to indicate they are subject to high SEs and should be used with caution.



PROPORTIONS AND PERCENTAGES

12 Proportions and percentages formed from the ratio of two estimates are also subject to sampling errors. The size of the error depends on the accuracy of both the numerator and the denominator. A formula to approximate the RSE of a proportion is given below. This formula is only valid when x is a subset of y.


Equation: RSE formula 06


13 For example, in table 4.1, the estimate for the total number of persons aged 18 years and over in New South Wales is 3,193,000. The estimated number of persons aged 18 years and over who do not travel to work or study was 211,800, so the proportion of persons aged 18 years and over in New South Wales who does not travel to work or study is 211,800/3,193,000 or 6.6%. The SE of the total number of persons aged 18 years and over in New South Wales may be calculated by interpolation as 46,815 or 46,800 rounded to the nearest 100. To convert this to a RSE we express the SE as a percentage of the estimate, or 46,800/3,193,000 = 1.5%. The SE for the number of persons aged 18 years and over in New South Wales who do not travel to work or study was calculated above as 19,600, which converted to a RSE is 19,600/ 211,800 = 9.3%. Applying the above formula, the RSE of the proportion is Equation: RSE percent calculation 06giving an SE for the proportion (6.6%) of 0.6 percentage point (i.e. 6.6 x .092).


14 Therefore, there are about 2 chances in 3 that the proportion of persons aged 18 years and over in New South Wales who do not travel to work or study is between 6.0% and 7.2% and 19 chances in 20 that the proportion is within the range 5.4% to 7.8%.



DIFFERENCES

15 Published estimates may also be used to calculate the difference between two survey estimates (of numbers or percentages). Such an estimate is subject to sampling error. The sampling error of the difference between two estimates depends on their SEs and the relationship (correlation) between them. An approximate SE of the difference between two estimates (x-y) may be calculated by the following formula:


Equation: SE formula 06


16 While this formula will only be exact for differences between separate and uncorrelated characteristics or sub populations, it is expected to provide a good approximation for all differences likely to be of interest in this publication.



NON-SAMPLING ERROR

17 The imprecision due to sampling variability, which is measured by the SE, should not be confused with inaccuracies that may occur because of imperfect reporting by respondents, errors made in collection such as in recording and coding data, and errors made in processing the data. Inaccuracies of this kind are referred to as non-sampling error, and they may occur in any enumeration, whether it be a full count or a sample. It is not possible to quantify non-sampling error, but every effort is made to reduce it to a minimum. This is done by careful design of questionnaires, intensive training and supervision of interviewers, and efficient operating procedures.

T1 Standard errors for household level estimates

NSW
Vic.
Qld.
SA
WA
Tas.
NT
ACT
Aust.
Size of estimate
no.
no.
no.
no.
no.
no.
no.
no.
no.

100
100
80
130
70
90
80
130
100
110
200
180
140
220
130
160
150
200
160
180
300
240
200
290
180
220
200
260
210
240
500
370
310
420
280
330
290
360
290
340
700
470
410
530
360
410
370
440
350
430
1,000
610
550
670
480
530
470
540
440
540
1,500
820
740
870
640
690
610
670
550
710
2,000
1 000
920
1 040
780
830
720
790
640
860
2,500
1 150
1 100
1 200
900
950
800
900
700
1 000
3,000
1 300
1 200
1 350
1 050
1 050
900
950
800
1 100
3,500
1 450
1 350
1 450
1 150
1 150
1 000
1 050
850
1 200
4,000
1 600
1 500
1 600
1 250
1 250
1 050
1 100
900
1 350
5,000
1 850
1 700
1 800
1 400
1 450
1 200
1 250
1 000
1 500
7,000
2 300
2 150
2 200
1 750
1 750
1 400
1 450
1 200
1 850
10,000
2 800
2 650
2 650
2 100
2 100
1 650
1 700
1 400
2 300
15,000
3 600
3 350
3 300
2 600
2 600
1 950
2 050
1 650
2 900
20,000
4 200
3 950
3 850
3 000
3 000
2 150
2 300
1 850
3 450
30,000
5 250
4 900
4 700
3 600
3 650
2 450
2 700
2 100
4 300
40,000
6 100
5 700
5 400
4 100
4 100
2 700
3 000
2 350
5 050
50,000
6 850
6 350
6 000
4 500
4 550
2 900
3 250
2 500
5 650
100,000
9 550
8 700
8 150
5 800
6 000
3 450
4 100
3 050
8 100
150,000
11 450
10 250
9 700
6 550
6 950
3 700
4 650
3 350
9 900
200,000
12 950
11 450
10 900
7 100
7 650
3 900
5 050
3 600
11 400
300,000
15 300
13 250
12 700
7 900
8 700
4 100
-
3 900
13 800
500,000
18 600
15 650
15 300
8 800
10 100
4 300
-
-
17 450
1,000,000
23 700
19 000
19 300
9 800
12 000
-
-
-
23 600
2,000,000
29 300
22 200
23 750
10 450
13 850
-
-
-
31 400
5,000,000
37 250
25 900
30 150
-
-
-
-
-
44 550
10,000,000
-
-
-
-
-
-
-
-
56 950

- nil or rounded to zero (including null cells)

T2 Standard errors for person level estimates

NSW
Vic.
Qld.
SA
WA
Tas.
NT
ACT
Aust.
Size of estimate
no.
no.
no.
no.
no.
no.
no.
no.
no.

100
140
130
170
90
100
70
60
90
180
200
250
240
290
180
190
150
140
170
300
300
360
340
400
260
280
220
220
240
400
500
530
510
590
400
420
340
370
370
570
700
690
660
750
530
550
450
500
480
720
1,000
900
860
960
700
720
590
680
620
910
1,500
1 210
1 160
1 250
930
970
790
930
810
1 190
2,000
1 480
1 420
1 510
1 140
1 190
960
1 140
970
1 430
2,500
1 750
1 650
1 750
1 350
1 400
1 100
1 300
1 100
1 650
3,000
1 950
1 850
1 950
1 500
1 550
1 250
1 450
1 250
1 850
3,500
2 150
2 050
2 150
1 650
1 700
1 350
1 600
1 350
2 050
4,000
2 350
2 250
2 300
1 800
1 900
1 450
1 750
1 450
2 200
5,000
2 750
2 600
2 650
2 050
2 150
1 650
1 950
1 600
2 550
7,000
3 400
3 200
3 200
2 500
2 650
2 000
2 300
1 900
3 100
10,000
4 200
4 000
3 900
3 000
3 200
2 350
2 650
2 200
3 800
15,000
5 350
5 000
4 850
3 700
4 000
2 750
3 000
2 550
4 800
20,000
6 250
5 850
5 600
4 200
4 600
3 100
3 250
2 800
5 600
30,000
7 800
7 250
6 800
5 000
5 600
3 500
3 500
3 150
7 000
40,000
9 100
8 400
7 800
5 600
6 350
3 800
3 650
3 400
8 150
50,000
10 150
9 350
8 600
6 100
6 950
4 000
3 700
3 550
9 150
100,000
14 150
12 800
11 550
7 650
9 050
4 550
3 700
3 950
12 950
150,000
16 950
15 150
13 500
8 550
10 350
4 750
3 550
4 100
15 750
200,000
19 150
16 950
15 000
9 150
11 300
4 850
3 350
4 150
18 050
300,000
22 600
19 650
17 250
9 900
12 650
4 850
-
4 150
21 700
500,000
27 350
23 350
20 300
10 700
14 250
4 750
-
-
27 100
1,000,000
34 600
28 650
24 650
11 450
16 100
-
-
-
36 050
2,000,000
42 500
34 000
29 100
11 650
17 500
-
-
-
47 150
5,000,000
53 350
40 600
34 600
-
-
-
-
-
65 300
10,000,000
-
-
-
-
-
-
-
-
81 800

- nil or rounded to zero (including null cells)