TECHNICAL NOTE DATA QUALITY
1 When interpreting the results of a survey it is important to take into account factors that may affect the reliability of the estimates. Estimates in this publication are subject to both non-sampling and sampling errors.
2 Non-sampling errors may arise as a result of errors in the reporting, recording or processing of the data and can occur even if there is a complete enumeration of the population. These errors can be introduced through inadequacies in the questionnaire, treatment of non-response, inaccurate reporting by respondents, errors in the application of survey procedures, incorrect recording of answers and errors in data capture and processing.
3 The extent to which non-sampling error affects the results of the survey is difficult to measure. Every effort is made to reduce non-sampling error by careful design and testing of the questionnaire, efficient operating procedures and systems, and the use of appropriate methodology.
4 Some of the items collected in the Business Characteristics Survey are dynamic in nature and the concepts measured are subject to evolution and refinement over time; it is not possible to measure the impact of these changes on data quality.
5 While all attempts are made to ensure that the questions are unambiguous and not subject to misinterpretation, some of the concepts measured in the BCS require the provider to make a subjective judgement or assessment. It is not possible to accurately quantify the impact of these issues on data quality
6 The 2008-09 Business Characteristics Survey had a response rate of 94%.
7 The difference between estimates obtained from a sample of businesses, and the estimates that would have been produced if the information had been obtained from all businesses, is called sampling error. The expected magnitude of the sampling error associated with any estimate can be estimated from the sample results. One measure of sampling error is given by the standard error (SE), which indicates the degree to which an estimate may vary from the value that would have been obtained from a full enumeration (the 'true' figure). There are about two chances in three that a sample estimate differs from the true value by less than one standard error, and about nineteen chances in twenty that the difference will be less than two standard errors.
8 An example of the use of standard error can be demonstrated using the total proportion of businesses with any innovation. As presented in this release, the estimated proportion of innovation-active businesses was 39.8%. The standard error of this estimate was 0.9%. There would be about two chances in three that a full enumeration would have given a figure in the range 38.9% to 40.7%, and about nineteen chances in twenty that it would be in the range 38.0% to 41.6%.
9 In this publication, indications of sampling variability are measured by relative standard errors (RSEs). The relative standard error is a useful measure in that it provides an immediate indication of the percentage errors likely to have occurred due to sampling, and thus avoids the need to refer also to the size of the estimate. Relative standard errors are shown in the Relative Standard Error table in this section. RSEs for all data included in this release (including data cube content) are available upon request.
10 Rather than using the estimated proportion from the survey data, a value of 50% has been used in the calculation of the RSE to annotate proportion estimates. This avoids inconsistencies between the way very low and very high proportions are annotated. Relative standard errors for estimates in this publication have been calculated using the actual standard error and the survey estimate (referred to as x) in the following manner:
Using the previous example, the standard error for the estimated proportion of businesses which introduced innovation was 0.825%. Multiplied by 100 and then divided by 50 gives an RSE calculated on this basis of 1.65%. It is these figures that appear in the table appended to this chapter.
For the tables in this publication, estimates with RSEs between 10% and 25% are annotated with the symbol '^'. These estimates should be used with caution as they are subject to sampling variability too high for some purposes. Estimates with RSEs between 25% and 50% are annotated with the symbol '*', indicating that the estimates should be used with caution as they are subject to sampling variability too high for most practical purposes. Estimates with an RSE greater than 50% are annotated with the symbol '**', indicating that the sampling variability causes the estimates to be considered too unreliable for general use.
For estimates of proportion the symbol '^' means that the estimate from full enumeration could lie more than a decile away so the estimate should be used with caution. For example a proportion estimate of 30% annotated with '^' means the full enumeration value could lie beyond the range 20% to 40%. The symbol '*' means the estimate from full enumeration could lie more than a quartile away and is subject to sampling variability too high for most practical purposes. A proportion estimate of 30% annotated with '*' means the full enumeration value could lie beyond the range 5% to 55%. Proportion estimates annotated with the symbol '**' have a sampling error that causes the estimates to be considered too unreliable for general use.
Readers of this release should note that most of the data have an RSE of less than 10%.
Relative Standard Error - Summary indicators of Innovation, 2008 - 09
200 or more persons
|Estimated number of businesses as at 30 June 2009 |
|Businesses with introduced innovation (innovating businesses) |
|Businesses with innovative activity which was: |
|still in development |
|Businesses with any innovative activity (innovation-active businesses) |