Australian Bureau of Statistics

Rate the ABS website
ABS @ Facebook ABS @ Twitter ABS RSS ABS Email notification service
Education Services

Education Services homepage

Teacher Statistical Literacy

Back to Education Services home page

Concepts and definitions

Click on the triangles to open a section. The table below is a list of the concepts covered in each section.

    Hide details for StatisticsStatistics

    Statistics are numerical data that have been organised to serve a useful purpose. A major role of the ABS is to provide the Australian community with statistics that will help them make informed decisions. Statistical information provided by the ABS is used widely in Australia by governments, business people, researchers, members of the public, teachers and students.

    Data are observations or facts which, when collected, organised and evaluated, become information or knowledge.

    Data item
    A data item is the smallest piece of information that can be obtained from a survey or census.

    A dataset is data collected for a particular study. A dataset represents a collection of elements; and for each element, information on one or more characteristics is included.

    An outlier is an extreme value of the data. It is an observation value that is significantly different from the rest of the data. There may be more than one outlier in a set of data.
    Sometimes, outliers are significant pieces of data and should not be ignored. In other instances, they occur as a result of an error or misinformation and should be ignored. The decision to include or exclude an outlier needs to be clearly justified when discussing results.

    The weights (in kilograms) of 30 students were measured and recorded in the stem and leaf plot shown in Figure 1. In this case, the stem is the whole number values and the leaves are the decimal values. The outliers are 56.3 and 67.7.

    Stem Leaf

    584 4 9
    590 0 2 3 8
    600 2 4 5 7 8 9
    611 2 4 4 5 6 7 9 9
    621 2 3 7

    Fig 1 Stem and leaf plot

    Hide details for VariablesVariables

    A variable is any measurable characteristic or attribute that can have different values for different subjects. Height, age, amount of income, country of birth, grades obtained at school and type of housing are examples of variables.

    An observation is a single piece of data about a variable

    Independent variable
    An independent variable is the variable whose values are independent of changes in the values of other variables. It its the variable deliberately controlled or changed to assess changes in the dependent variable.

    Dependent variable
    A dependent variable depends on the independent variable.

    Categorical variables
    Nominal variable
    A nominal variable describes a name or category. For example, for the variable 'method of travel to school' all its values are words such as bus, walk, car and tram. Nominal variables are often referred to as categorical variables.

    Ordinal variable
    An ordinal variable is a number that represents a category. For example, postcodes and school year levels.

    Numerical variables
    A numerical variable is one that describes a numerically measured value. Numerical variables can be either discrete or continuous.

    Continuous variable
    A continuous variable is a numeric variable that can take any value within a certain range. For example, distance, age and temperature are continuous variables.

    Discrete variable
    A discrete variable can only take a finite number of values within a certain range. An example of a discrete variable is the number of children in a family – a family can have 0,1,2 or 3 children but not 2.5.

    Class interval
    A class interval is a group of data values for a variable. The intervals are generally the same size – for example, 4-6, 7-9 and 10-12. However, the intervals may have different sizes such as 4-6, 7-9 and 10-14. The boundaries of class intervals must not overlap so that each observation can be allocated to only one interval.

    Show details for SamplingSampling
    Hide details for Frequency and distributionFrequency and distribution

    The frequency (f) of a particular observation is the number of times the observation occurs in that data.

    Cumulative frequency
    Cumulative frequency is the total of a frequency and all frequencies below it in a frequency distribution. It is the running total of frequencies.

    Relative frequency
    Relative frequency is another term for proportion. It is the number of times a particular observations occurs divided by the total number of observations.

    The distribution of a variable is the pattern of values of the observations.

    Show details for Graphs and displaysGraphs and displays
    Show details for Summary statisticsSummary statistics

    List of items in each category

    Commonwealth of Australia 2008

    Unless otherwise noted, content on this website is licensed under a Creative Commons Attribution 2.5 Australia Licence together with any terms, conditions and exclusions as set out in the website Copyright notice. For permission to do anything beyond the scope of this licence and copyright terms contact us.