Right menu

Featured resource


Home > Topdrawer > Statistics > Good teaching > Data reduction

Default object view. Click to create a custom template, Node ID: 13203, Object ID: 20885

Data reduction

Data reduction

Data reduction is the process of summarising data to characterise the data set succinctly in one or several numerical values.

A data set can be summarised with a measure of centre: the mean, median or mode.

The article on the AAMT website What is 'Typical' for Different Kinds of Data? uses data from Melbourne Cup winners to explore and explain what 'typical' might mean in regards to:

  • the mode
  • the median
  • the mean
  • both the median and mean
  • using numerical attributes within categorical data.

You can read about the use of measures of centre in context and in the media in What's Average?.

A measure of spread and clusters of data can be used to summarise a data set. In the middle years, spread is usually described with the range of the entire data set. The range of the middle half (50%) of the data is also useful.

Combining the contribution of the median and measures of spread is the box plot created from the five-number summary, which includes the median, maximum and minimum (range), and interquartile range (middle 50% of the data).

The presence of outliers in data sets may influence the measures that summarise a data set.

Yes

Yes

Name Class Section
Document Central tendency Folder 17
Document Box plots Folder 17
Document Influence of outliers Folder 17
Document Mean, median and mode Folder 17
Document Year 7: Calculate mean, median, mode and range for sets of data. Interpret these statistics in the context of data Infobox 3
Document Year 7: Describe and interpret data displays using median, mean and range Infobox 3
Document Source Infobox 3