Teaching:TUW - UE InfoVis WS 2007/08 - Gruppe 07 - Aufgabe 1 - Histogram

From InfoVis:Wiki
Jump to navigation Jump to search

Definitions[edit]

In statistics, a histogram is a graphical display of tabulated frequencies. A histogram is the graphical version of a table that shows what proportion of cases fall into each of several or many specified categories. The histogram differs from a bar chart in that it is the area of the bar that denotes the value, not the height, a crucial distinction when the categories are not of uniform width.
[Lancaster, 1974]


A histogram is used when we want to show frequencies of a continous variable. The continous variable can, of course, assume all values within an interval and the histogram reflects this by covering the whole of the interval concerned.
[Wallgreen et al., 1996]


Explanation[edit]

For drawing the histogram, the numerical values are first put in ascending order. Then the data field is split into j classes. Each class has a defined width (class interval) - normally it is 1, but in special cases it is useful to choose a width > 1 - e.g. for displaying frequencies concerning distinct age groups. Under "Examples" we consider a histogram of varying class intervals.

Example[edit]

As an example we consider values given from the table below to create a histogram (right image). The data derived from the first column of the table shows the class affiliation. According to the table values these classes are aligned along the x-axis of the histogram. The heigth of each histogram bar (frequency density) can be obtained by dividing the absolute frequenzy values (second column) by the corresponding class intervals. The resulting value of each class is then printed along the y-axis of the histogram. Once this is properly done, the histogram shows the amount of frequency according to any of the classes. Hence the histogram is a powerful visualisation that can make information accessable in just a few seconds, considering that the table on the left side is representing the same information, it is easy to understand the power of graphical visualisation.

Frequency density (height of a column) = n j / w j
Frequency (area) = Width x Height

class j absolute frequency nj class interval w j
1 (=> 24 ≤ x ≤ 26 ) 36 3
2 (=> 26 < x ≤ 28 ) 37 2
3 (=> 28 < x ≤ 33 ) 17 5
Histogram Illustration

Related Links[edit]

References[edit]

  • [Lancaster, 1974] H.O. Lancaster, An Introduction to Medical Statistics. John Wiley and Sons. 1974. ISBN 0 471 51250-8
  • [Wallgreen et al., 1996] Anders Wallgreen, Britt Wallgreen, Rolf Persson, Ulf Jorner and Jan-Aage Haaland. Graphing Statistics & Data: Creating Better Charts. SAGE Publications, Thousand Oaks, London, New Delhi, 1996.
  • [Evans, 2004] Geraint Evans, Histograms. CensusAtScool. Retrieved at: November 08, 2007. http://www.censusatschool.ntu.ac.uk/files/histogram.pdf