Teaching:TUW - UE InfoVis WS 2008/09 - Gruppe 02 - Aufgabe 1 - Scatterplot

From InfoVis:Wiki
Jump to navigation Jump to search

A scatterplot (also called a scatter chart, scatter diagram or scatter graph [Wikipedia]) is a diagram in which the values of two variables are applied to the horizontal and vertical axes of a cartesian coordinate system. The resulting point in the graph represents one record from a data set. The distribution pattern of points from multiple records reveals the correlation among the selected variables in the data set. The scatterplot is not to be confused with the correlation plot [Information Technology Lab, NIST #2] which treats already adopted correlation coefficients in different data groups, while the term correlation diagram does not seem to be bound.

Revealed Information

Type of Correlation

correlation patterns -> type of correlation (regression function, "scatterplot smoothing" [NetMBA]) sign, strength (TODO: add about figures with: perfect positive, strong tight negative, weak loose positive, no correlation, clusters) 4 figures from [University of Illinois]:

  • perfect positive
  • high negative
  • low positive
  • no correlation

1 screen from scatterplot tool [NLVM] with regression line

Density, Outlyers and Clusters

density (-> cluster analysis) & outlyers

  • 1 image for clusters
  • 1 with outlyer

Scatterplots of Higher Dimensions

Not necessarily two variables, higher dimensions displayed spacially or by point properties (color, size, shape)

TODO: add figure with colored 3D plot, sunflower plot [addictedtor.free.fr], [York University], jitter plot

Treating Discrete Data

[Wikipedia, DE]

References