Teaching:TUW - UE InfoVis WS 2008/09 - Gruppe 02 - Aufgabe 1 - Scatterplot: Difference between revisions

From InfoVis:Wiki

Revision as of 18:16, 30 October 2008

A scatterplot (also called a scatter chart, scatter diagram or scatter graph [Wikipedia]) is a diagram in which the values of two variables are plotted along the horizontal and vertical axes of a Cartesian coordinate system. Each resulting point in the graph represents one record from a data set. The distribution pattern of the points from multiple records reveals the correlation between the selected variables in the data set. The scatterplot is not to be confused with the correlation plot [Information Technology Lab, NIST #2], which displays already computed correlation coefficients of different data groups; the term correlation diagram, by contrast, does not seem to have a fixed meaning.

Revealed Information

correlation patterns -> type of correlation (regression line, regression "path", "scatterplot smoothing" [NetMBA]); sign and strength of the correlation (TODO: add figures showing: perfect positive, strong tight negative, weak loose positive, no correlation, clusters)
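
The sign and strength of a linear correlation, and the regression line drawn through the points, can be illustrated numerically. The following is a minimal sketch, assuming the Pearson coefficient as the measure of strength and an ordinary least-squares fit for the line; the function names and the sample records are hypothetical and do not come from the cited sources.

```python
import math


def _sums(xs, ys):
    """Return mean_x, mean_y and the centered sums sxy, sxx, syy."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return mean_x, mean_y, sxy, sxx, syy


def pearson_r(xs, ys):
    """Pearson correlation: the sign gives the direction, |r| the strength."""
    _, _, sxy, sxx, syy = _sums(xs, ys)
    return sxy / math.sqrt(sxx * syy)


def regression_line(xs, ys):
    """Least-squares line y = slope * x + intercept."""
    mean_x, mean_y, sxy, sxx, _ = _sums(xs, ys)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical records, one (x, y) pair per record.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys_pos = [2.0, 4.0, 6.0, 8.0, 10.0]   # perfect positive correlation
ys_neg = [10.0, 8.1, 5.9, 4.0, 2.0]   # strong, tight negative correlation

r_pos = pearson_r(xs, ys_pos)
r_neg = pearson_r(xs, ys_neg)
slope, intercept = regression_line(xs, ys_pos)
```

A coefficient close to +1 or -1 corresponds to a tight point cloud along a rising or falling line, while a value near 0 corresponds to a plot with no visible linear trend. The coefficient only captures linear relationships, so a curved "path" still has to be read from the plot itself.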

density (-> cluster analysis) & outliers
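
One simple way to make density and outliers explicit is to count points per grid cell. This is only a rough sketch under stated assumptions: the grid binning, the cell size and the `min_count` threshold are illustrative choices, not a method named in the sources, and the sample points are made up.

```python
from collections import Counter


def cell_of(point, cell_size):
    """Return the integer grid cell that contains the point."""
    x, y = point
    return (int(x // cell_size), int(y // cell_size))


def density_grid(points, cell_size):
    """Count how many points fall into each grid cell."""
    return Counter(cell_of(p, cell_size) for p in points)


def sparse_points(points, cell_size, min_count):
    """Return points lying in cells with fewer than min_count points."""
    grid = density_grid(points, cell_size)
    return [p for p in points if grid[cell_of(p, cell_size)] < min_count]


# Hypothetical data: four points close together and one isolated point.
points = [(1.0, 1.2), (1.3, 1.1), (1.1, 1.4), (1.4, 1.3), (8.5, 0.5)]
grid = density_grid(points, cell_size=2.0)
candidates = sparse_points(points, cell_size=2.0, min_count=2)
```

Cells with high counts hint at clusters, and points in nearly empty cells are outlier candidates. The result depends on the cell size, and a cluster that straddles a cell boundary may be split, so this is a starting point for inspection rather than a cluster analysis.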

Scatterplots of higher dimensions

Not necessarily limited to two variables: higher dimensions can be displayed spatially or through point properties (color, size, shape).
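
Mapping additional variables to point properties can be sketched as a small encoding step that turns a record into the description of one plotted mark. All field names (`weight`, `group`, `kind`), value ranges and lookup tables below are hypothetical and chosen only for illustration.

```python
# Hypothetical lookup tables for the categorical variables.
COLORS = {"A": "red", "B": "blue"}
SHAPES = {"measured": "circle", "estimated": "triangle"}


def scale(value, lo, hi, out_lo, out_hi):
    """Linearly map value from the range [lo, hi] to [out_lo, out_hi]."""
    t = (value - lo) / (hi - lo)
    return out_lo + t * (out_hi - out_lo)


def encode(record):
    """Turn a five-variable record into the properties of one point."""
    return {
        "x": record["x"],                      # 1st variable: horizontal position
        "y": record["y"],                      # 2nd variable: vertical position
        "size": scale(record["weight"], 0.0, 100.0, 4.0, 20.0),  # 3rd: size
        "color": COLORS[record["group"]],      # 4th variable: color
        "shape": SHAPES[record["kind"]],       # 5th variable: shape
    }


record = {"x": 3.2, "y": 7.5, "weight": 50.0, "group": "B", "kind": "estimated"}
mark = encode(record)
```

Position still carries the two main variables. Size suits a quantitative variable, while color and shape are used here for categorical ones.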

Treating Discrete Data

[Wikipedia, DE]

References