Teaching:TUW - UE InfoVis WS 2008/09 - Gruppe 02 - Aufgabe 1 - Scatterplot: Difference between revisions

From InfoVis:Wiki
Jump to navigation Jump to search
No edit summary
mNo edit summary
Line 1: Line 1:
A scatterplot (also called a ''scatter chart'', ''scatter diagram'' or ''scatter graph'' [Wikipedia]) is a diagram in which the values of two variables are applied to the horizontal and vertical axes of a cartesian coordinate system. The resulting point in the graph represents one record from a data set. The distribution pattern of points from multiple records reveal the correlation between the selected variables in the data set. The scatterplot is not to be confused with the ''correlation plot'' [Information Technology Lab, NIST #2] which treats already adopted correlation coefficients of different data groups, while the term ''correlation diagram'' does not seem to be bound.
A scatterplot (also called a ''scatter chart'', ''scatter diagram'' or ''scatter graph'' [Wikipedia]) is a diagram in which the values of two variables are applied to the horizontal and vertical axes of a cartesian coordinate system. The resulting point in the graph represents one record from a data set. The distribution pattern of points from multiple records reveals the correlation among the selected variables in the data set. The scatterplot is not to be confused with the ''correlation plot'' [Information Technology Lab, NIST #2] which treats already adopted correlation coefficients in different data groups, while the term ''correlation diagram'' does not seem to be bound.


=Revealed Information=
=Revealed Information=


==Type of Correlation==
correlation patterns -> type of correlation
correlation patterns -> type of correlation
(regression line, regression "path", "scatterplot smoothing" [NetMBA])
(regression function, "scatterplot smoothing" [NetMBA])
sign, strength (TODO: add about figures with: perfect positive, strong tight negative, weak loose positive, no correlation, clusters)
sign, strength (TODO: add about figures with: perfect positive, strong tight negative, weak loose positive, no correlation, clusters)


==Density, Outlyers and Clusters==
density (-> cluster analysis) & outlyers
density (-> cluster analysis) & outlyers


=Scatterplots of higher dimensions=
=Scatterplots of Higher Dimensions=


Not necessarily two variables, higher dimensions displayed spacially or by point properties (color, size, shape)
Not necessarily two variables, higher dimensions displayed spacially or by point properties (color, size, shape)
TODO: add figure with colored 3D plot, sunflower plot [addictedtor.free.fr], [York University], jitter plot


=Treating Discrete Data=
=Treating Discrete Data=
Line 26: Line 30:
*NetMBA: http://www.netmba.com/statistics/plot/scatter/
*NetMBA: http://www.netmba.com/statistics/plot/scatter/
*ChartItNow: http://www.chartitnow.com/scatter%20diagram.html
*ChartItNow: http://www.chartitnow.com/scatter%20diagram.html
*addictedtor.free.fr: http://addictedtor.free.fr/graphiques/graphcode.php?graph=59
*York University: http://www.math.yorku.ca/SCS/sasmac/sunplot.html

Revision as of 18:37, 30 October 2008

A scatterplot (also called a scatter chart, scatter diagram or scatter graph [Wikipedia]) is a diagram in which the values of two variables are applied to the horizontal and vertical axes of a cartesian coordinate system. The resulting point in the graph represents one record from a data set. The distribution pattern of points from multiple records reveals the correlation among the selected variables in the data set. The scatterplot is not to be confused with the correlation plot [Information Technology Lab, NIST #2] which treats already adopted correlation coefficients in different data groups, while the term correlation diagram does not seem to be bound.

Revealed Information

Type of Correlation

correlation patterns -> type of correlation (regression function, "scatterplot smoothing" [NetMBA]) sign, strength (TODO: add about figures with: perfect positive, strong tight negative, weak loose positive, no correlation, clusters)

Density, Outlyers and Clusters

density (-> cluster analysis) & outlyers

Scatterplots of Higher Dimensions

Not necessarily two variables, higher dimensions displayed spacially or by point properties (color, size, shape)

TODO: add figure with colored 3D plot, sunflower plot [addictedtor.free.fr], [York University], jitter plot

Treating Discrete Data

[Wikipedia, DE]

References