Visual Exploration of Multivariate Graphs by Roll-Up and Selection: Difference between revisions

From InfoVis:Wiki
Jump to navigation Jump to search
No edit summary
(All revisions matched the spam blacklist (www.google.at/url?), blanking)
 
Line 1: Line 1:
[[Image:Wat_pivotgraph_overview.jpg|Rolled-up example diagram done by PivotGraph]]<br>Multivariate diagramm rolled up in two dimensions [Wattenberg, 2006]


[[Wattenberg, Martin|Martin Wattenberg]] [Wattenberg, 2006] described in his paper [http://www.google.at/url?sa=t&source=web&ct=res&cd=1&url=http%3A%2F%2Fwww.research.ibm.com%2Fvisual%2Fpapers%2Fpivotgraph.pdf&ei=BgMMSoHAEMmP_QbV--ywBA&usg=AFQjCNHXHWZWVxEM8tCSNzoUxTNH50hXRg&sig2=NGoK6wAVWEoPzeeHT_Dc0w Visual Exploration of Multivariate Graphs] how Multivariate Graphs can be easily analyzed by rolling-up on certain dimensions. The paper focuses on a software tool called PivotGraph (unfortunately not available) but describes a strategy that could also be used in other tools.
{{Definition|'''Multivariate Graphs''' are multi-dimensional: Each node in a graph is associated with several attributes. Common examples are social networks where the nodes might contain information about gender, age etc. or Markov chains where each node has a certain state. The paper mentioned above (as well as the software tool PivotGraph) focuses on graphs with discrete categorical dimensions in the nodes.
<br>[Wattenberg, 2006]}}
== Multivariate graphs - common approach ==
Two visualizations of multivariate graphs are very popular: "Node-and-link diagrams" and matrix views. "Node-and-link diagrams" represent a network topography, additional information may be coded by labeling and coloring nodes, matrix views (normally only 2D) create a main-matrix of one characteristic in which the sub-characteristic is repeated.
== Problems in analyzing multivariate graphs ==
Both visualizations - node-and-link diagram and matrix diagram allow only poor analysis of the information contained in the nodes. Easy questions like "How many men are in the network?" may require the analysist to count every node.
[[Image:Wat_pivotgraph_roll-up.jpg|thumb|Rolling up a network diagram]]
== Solution: Roll-up on one (two) dimension(s) ==
The main simplification in PivotGraph is to roll a graph up in any dimension(s): Replace all nodes with the same value in these dimensions with a node representing the amount of nodes by its size. Aggregate the connections in the same way.
This way you can select up to two dimensions that form a grid, e.g. gender and company. Instead of the large graph, you receive a simpler graph showing the aggregated connections between nodes that correspond to a certain gender and a certain company.
[[Image:Wat_pivotgraph_select.jpg|thumb|Selecting along a node attribute in a social network]]
== Solution: Selection ==
As addition to roll-up - parts of the diagram can be selected, e.g. all nodes representing females.
== Considerations ==
{{Quotation|Although the basic idea [...] is quite simple, it turns out there are a number of subtile challenges that need to be addressed for the visualization to remain legible.| [Wattenberg, 2006]}}
For optimal understanding, the size of the nodes reflects the amount of aggregated data, just as the thickness of the connections. The nodes may be colored according any other attribute, but the color of the lines should be used to indicate ratio of the weight (connections per node) .
Although, when using a straight grid layout as PivotGraph does, the connections should not be straight to avoid overlapping. Slightly bent arrows avoid occlusion problems.
[[Image:Wat_pivotgraph_singledim.jpg|thumb|Rolling up in one dimensions leads to a one-axis graph]]
The application allows the user also to roll-up in only one dimension. This leads to a graph with just one axis, nevertheless this layout can contain important information, e.g. the communication between several locations in a company.
An important part of the analysis is the user-interaction. The user may chose and the dimensions to roll up at any point, the diagram is automatically updated. User testing has shown, that this transition has to smoothly interpolate between the graphs, not only calculating the straight interpolation but maybe also intermediate graphs and displaying the transitions of the most important nodes and connections.
Wattenberg describes in his paper three user testings in which all user responded very positive on the transitions between different graphs.
[[Image:Wat_pivotgraph_ui.jpg|thumb|User Interface of PivotGraph]]
== PivotGraph user interface ==
The user interface is quite simple and basic to operate. The main window displays the current scatterplot and the dimensions, the right side allows the user to change the current view by rolling-up on attributes or selecting specific attribute-values. Nodes display additional information on mouse-over.
== Advantages and disadvantages ==
[[Image:pgnl.png|250px|Node-and-Link Diagram]] [[Image:pgmatrix.png|250px|Matrix View]] [[Image:pgpg.png|150px|Multivariate Graph]]<br>
The three figures above all represent the very same dataset, from left to right: Node-and-link diagram, matrix view and a multivariate graph generated by the PivotGraph application. Matrix views tend to offer the advantage of a comprehensive overview of all individual connections over node-and-link diagrams. However, to the unexperienced user they might happen to hide some information, as due to the emphasis on the first sort dimension (namely "gender") some connections can seem obscure. Therefore, the multivariate graph view is to be preferred when inspecting relations on two dimensions at once.
The multivariate graph technology also comes with some disadvantages: Roll-up and selection transformations might not allow certain aspects of graph topolgy to be preserved: Acyclic graphs might become cyclic, disconnected graphs might become connected for instance.  Therefore, matrix views might be more suitable for experienced users, multivariate graphs on the other hand might prove to be more readable to an unexperienced audience.
== Limitations and future work ==
The tool has some limitations that must be taken into consideration. First of all, the visualization is (except for node-coloring) limited to 2D. An extension to 3D would be possible, but this might cause severe occlusion problems and destroy the simplicity of the visualization.
Another limitation is that the technique works only on discrete-valued dimensions. Continuous values are more difficult to aggregate, this could lead to a large amount of connections.
A last limitation that may be addressed in future (just like the continuous data) is the representation of multivariate edges. A similar approach could be used to aggregate edges that contain additional information, however these graphs are less common in practice than graphs with multivariate nodes. 
== References ==
[http://www.google.at/url?sa=t&source=web&ct=res&cd=1&url=http%3A%2F%2Fwww.research.ibm.com%2Fvisual%2Fpapers%2Fpivotgraph.pdf&ei=BgMMSoHAEMmP_QbV--ywBA&usg=AFQjCNHXHWZWVxEM8tCSNzoUxTNH50hXRg&sig2=NGoK6wAVWEoPzeeHT_Dc0w Visual Exploration of Multivariate Graphs]
[[Category: Techniques]]
[[Category: Interaction_Techniques]]

Latest revision as of 14:03, 22 August 2013