Evaluating Visual Table Data Understanding: Difference between revisions

From InfoVis:Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
 
(13 intermediate revisions by 3 users not shown)
Line 6: Line 6:
== Short description ==
== Short description ==


Evaluation of how information visualization supports exploration for visual table data.
The authors present a controlled experiement investigating how the layout of visual table data affects the  
A controlled experiment designed to evaluate how the layout of table data affects the user understanding
understanding of the data and the exploration process. So this is not a information visualization technique per
and his exploration process. This experiment raised interesting problems from the design phase to the data
se but an analysis how much and in which ways different infovis techniques influence the understanding of the data
analysis. In this paper, we focus on evaluating how information
by the users. The authors first detail the tasks the users had to perform, describe the experiment procdeure and
visualization supports exploration for visual table data.
conclude with lessons learnt especially during the data collection and analysis phase.
We present a controlled experiment designed to evaluate how the layout of table data affects the user understanding
and his exploration process. This experiment raised interesting problems from the design phase to the data
analysis. A a task taxonomy, the experiment procedure and clues about data collection and analysis are presented. Concluding with lessons learnt from this experiment and the discussion of the format of future evaluation.


=== Controlled Experiment ===
=== Controlled Experiment ===
Line 28: Line 25:
=== Datasets ===
=== Datasets ===


Two real datasets where chosen for the experiment. The datasets had to be interesting and of reasonable size to keep the experiment subjects interested. One dataset was the university master´s grades and the second dataset was from the CIA World Factbook. This dataset consists of statistics of productivity, export, transportation for most of the countries in the world.
Two real datasets where chosen for the experiment. The datasets had to be interesting and of reasonable size to keep the experiment subjects motivated. One dataset was the universities master´s grades and the second dataset was from the CIA World Factbook. This dataset consists of statistics of productivity, export, transportation for most of the countries in the world.


== Tasks ==
== Tasks ==


The experiment subjects had to complete tasks of different difficulties. The datasets where divided in three difficulties. At a high difficulty level of the dataset the experiment subjects have interpret a lot of the data, while looking at easier parts of the dataset the readability of the datasets where tested.<br/>
The experiment subjects had to complete tasks of different difficulties. The datasets where divided in three difficulties. At a high difficulty level of the dataset the experiment subjects have interprated a lot of the data, while looking at easier parts of the dataset the readability of the datasets where tested.<br/>
The experiment consisted of the following tasks: <br/><br/>
The experiment consisted of the following tasks: <br/><br/>
*'''Low Level: on represenation  visual coding'''
*'''Low Level: on represenation  visual coding'''
Line 61: Line 58:
== Figures ==
== Figures ==
Pictures of the data sets in different sort orders.<br/>
Pictures of the data sets in different sort orders.<br/>
[[Image:VisualTables1.GIF|center|VisualTables]]
[[Image:VisualTables1.GIF|center|VisualTables]]
:::::::::::::::::::::::: [Henry et Fekete, 2006]
<br/><br/>
<br/><br/>
[[Image:VisualTables2.GIF|center|VisualTables]]
[[Image:VisualTables2.GIF|center|VisualTables]]
:::::::::::::::::::::::: [Henry et Fekete, 2006]
<br/><br/>
<br/><br/>
Task complexity<br/>
Task complexity<br/>
[[Image:VisualTables4.GIF|center|VisualTables]]
[[Image:VisualTables4.GIF|center|VisualTables]]
:::::::::::::::::::::::: [Henry et Fekete, 2006]
<br/><br/>
<br/><br/>


Line 93: Line 93:
context visualization for tabular information in Proceedings of the SIGCHI conference on Human factors in computing systems: celebrating interdependence ACM Press, Boston, Massachusetts, United States 1994 318-322
context visualization for tabular information in Proceedings of the SIGCHI conference on Human factors in computing systems: celebrating interdependence ACM Press, Boston, Massachusetts, United States 1994 318-322


*[[Schneiderman%2C_Ben|Ben Schneiderman]], The Eyes Have It: A Task by Data Type Taxonomy for Information Visualization. in IEEE Symposium on Visual Languages, (1996), p 336.
*[[Shneiderman%2C_Ben|Ben Shneiderman]], The Eyes Have It: A Task by Data Type Taxonomy for Information Visualization. in IEEE Symposium on Visual Languages, (1996), p 336.
 
[[Category:Techniques]]

Latest revision as of 11:42, 19 February 2008

Authors


Short description

The authors present a controlled experiement investigating how the layout of visual table data affects the understanding of the data and the exploration process. So this is not a information visualization technique per se but an analysis how much and in which ways different infovis techniques influence the understanding of the data by the users. The authors first detail the tasks the users had to perform, describe the experiment procdeure and conclude with lessons learnt especially during the data collection and analysis phase.

Controlled Experiment

In the experiment three table ordering algorithms commonly used to reorder rows and columns of table data where used. The first order is alphabetic (A) and considered as a control order. The two others (C) and (T) are issued from the field of bioinformatics. (C) is a hierarchical clustering followed by a linearization. (T) is based on the traveling salesman problem. They are used to reorder DNA microarrays (visual table data presenting gene expressions).

Datasets

Two real datasets where chosen for the experiment. The datasets had to be interesting and of reasonable size to keep the experiment subjects motivated. One dataset was the universities master´s grades and the second dataset was from the CIA World Factbook. This dataset consists of statistics of productivity, export, transportation for most of the countries in the world.

Tasks

The experiment subjects had to complete tasks of different difficulties. The datasets where divided in three difficulties. At a high difficulty level of the dataset the experiment subjects have interprated a lot of the data, while looking at easier parts of the dataset the readability of the datasets where tested.
The experiment consisted of the following tasks:

  • Low Level: on represenation visual coding

Readability:
Find a specified student or course
Give the grade for a given student and course
Find how many students followed a given course

Interpretation:
Find the course where students performs better/worse
Find the student with the highest/lowest grades

  • Medium Level: on groups and outliers

Readability:
Circle a group
Give it a representative element of the group
Circle outliers

Interpretation:
Give it a meaningful name
Give a representative element of the group
Explain why the are outliers


  • High Level: on correlations and trends

Readability:
Give two Courses correlated/uncorrelated

Interpretation:
Give two Courses correlated/uncorrelated
Give the main trends of this master (in terms of topics)
Explain how students/courses have been ordered

Figures

Pictures of the data sets in different sort orders.

VisualTables
VisualTables
[Henry et Fekete, 2006]



VisualTables
VisualTables
[Henry et Fekete, 2006]



Task complexity

VisualTables
VisualTables
[Henry et Fekete, 2006]



Lessons Learned

  • Do not use real datasets
  • Do not ask too much
  • Do not exclude free comments or open questions
  • Do not leave too much freedom
  • Do not underestimate training

Important Citations

Main challenges of Infosky:

We chose two real datasets in order to conduct this experiment in a realistic context.
[Fekete et Henry, 2006]



We encountered problems to create relevant questionnaires for real datasets.
[Fekete et Henry, 2006]



Analyzing free comments, explanations or open questions is a nightmare.
[Fekete et Henry, 2006]


Reference articles

  • Ramana Rao, Stuart Card. The table lens: merging graphical and symbolic representations in an interactive focus +

context visualization for tabular information in Proceedings of the SIGCHI conference on Human factors in computing systems: celebrating interdependence ACM Press, Boston, Massachusetts, United States 1994 318-322

  • Ben Shneiderman, The Eyes Have It: A Task by Data Type Taxonomy for Information Visualization. in IEEE Symposium on Visual Languages, (1996), p 336.