Teaching:TUW - UE InfoVis WS 2009/10 - Gruppe 09 - Aufgabe 3
Jump to navigation
Jump to search
Aufgabenstellung
Zu verbessernde Grafik
Review of the Existing Graphic
- The current graphic contains many non-data-ink elements which should be reduced to no more that what is necessary to make the data ink understandable [Few, 2004]. Examples of such non-data-ink elements are
- The yellow/brown border around the real graphic
- Border lines
- Box around heading
- Shadow below heading and legend
- The main topic of the graph is to recognize the connection between the CO2 emissions (total and per capita) and the gross domestic product of countries. The region totals displayed in light gray in the background can be omitted, as this data is more distracting than useful.
- The data point labels distract the reader from the actual data as the black font color stands out too much. Furthermore, data points are categorized into several regions by using different colors, whereas the corresponding labels are always black, making the connection between the data point and its label difficult. Also, for very small data points, the color is unrecognizable, making the distinction between different regions impossible.
- The world map in the legend is also drawing attention away from the main data of the graph, a simple legend stating the different regions and their respective color would be enough. Also, the size of the map is too small to be actually useful.
- The main focus of the graph is the comparison between the CO2 emissions per capita and GDP per capita, so this information is represented as the 2d position of the data points, as position is the most accurate visual variable [Cleveland and McGill, 1984].
- Representing the total CO2 emissions for each country using the size of the data point helps understanding the impact of a particular country on the global CO2 emissions. Without this information, it would not be obvious that although Luxembourg has higher CO2 emissions per capita that the USA, its total CO2 emissions are far lower. The encoding of this information in the area of the marker, however, does not support comparison between countries with similar CO2 emissions or the extraction of absolute values.
- Encoding the population size using the font size of the country label adds another variable to the dataset. This information enables the viewer to make comparisons between different countries based on their population size. The graph shows that although China and India have roughly the same population size, India has much lower total CO2 emissions. The problem, however, is that the size of the label not only depends on the font size, but also on the length of the country name, thus making countries with longer names appear bigger. This makes the encoding unreliable.
Improvements
- All non-necessary non-data-ink has been removed, which includes
- The yellow border around the graph
- Border lines
- The box around the heading
- Shadows below the heading and the legend box
- The remaining non-data-ink has been de-emphasized by using a lighter gray for the grid lines, the axes, the axes labels and the legend. Furthermore, non-data-ink has been regularized by using consistent font sizes throughout the graph. Unnecessary bold text in the legend has also been removed.
- In addition to coloring the data points also their respective labels have been color coded. This makes it easier to associate the right labels with the data points. In addition, for small data points, coloring the label as well as the data point makes the distinction between different regions much easier. It also removes the distraction caused by the formerly all black labels as the labels now carry additional information themselves. This additional color coding immediately shows that the majority of African countries is located in the bottom left corner (low GDP, low CO2 emissions), the majority of Latin American countries is located in the center of the graph (average GDP, average CO2 emissions) and that almost all European countries are located on the top right corner of the graph (high GDP, high CO2 emissions), a fact that was not immediately visible in the original graph.
- The legend for region color coding has been simplified by replacing the world map by an alphabetical list of regions.
- The encoding of the total CO2 emissions per country using the area of the data point has been removed from the graph, because
- it is not obvious to the user if the emissions are represented in the diameter of the point or in its area
- comparison between countries with similar CO2 emissions is impossible
- no exact values can be extracted from the visualisation
- Also the different font sizes have been removed from the graph, as the size of the label not only depends on the actual font size, but also on the length of the label (i.e. the country name), making comparisons between different countries impossible.
- The top 10 countries emitting the most CO2 have been marked in the scatterplot, further details on these countries have been included in a second diagram.
- A second diagram showing information about the top 10 CO2 emission countries has been included. It shows a comparison of total CO2 emissions and population size using two bar graphs. This second graph immediately shows the discrepancy between China and the USA: both have roughly the same CO2 emissions, but the USA only have a quarter of the population of China.
References
[Cleveland and McGill, 1984] W. S. Cleveland and R. McGill. Graphical Perception: Theory, Experimentation, and Application to the Development of Graphical Methods. Journal of the American Statistical Association, 79:531-554, 1984
[Few, 2004] Stephen Few. Show Me the Numbers: Designing Tables and Graphs to Enlighten. Analytics Press, 2004, Chapter 7 – General Design For Communication
[Few, 2007] Stephen Few. Visualising Change: An Innovation in Time-Series Analysis. White Paper, SAS Institute, 2007
CO2 Emissions data set: http://manyeyes.alphaworks.ibm.com/manyeyes/datasets/carbon-dioxide-emissions-co2-thousan-2/versions/1
http://manyeyes.alphaworks.ibm.com/manyeyes/datasets/carbon-dioxide-emissions-co2-kg-co/versions/1