About
Randy Krum
President of InfoNewt.
Data Visualization and Infographic Design

Infographic Design

Infographics Design | Presentations
Consulting | Data Visualizations

DFW DataViz Meetup

Join the DFW Data Visualization and Infographics Meetup Group if you're in the Dallas/Fort Worth area!

Search the Cool Infographics site

Custom Search

Subscriptions:

 

Feedburner

The Cool Infographics® Gallery:

How to add the
Cool Infographics button to your:

Cool Infographics iOS icon

- iPhone
- iPad
- iPod Touch

 

Read on Flipboard for iPad and iPhone

Featured in the Tech & Science category

Flipboard icon

Twitter Feed
From the Bookstore

Caffeine Poster

The Caffeine Poster infographic

« Abby & Chris: Wedding Invitation | Main | Visual Customer Service in the Social Age »
Friday
Aug292014

False Visualizations: Sizing Circles in Infographics

Accuracy is the most important aspect of an infographic design!

Last week, the article The Truth about the Ice Bucket Challenge by Julia Belluz on Vox Media included the infographic, Where We Donate vs. Diseases That Kill Us, that used proportionally sized circles as its data visualization. The problem with this design is that the circle sizes don’t match the values shown. This is a false visualization and significantly over exaggerates the smaller amounts of money contributed to each charity and the deaths attributed to each cause.

This causes problems because readers often just look at the visuals without reading the actual numbers. They start with the assumption that a visualization accurately represents the data. The Vox Media story and infographic already have over 12,000 shares on Facebook, and this is a great case study for designers to understand how important it is to visualize data accurately.

As readers, we see the area of two-dimensional shapes on the page to represent the different values, but design software only allows width and height adjustments to size shapes. Designers make the mistake of adjusting the diameter of circles to match the data instead of the area, which incorrectly sizes the circles dramatically. It takes some geometry calculations in a spreadsheet to find the areas and then calculate the appropriate diameters for each circle. To demonstrate, I created this corrected version of the infographic.

False Visualizations: Sizing Circles in Infographics Revised

My Google Docs spreadsheet of the correct circle area and diameter calculations is available here.

Assuming this was a design mistake, and there was no intent to deceive the audience, this is a common mistake that many designers make.  So many designers, that I included an entire section on this topic in the Cool Infographics book to help designers understand how to size the area of circles.

I made one other improvement to the corrected design above by removing the color legend and listing the charities and causes of death right next to the appropriate circles. This makes the whole visualization easier for the audience to read by eliminating the need to look back-and-forth from the circles to the color legend to figure out what each circle represents.  Placing the text next to each circle keeps the information in the reader’s field of view which minimizes eye movement.

Sticking with the circles data visualization style, I wanted to take the design a little bit further. I would recommend one of two alternate improvements.  First, adding colored connecting lines is one way to make it easier for the audience to find the related circles in the columns sorted in descending order.

False Visualizations: Sizing Circles in Infographics Revised Lines

A second alternative would be to sort the lists to line up the related circles.  This makes it much easier for the audience to see the direct comparisons between charitable contributions and death rates related to the same cause.

False Visualizations: Sizing Circles in Infographics Revised Descending Sort

I’m passing over any discussion about whether using proportionally sized circles (a bubble chart) is the best visualization method for this data. If a designer makes the choice to use sized shapes, my point is that the data visualizations in the infographic must match the numbers using area.  David Mendoza published a good analysis worth reading and designed an alternative way to visualize the data in his article, This Bubble Chart Is Killing Me.

How else would you improve this design?

NOTE: I was able to contact the designer who created the infographic at Vox Media, and he had already realized his error after the infographic had been published. As I had guessed, he had mistakenly adjusted the diameter of the circles instead of the area. He told me that he’s working on updating the official infographic design in the article, but it hasn’t been published on the Vox Media site yet.


 

PrintView Printer Friendly Version

EmailEmail Article to Friend

Reader Comments (11)

so...I guess I wouldn't ordinarily say anything, but since this chart is being updated anyway, it might be helpful to note that "prostate" is spelled wrong....unless there is a disease that puts you face down on the ground. I did not proof the whole chart very closely, it just jumped out at me when I looked at the full size image. :)

Otherwise--thanks so much for the variations and for the correction of size representations. Makes it much more clear.
August 29, 2014 | Unregistered CommenterMiranda
nice articel. one question: is there a reason why the two top most circles are not the same size (diameter 2 vs. 2.5)?
August 29, 2014 | Unregistered Commenterchris
Great work! I may also add that the focus on fatalities is of course itself a distortion. A "fair" or "complete" graphic would include a third column that shows how many people are annually getting treated for (or diagnosed with) the respective illness… COPD and diabetes would probably jump really high, and so would breast and prostate cancer.
August 29, 2014 | Unregistered CommenterBernhard
Miranda - Nice Catch! I didn't even see that typo, and I just typed it in correctly in my revised version.

Chris - I'm just guessing, but I think the top two circles are different to reinforce two separate units of measure in the data. Dollars vs. Deaths. Avoids any impression that the top two circles are somehow equivalent.

Bernhard - You bring up one of many discussions about the data itself, which is why I focused on just visualizing the data they did include correctly. Why only the one major charity for each disease? The solution to Heart Disease is diet and exercise, so should it have equal funding to cancer? Suicide is not a disease, and has no prescription cure that can be researched unless you include depression.
August 29, 2014 | Registered CommenterRandy
this is very nice blog i really like it
August 30, 2014 | Unregistered Commenterfjackets online store
Accuracy is the most important aspect in a data visualization, but apparently spelling is not. Thanks to MIranda, I corrected the typo!
August 30, 2014 | Registered CommenterRandy
Sharing these common mistakes is really important because many people new to the practice of visualization do not have the proper education and experience. I wrote about another poor use of circles in this post http://decisionviz.wordpress.com/2014/05/08/example-of-big-data-company-ruining-visualization/
September 1, 2014 | Unregistered CommenterDecisionViz
I think you're just riding coattails of someone's virality. .. if this is a common problem, why not use / focus on some other (s)? I won't be sharing this, sorry...because I also don't fully agree, and think your sacrificing data representation for people too lazy to read and who'd rather make hasty assessments.
September 4, 2014 | Unregistered CommenterChris L.
This is a great study for designers to do your research before complaining too heavily. Why should area be any better? It is, but still not quite right. See "Flannery's compensation".
September 7, 2014 | Unregistered CommenterJim
This is a phenomenal article. Thank you for providing such an illustrative description, as well as your original data set. Unfortunately, I'm still unsure as to how you calculated the listed diameter values from the monetary values provided. How did you calculate those diameter values?

Thanks again for the invaluable insight!
June 16, 2016 | Unregistered CommenterNick D.
You, as the designer, choose the diameter for the first circle, and the rest of the diameters are calculated relative to that first "Master Circle". The units don't matter, so use whatever your design tool offers (inches, pixels, cm, etc.). I usually choose the circle diameter for the largest value first to fit within my design canvas, so I know that all of the circles will fit.

In the Google Docs spreadsheet, I chose a diameter of 2 inches to represent $257M, and the spreadsheet calculates the rest of the diameters based on the proportional differences in dollar amounts. Check the formulas.
June 17, 2016 | Registered CommenterRandy
Editor Permission Required
While we are migrating to a new site design, you must have editing permission for this entry in order to post comments.