About
Randy Krum
President of InfoNewt.
Data Visualization and Infographic Design

Infographic Design

Infographics Design | Presentations
Consulting | Data Visualizations

DFW DataViz Meetup

Join the DFW Data Visualization and Infographics Meetup Group if you're in the Dallas/Fort Worth area!

Search the Cool Infographics site

Custom Search

Subscriptions:

 

Feedburner

The Cool Infographics® Gallery:

How to add the
Cool Infographics button to your:

Cool Infographics iOS icon

- iPhone
- iPad
- iPod Touch

 

Read on Flipboard for iPad and iPhone

Featured in the Tech & Science category

Flipboard icon

Twitter Feed
From the Bookstore

Caffeine Poster

The Caffeine Poster infographic

« Infographic Posters for Data Lovers | Main | The Simpsons 20th: Comedy [Family] Tree »
Friday
Dec182009

Word Spectrums! The Online Infographic Battleground



On Chris Harrison's site, there are a number of graphics that he calls Word Spectrums.  More like a battleground, Chris is using the enormous amount of data from websites that has been made public by Google.  This is an advanced form of a word cloud that visualizes related words and their relative connections to the two topics.  (FYI, since this is based on raw Google data, foul language does appear in some of them).


Using Google's enormous bigram dataset, I produced a series of visualizations that explore word associations. Each visualization pits two primary terms against each other. Then, the use frequency of words that follow these two terms are analyzed. For example, "war memorial" occurs 531,205 times, while "peace memorial" occurs only 25,699. A position for each word is generated by looking at the ratio of the two frequencies. If they are equal, the word is placed in the middle of the scale. However, if there is a imbalance in the uses, the word is drawn towards the more frequently related term. This process is repeated for thousands of other word combinations, creating a spectrum of word associations. Font size is based on a inverse power function (uniquely set for each visualization, so you can't compare across pieces). Vertical positioning is random.


Chris has created and shared a number of different versions on the Word Spectrum page of his website, and you can see high-resolutions PDFs of each there.

Want to try your own?  Building on Chris' idea, Jeff Clark from Neoformix has created interactive Word Spectrums using either Twitter or News as the source that lets you enter your own terms to compete.  I especially like the idea of pitting two competing brands against one another.




PrintView Printer Friendly Version

EmailEmail Article to Friend

Reader Comments (1)

I like these! I think the power of words and their frequency is a great way to quickly display information in a visually appealing way.

I'm working on a little pet project that I hope to go live with in the next few weeks that convey news stories in a similar (but much less sophistated!) way.

Thanks for sharing this - I particularly like the word spectrum.

Jeff

December 18, 2009 | Unregistered CommenterDuffer
Editor Permission Required
While we are migrating to a new site design, you must have editing permission for this entry in order to post comments.