<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Insights</title>
  <subtitle>Things I figure out, often short and simple ones, but ones I think are interesting to a larger audience.</subtitle>
  <link rel="alternate" type="text/html" href="http://eagereyes.org/topics/Insights"/>
  <link rel="self" type="application/atom+xml" href="http://eagereyes.org/taxonomy/term/1/atom/feed"/>
  <id>http://eagereyes.org/taxonomy/term/1/atom/feed</id>
  <updated>2007-05-31T14:30:47-04:00</updated>
  <entry>
    <title>Caring about the Data</title>
    <link rel="alternate" type="text/html" href="http://eagereyes.org/blog/caring-about-the-data.html" />
    <id>http://eagereyes.org/blog/caring-about-the-data.html</id>
    <published>2007-09-04T23:07:15-04:00</published>
    <updated>2007-09-07T17:09:25-04:00</updated>
    <author>
      <name>Robert Kosara</name>
    </author>
    <category term="blog" />
    <category term="Insights" />
    <summary type="html"><![CDATA[<p>
<a href="/blog/caring-about-the-data.html"><img src="/media/attachments/titanicvisualizations.png" alt="Titanic visualizations" height="220" width="610" /></a>
</p>
<p>
When demonstrating <a href="/references/Kosara_TVCG_2006.html">Parallel Sets<sup>(ref)</sup></a> to guests and visitors, I often use the Titanic data set, because people can relate to it and it is entirely categorical. I like pointing out interesting facts the visualization shows (like that the second class was smaller than the first class), but it's really just a collection of numbers to show what the visualization can do. Some of the people I have shown this to feel different about it, though.     ]]></summary>
    <content type="html"><![CDATA[<p>
<a href="/blog/caring-about-the-data.html"><img src="/media/attachments/titanicvisualizations.png" alt="Titanic visualizations" height="220" width="610" /></a>
</p>
<p>
When demonstrating <a href="/references/Kosara_TVCG_2006.html">Parallel Sets<sup>(ref)</sup></a> to guests and visitors, I often use the Titanic data set, because people can relate to it and it is entirely categorical. I like pointing out interesting facts the visualization shows (like that the second class was smaller than the first class), but it's really just a collection of numbers to show what the visualization can do. Some of the people I have shown this to feel different about it, though. <!--break-->
</p>
<p>
Perhaps the first reaction that made me think of the Titanic as not just numbers was an artist who asked me how I could present such a horrible incident with such lovely colors. Of course I was aware that almost 1500 people died when the Titanic sank, and most of them in horrible ways. But it's also almost 100 years ago now and just so abstract and distant that it never occurred to me to be specifically sensitive about it.
</p>
<table align="center" border="0" width="300">
	<tbody>
		<tr>
			<td><a href="/media/attachments/ParallelSets-Titanic-Kosara.png" target="_blank"><img src="/media/attachments/ParallelSets-Titanic-Kosara-thumb.png" height="209" width="200" /></a></td>
		</tr>
		<tr>
			<td><i>Titanic visualization using parallel sets. Too neat and colorful?</i> </td>
		</tr>
	</tbody>
</table>
<p>
Another incident involved a Titanic buff who pointed out an error in the data: the only girl in the first class did not survive (as our data says), but rather drowned because here parents would not let her get into a life boat without them. This was another thing I was not prepared for, because first of all I assumed the data set to be correct, and second I was only using it to show off a visualization method, why would I care about the correctness of the data?
</p>
<table align="center" border="0" width="300">
	<tbody>
		<tr>
			<td><a href="/media/LVA/BestOf/Titanic_Brandejsky_Buturovic_Kilzer.jpg" target="_blank"><img src="/media/LVA/BestOf/Titanic_Brandejsky_Buturovic_Kilzer.serendipityThumb.jpg" height="137" width="200" /></a></td>
		</tr>
		<tr>
			<td><i>Titanic visualization using an information graphics style. Easy to read, but every figure represents a number of people. Also perhaps a bit too playful?</i> </td>
		</tr>
	</tbody>
</table>
<p>
Of course, telling a good story about a data set requires understanding and interest. The impact of <a href="http://video.google.com/videoplay?docid=2670820702819322251">Hans Rosling's TED talk</a> (which I keep linking because it has made a lot of visualization people rethink why they do what they do) comes as much from his obvious interest in the data (and its relevance) as from his presentation style and gapminder graphics. Would we care that much about a presentation that just used the data to show off the gapminder visualization?
</p>
<p>
Interesting in this regard is also Swivel's <a href="http://www.swivel.com/official">official source program</a>, which allows organizations (like UNESCO, WHO, OECD, etc.) to publish their own data directly on Swivel, providing trustworthy data for people to scrutinize and compare with other data. Swivel's focus is clearly not visualization (and I like to <a href="/VisCrit/Swivel-vs-Many-Eyes.html">chastise them for that</a>), but the data. Is this a good approach? Not if you're only looking for the best visualization techniques, but in terms of data availability and analysis, it's an absolute necessity to build credibility and a stock of relevant and interesting data sets. 
</p>
<table align="center" border="0" width="300">
	<tbody>
		<tr>
			<td><a href="/media/LVA/BestOf/Titanic_Dabrowski_Jakl_May.png" target="_blank"><img src="/media/LVA/BestOf/Titanic_Dabrowski_Jakl_May.serendipityThumb.png" height="200" width="199" /></a></td>
		</tr>
		<tr>
			<td><i>Titanic visualization using an extended pie chart. Each dot represents a person; color, size, and placement describe properties sex, age, and class/survival, respectively.</i> </td>
		</tr>
	</tbody>
</table>
<p>
&nbsp;
</p>
<p>
Ultimately, who cares about a visualization or a particular statistical method to figure something out? The only people who even know about <a href="http://en.wikipedia.org/wiki/John_Snow_(physician)#Cholera">John Snow's use of graphics</a> are visualization people, but I doubt that the people cared whose lives he saved. What counts are the outcomes, not the tools that get you there.  
</p>
Does that mean visualization is irrelevant? <a href="/blog/visualization-sets-information-free.html">Of course not</a>. But we need to understand the context we are working in and we must become better at telling stories about the data. The only way to do that is to care about the data we are analyzing at least as much as about the new visualization technique we developed just for it.<br />
    ]]></content>
  </entry>
  <entry>
    <title>Rethinking the User</title>
    <link rel="alternate" type="text/html" href="http://eagereyes.org/blog/rethinking-the-user.html" />
    <id>http://eagereyes.org/blog/rethinking-the-user.html</id>
    <published>2007-05-31T14:30:47-04:00</published>
    <updated>2007-05-31T14:30:47-04:00</updated>
    <author>
      <name>Robert Kosara</name>
    </author>
    <category term="blog" />
    <category term="Insights" />
    <summary type="html"><![CDATA[<p>A discussion at a <a href="http://kathrin.dagstuhl.de/07221/">seminar in Dagstuhl</a>  (Germany) on Information Visualization has led to an interesting insight: what if we completely misunderstood who the users are for visualization? Especially in light of the current developments for <a href="/blog/visualization-sets-information-free.html">broadly usable visualization</a>, we need to rethink the types and levels of expertise that we can expect.    ]]></summary>
    <content type="html"><![CDATA[<p>A discussion at a <a href="http://kathrin.dagstuhl.de/07221/">seminar in Dagstuhl</a>  (Germany) on Information Visualization has led to an interesting insight: what if we completely misunderstood who the users are for visualization? Especially in light of the current developments for <a href="/blog/visualization-sets-information-free.html">broadly usable visualization</a>, we need to rethink the types and levels of expertise that we can expect.<!--break--></p><p>Two presentations in a session about Visual Analytics (by Stephan Diehl and Daniel Keim) presented work that had been based on visualization, but was not in itself visual. Diehl talked about a system for software development that was informed by insights from an exploratory visualization of version control data, and Keim gave a great overview over the problems when trying to &quot;sell&quot; visualization to potential users and funding agencies.</p><p>The current view of course is that visualization systems should be designed in a way that is useful to domain experts with little to no knowledge of visualization. In reality, of course these are not the actual users of visualizations: rather, the tools are run by their developers, in communication with the domain experts. Also, the idea of broadly available visualization tools á la <a href="/VisCrit/Swivel-vs-Many-Eyes.html">Swivel and Many-Eyes</a>  presents us with a completely new type of user: the casual (but interested) non- or semi-expert.</p><p>So let&#39;s face it: we&#39;re deluding ourselves with our current user model. A much more realistic taxonomy of users (IMHO) is the following:</p><ul><li><strong>Visualization Experts</strong>. We develop the tools, we use them. Simple as that. We use external data, and we communicate with domain experts, but we do not hand the tools over. We know how to read our displays, and so we can make things that are far more advanced and complex than we would expect somebody without experience in visualization to understand.</li><li><strong>Casual Users.</strong> The people who actually use visualizations they did not develop themselves are casual users, who are curious about something, or who just like playing with something visual. These users need general (i.e., not data-specific) tools that will be much simpler, and that will need to follow known interaction paradigms as much as possible. </li></ul>Understanding this will make our visualizations much more useful in practice, and we will have more time doing productive work instead of chasing after users that simply don&#39;t exist.    ]]></content>
  </entry>
</feed>
