Data is being set free: the United Nations have started a new website called UN Data to share the data collected by a number of UN agencies. 55 million data records are waiting to be explored and visualized. The search interface is very nice and usable, but still lacks power.
In contrast to many other interfaces (most notably the horrible mess at the world bank), their querying is quite good and implemented well. Views update without the need for a page reload, and the interface elements react to what is being displayed. The data can be sorted, filtered, and even pivot tables can be created. It is possible to download the data in four formats: a rather pointless and bloated XML (which just doesn’t make sense for tables) and tabular with three different separators (comma, semicolon, or pipe).
I immediately thought, “this would be perfect in combination with Trendalyzer!” – and sure enough, this was not done by the UN alone, but together with gapminder. I’m sure they will be incorporating this as a data source soon, which should be interesting.
What is missing? While filtering is nice, combining all the different data is where the true power would come in. It is not possible to add data dimensions to a table from a different source, which would make a lot of sense to do since most data sets share at least the country name and year. Comparing development to income, health to education, etc. is where the true value of such data lies. The folks at Swivel realized that more than a year ago and made it easy to combine data sets in almost any way imaginable. That is really needed here as well.
The other thing that is sorely lacking is programmatic access. There does not seem to be an API to enumerate the data sources and get to the actual data, which would be kind of an obvious thing to do. Web 2.0 has brought us APIs for photographs, maps, and bloggers’ feelings, but not for the really relevant data like census or world health and development data.
It’s a great start, and good to see this happen. Hopefully, more data sources and better data access will follow. Data indeed wants to be free.