Parallel Sets: Visual Analysis of Categorical Data

The discrete nature of categorical data makes it a particular challenge for visualization. Methods that work very well for continuous data are often hardly usable with categorical dimensions. Only few methods deal properly with such data, mostly because of the discrete nature of categorical data, which does not translate well into the continuous domains of space and color. Parallel Sets is a new visualization method that adopts the layout of parallel coordinates, but substitutes the individual data points by a frequency-based representation. This abstracted view, combined with a set of carefully designed interactions, supports visual data analysis of large and complex data sets. The technique allows efficient work with meta data, which is particularly important when dealing with categorical datasets. By creating new dimensions from existing ones, for example, the user can filter the data according to his or her current needs. We also present the results from an interactive analysis of CRM data using Parallel Sets. We demonstrate how the flexible layout eases the process of knowledge crystallization, especially when combined with a sophisticated interaction scheme.

Fabian Bendix, Robert Kosara, Helwig Hauser, Parallel Sets: Visual Analysis of Categorical Data, IEEE Symposium on Information Visualization (InfoVis), pp. 133–140, 2005. DOI: 10.1109/INFOVIS.2005.27
bibtex
@inproceedings{Bendix:InfoVis:2005,
	year = 2005,
	title = {Parallel Sets: Visual Analysis of Categorical Data}, 
	author = {Fabian Bendix and Robert Kosara and Helwig Hauser}, 
	booktitle = {IEEE Symposium on Information Visualization (InfoVis)}, 
	pages = {133--140},
	doi = {10.1109/INFOVIS.2005.27}, 
	abstract = {The discrete nature of categorical data makes it a particular challenge for visualization. Methods that work very well for continuous data are often hardly usable with categorical dimensions. Only few methods deal properly with such data, mostly because of the discrete nature of categorical data, which does not translate well into the continuous domains of space and color. Parallel Sets is a new visualization method that adopts the layout of parallel coordinates, but substitutes the individual data points by a frequency-based representation. This abstracted view, combined with a set of carefully designed interactions, supports visual data analysis of large and complex data sets. The technique allows efficient work with meta data, which is particularly important when dealing with categorical datasets. By creating new dimensions from existing ones, for example, the user can filter the data according to his or her current needs. We also present the results from an interactive analysis of CRM data using Parallel Sets. We demonstrate how the flexible layout eases the process of knowledge crystallization, especially when combined with a sophisticated interaction scheme.},
}