2
2 Introduction: Visualization vs Communication Data Visualization Algorithm OutputInput Communication Channel Visualization The effectiveness of a visualization algorithm depends on the amount of information that can be transmitted

3
3 Introduction: Information-theoretic Visualization Framework Can more information be shown? Visualization Algorithm Yes Visualization Data Visualization should be information-dependent e.g. for flow visualization, the flow near the salient flow features shouldn’t be missed Information Information in Visualization No Information in Data Stop

4
4 Outline 1.Realization of this visualization framework for static flow a.Measurement of information in data (vector field) b.Measurement for information in visualization (streamlines) c.Information comparison between data & visualization 2.An information-aware streamline placement algorithm Place streamlines to highlight salient flow features Automatically stop when no more information can be shown

6
6 Information in Vector Fields Concept Treat the vector field as a data source that generates vector orientation as outcome The more diverse the vector orientations, the more information is contained in the vector field Measurement Estimate the distribution of the vector orientation Compute the entropy of this distribution as the measurement Vector field Polar Histogram

7
7 Information in Vector Fields (contd’)

8
8 Entropy Field and Seeding Measure the entropy around each point’s neighborhood Vector Field Entropy field: higher value means more information in the corresponding region Entropy-based seeding: Places streamlines on the region with high entropy

9
9 Information in Streamlines Given information: estimate the vector along the streamlines by their trajectory Reconstructed information: synthesize the vectors of the empty regions What kind of information is represented by the streamlines? A synthesized vector field

10
10 The Information Comparison between Data/Visualization Conditional entropy H(X|Y): The information in X not represented by Y An effective visualization should represent most information in the data, i.e. H(X|Y) should be small Vector Field X H(X)H(X)H(Y)H(Y) Streamlines Y H(X|Y)H(X|Y)

11
11 H(X)H(X) Conditional Entropy and Joint Entropy H(X|Y)H(X|Y) H(Y) H(X, Y) H(Y)H(Y) – = Vector field from the streamlines H(Y): entropy of the distribution of only vectors Y on all points Input vector field H(X, Y): entropy of the joint distribution of both vectors X and Y on all points H(X, Y): Joint Entropy of both X and Y H(Y): Entropy of Y streamlines

12
12 Conditional Entropy Field and Seeding Measure the under-represented information in each region Streamlines Conditional entropy field Conditional-entropy-based seeding: Place more seeds on regions with higher under-represented information

13
13 Removal of Redundant Streamlines Remove the streamlines that cannot reduce the conditional entropy Computing conditional entropy per streamline is slow Approximation Discard a streamline if all its points are too close to existing streamlines Use a smaller distance threshold for regions with higher entropy After the removal of redundant streamlines Initially placed seeds

21
21 Acknowledgement The authors would like to thank Torsten Möller, Carrie Stein, and the anonymous reviewers for their comments This work was supported in part by NSF ITR Grant ACI NSF RI Grant CNS NSF Career Award CCF DOE SciDAC grant DE-FC02-06ER25779 Questions?