Interactive features visualization in KNIME

Type
Author
Thomas Laurent 0000-0001-7686-3249
License/Openness
Description

KNIME workflow to visualize a dataset described by multiple quantitative features (ex: a list of samples or cells, each described with multiple morphological features) as a 3D cloud of points (each point corresponding to one sample/cell) as well as a line plot (1 line per sample/cell).

For the 3D plot, the workflow uses Principal Component Analysis (PCA) for dimensionality reduction, ie it simplifies the information for each sample from n-features to 3 pseudo-features which are used as x,y,z-coordinates for each sample.

The workflow is interactive and so selecting in one panel of the figure will highlight in the other panel too.

It was originally published for the visualization of phenotypic kidney features in zebrafish, but the workflow is generic by design and can be reused for any quantitative feature set. 

Just make sure that the PCA represents a decent % of the original data variance (at least 70%). Otherwise the PCA plot will not be representative of the original data-distribution.

Additional keywords