Advanced Plotting
Getting Ready
dat2=pd.read_csv("C:\\Users\duan\Desktop\PythonDataProcessingVisualization\meanByClass.txt", sep='\s+')dat2
Explore a fake gene expression data modified from iris.csv

Advanced Line Plot

Get rid of the legend

Separate the features

Plotting on a Secondary Y-axis

Plot a subset of columns

Selective Plotting on Secondary Y-axis

Targeting different subplots by passing an ax argument

Adjusting spacing between subplots

Advanced Bar Plots
Looking at one replicate a time

Looking at all replicates at the same time


stacked boxes

Advanced Histogram

Scatter Plot


Parallel Coordinates
Parallel coordinates is a plotting technique for plotting multivariate data. It allows one to see clusters in data and to estimate other statistics visually. Using parallel coordinates points are represented as connected line segments. Each vertical line represents one attribute. One set of connected line segments represents one data point. Points that tend to cluster will appear closer together




Andrews Curves
Andrews Curves are smoothed versions of Parallel Coordinates

A potential issue when plotting a large number of columns is that it can be difficult to distinguish some series due to repetition in the default colors. To remedy this, we can either loop through different colors using rainbow() function. Or DataFrame plotting supports the use of the colormap= argument, which accepts either a Matplotlib colormap or a string that is a name of a colormap registered with Matplotlib



RadViz


Last updated
Was this helpful?
