<< normality CHE3007S partial_dependance >>

CHE3007S >> CHE3007S > pairplot

pairplot

Function to plot pairwise relatioships within a dataset. A grid will be created with each variable plotted against the other. The diagonals will dispay the frequency distributions of the data, which can be plotted as a histogram or with kernel density estimation of the distribution

Syntax

pairplot(header, data, column, png_name)
pairplot(header, data, column, png_name, sheader, sdata)
pairplot(header, data, column, png_name, sheader, sdata, hue)
pairplot(header, data, column, png_name, sheader, sdata, hue, hue_bin)
pairplot(header, data, column, png_name, sheader, sdata, hue, hue_bin, n_points)

Parameters

header:

m x 1 vector of headers for numeric data

data:

n x m matrix of numeric data, with matrix of column vectors

column:

vector of columns indices or header labels which will be plotted

png_name:

name for the png file to be exported

sheader:

p x 1 matrix of corresponding headers for string/text data

sdata:

n x p matrix of string/text data with data as column vectors

hue:

label of field on which to split the data into different groups and colours for plotting

hue_bin:

if hue column is corresponding to a numeric field, this is the the number of bins to split the data into linearly over range, the default for this is 5.

n_points:

number of points for the kernel density estimation of the frequency distributions to be estimated over, more points result in a smoother curve. Default is 1000.


Report an issue
<< normality CHE3007S partial_dependance >>