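| Name | Formula | Variants |
|---|---|---|
| Nr instances | $n$ | $p/n$, $\log(n)$, $\log(n/p)$ |
| Nr features | $p$ | $\log(p)$, % categorical |
| Sample mean | $\mu$ | |
| Sample median | $\tilde{X}$ | |
| Sample var | $\sigma^2$ | |
| Sample min | $\min X$ | |
| Sample max | $\max X$ | |
| Sample std | $\sigma$ | |
| Percentile | $P_i$ | $q_1$, $q_{25}$, $q_{75}$, $q_{99}$ |
| Interquartile Range (IQR) | $q_{75} - q_{25}$ | |
| Normalized mean | $\mu / \max X$ | |
| Normalized median | $\tilde{X} / \max X$ | |
| Sample range | $\max X - \min X$ | |
| Sample Gini | | |
| Median absolute deviation | $\mathrm{median}(\lvert X - \tilde{X} \rvert)$ | |
| Average absolute deviation | $\mathrm{avg}(\lvert X - \tilde{X} \rvert)$ | |
| Quantile Coefficient Dispersion | $(q_{75} - q_{25}) / (q_{75} + q_{25})$ | |
| Coefficient of variance | | |
| Outlier outside 1% or 99% | % of samples outside the 1% or 99% percentile | |
| Outlier 3 STD | % of samples outside $3\sigma$ | |
| Normal test | | |
| k-th moments | | 5th to 10th moments |
| Skewness | Feature skewness | max, min, $\mu$, $\sigma$, skewness, kurtosis |
| Kurtosis | $\mu_4 / \sigma^4$ | max, min, $\mu$, $\sigma$, skewness, kurtosis |
| Correlation | $\rho$ | max, min, $\mu$, $\sigma$, skewness, kurtosis |
| Covariance | $\mathrm{Cov}$ | max, min, $\mu$, $\sigma$, skewness, kurtosis |
| Sparsity | $\#\text{Unique values} / n$ | max, min, $\mu$, $\sigma$, skewness, kurtosis |
| ANOVA p-value | $p_{\mathrm{ANOVA}} / n$ | max, min, $\mu$, $\sigma$, skewness, kurtosis |
| Coeff of variation | $\sigma_x / \mu_x$ | |
| Norm. entropy | $H(X) / \log_2 n$ | max, min, $\mu$, $\sigma$ |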
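
As an illustration of how several of the tabulated formulas might be evaluated in practice, the following is a minimal sketch in Python with NumPy. It is not the authors' implementation. The function name `basic_meta_features`, the dictionary keys, the choice to pool all entries of the data matrix for the "sample" statistics, and the use of the population standard deviation are all assumptions made for this example. Only a subset of the rows is covered.

```python
import numpy as np


def basic_meta_features(X):
    """Compute a few of the tabulated meta-features for a numeric 2-D array.

    Hypothetical helper for illustration only. Assumes max(X) != 0 and
    q75 + q25 != 0, otherwise the normalized quantities divide by zero.
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape            # number of instances, number of features
    flat = X.ravel()          # assumption: sample statistics pool all entries

    mu = flat.mean()
    med = np.median(flat)
    sigma = flat.std()        # assumption: population std (ddof=0)
    q1, q25, q75, q99 = np.percentile(flat, [1, 25, 75, 99])

    # Feature-wise sparsity: number of unique values per column divided by n
    sparsity = np.array([np.unique(X[:, j]).size / n for j in range(p)])

    return {
        "n": n,
        "p": p,
        "p/n": p / n,
        "log(n)": np.log(n),
        "log(n/p)": np.log(n / p),
        "mean": mu,
        "median": med,
        "var": sigma ** 2,
        "std": sigma,
        "min": flat.min(),
        "max": flat.max(),
        "range": flat.max() - flat.min(),
        "iqr": q75 - q25,
        "normalized_mean": mu / flat.max(),
        "normalized_median": med / flat.max(),
        # Absolute deviations are taken around the sample median
        "mad": np.median(np.abs(flat - med)),
        "aad": np.mean(np.abs(flat - med)),
        "qcd": (q75 - q25) / (q75 + q25),
        # Fractions of entries flagged by the two simple outlier rules
        "outlier_1_99": np.mean((flat < q1) | (flat > q99)),
        "outlier_3std": np.mean(np.abs(flat - mu) > 3 * sigma),
        # Variants column: summary statistics of a feature-wise quantity
        "sparsity_max": sparsity.max(),
        "sparsity_min": sparsity.min(),
        "sparsity_mean": sparsity.mean(),
        "sparsity_std": sparsity.std(),
    }
```

Calling `basic_meta_features(X)` on an array of shape `(n, p)` returns a flat dictionary that can be concatenated into a meta-feature vector. The feature-wise rows of the table (correlation, covariance, and so on) would follow the same pattern as `sparsity`: compute one value per feature, then summarize with the listed variants.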