Derivation of the Exact Moments of the Distribution of Pearsons Correlation over Permutations of Data

02/20/2020

∙

Pearson's correlation is one of the most widely used measures of association today, the importance of which to modern science cannot be understated. Two of the most common methods for computing the p-value for a hypothesis test of this correlation method are a t-statistic and permutation sampling. When a dataset comes from a bivariate normal distribution under specific data transformations a t-statistic is exact. However, for datasets which do not follow this stipulation, both approaches are merely estimations of the distribution of over permutations of data. In this paper we explicitly show the dependency of the permutation distribution of Pearson's correlation on the central moments of the data and derive an inductive formula which allows the computation of these exact moments. This has direct implications for computing the p-value for general datasets which could lead to more computationally accurate methods.

READ FULL TEXT

Derivation of the Exact Moments of the Distribution of Pearsons Correlation over Permutations of Data

Moments of the Distribution of Pearsons Correlation over Permutations of Data

T-statistic for Autoregressive process

The distribution of Yule's 'nonsense correlation'

Exact first moments of the RV coefficient by invariant orthogonal integration

Moments of permutation statistics and central limit theorems

Method of Moments Histograms

Yule's "nonsense correlation" for Gaussian random walks

Derivation of the Exact Moments of the Distribution of Pearsons Correlation over Permutations of Data

Related Research

Moments of the Distribution of Pearsons Correlation over Permutations of Data

T-statistic for Autoregressive process

The distribution of Yule's 'nonsense correlation'

Exact first moments of the RV coefficient by invariant orthogonal integration

Moments of permutation statistics and central limit theorems

Method of Moments Histograms

Yule's "nonsense correlation" for Gaussian random walks