Skip to main content

Universally Consistent K-Sample Tests via Dependence Measures

Sambit Panda*, Cencheng Shen*, Ronan Perry, Jelle Zorn, Antoine Lutz, Carey E. Priebe, Joshua T. Vogelstein
Statistics & Probability Letters,

*Equal Contribution

Abstract

The K-sample testing problem involves determining whether K groups of data points are each drawn from the same distribution. Analysis of variance is arguably the most classical method to test mean differences, along with several recent methods to test distributional differences. In this paper, we demonstrate the existence of a transformation that allows K-sample testing to be carried out using any dependence measure. Consequently, universally consistent K-sample testing can be achieved using a universally consistent dependence measure, such as distance correlation and the Hilbert-Schmidt independence criterion. This enables a wide range of dependence measures to be easily applied to K-sample testing.


← Previous
hyppo: A Multivariate Hypothesis Testing Python Package

Next →
When no answer is better than a wrong answer: a causal perspective on batch effects