Two-sample Kolmogorov-Smirnov test using a Bayesian nonparametric approach


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

In this paper, a Bayesian nonparametric approach to the two-sample problem is proposed. Given two samples \(\text{X} = {X_1}, \ldots ,{X_{m1}}\;\mathop {\text~}\limits^{i.i.d.} F\) and \(Y = {Y_1}, \ldots ,{Y_{{m_2}}}\mathop {\text~}\limits^{i.i.d.} G\), with F and G being unknown continuous cumulative distribution functions, we wish to test the null hypothesis H0: F = G. The method is based on computing the Kolmogorov distance between two posterior Dirichlet processes and comparing the results with a reference distance. The parameters of the Dirichlet processes are selected so that any discrepancy between the posterior distance and the reference distance is related to the difference between the two samples. Relevant theoretical properties of the procedure are also developed. Through simulated examples, the approach is compared to the frequentist Kolmogorov–Smirnov test and a Bayesian nonparametric test in which it demonstrates excellent performance.

About the authors

L. Al-Labadi

Dept. Math. & Comput. Sci.

Author for correspondence.
Email: luai.allabadi@utoronto.ca
Canada, Mississauga

M. Zarepour

Dept. Math. and Statist.

Email: luai.allabadi@utoronto.ca
Canada, Ottawa

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2017 Allerton Press, Inc.