This is joint work with leading geoscientist Dr. Susan Brantley in Pennstate, her postdoctoral scholar Tao Wen and Allison Herman. I did a thorough simulation study to compare different two-sample parametric tests (including Welch’s t-test and its permutation and Bootstrap versions) and non-parametric tests (including Wilcoxon and Brunner-Munzel) based on empirical distributions over the multiple datasets and evaluated power/size under different cases of unequal variances and imbalanced sample sizes.
The classical parametric form of the Behrens-Fisher problem considers the hypothesis of equal means in the presence of potentially different variances. This problem of heteroscedasticity (unequal variances) is usually encountered in Geoscientific data with concentrations of various analytes following differently scaled distributions.
This problem has been further extended to the non-parametric case when the distributions of the samples are unknown. In the non-parametric tests like WMW, if the samples have unequal variances, the rejection of the null hypothesis only means the distributions are not equal stochastically. However, this still doesn’t imply whether one distribution is greater than the other regarding some location parameter, e.g., median (Chen, 2000). Moreover, in such cases, the WMW test does not maintain its level (Pratt, 1964).
There have been several papers in the literature that give solutions to non-parametric BF problem under different conditions. For example, a partial solution to the non-parametric Behrens-Fisher problem was provided by (Babu, 2002) for testing the equality of the medians of two continuous distributions having the same shape, but possibly unequal variances. (Brunner, 2000) proposed a generalized version of WMW rank test to solve the non-parametric BF problem and gave extensive simulations to show WMW test may be conservative or liberal depending on the ratio of the sample sizes and the variances of the underlying distribution functions.
We consider comparisons between t-test, WMW test BM test (also known as Generalized Wilcoxon test) and presented our findings for NW PA and Bradford datasets. This paper will appear soon in Environmental Science: Processes & Impacts.