The Wilcoxon Two-sample Statistic on Strongly Mixing processes

RJ Serfling

The Wilcoxon Two-sample Statistic on Strongly Mixing processes

Serfling, RJ. (1968). The Wilcoxon Two-sample Statistic on Strongly Mixing processes. Annals of Mathematical Statistics, 39(4), 1202-1209.

Copy citation

Abstract

On the basis of independent samples $\{X_1, \cdots, X_m\}$ and $\{Y_1, \cdots, Y_n\}$ with distributions $F$ and $G$, respectively, the hypothesis that $F \equiv G$ may be tested. Given the functional forms $F(x_1, \cdots, x_m)$ and $G(y_1, \cdots, y_n)$ of the sampling distributions except for values of certain parameters, the likelihood ratio approach, for example, can be used. In this case it is not crucial to assume that the samples are random, i.e., that $F(x_1, \cdots, x_m) = F(x_1) \cdots F(x_m)$ and $G(y_1, \cdots, y_n) = G(y_1) \cdots G(y_n)$, although such a simplification is useful whenever realistic. However, the nonparametric treatment of the problem has relied heavily on the assumption of random samples. Yet if the samples arise as realizations of two stochastic processes, the assumption of randomness is not realistic except in the case of renewal processes. Thus it is desirable to extend the scope of established nonparametric procedures to more general applications. The present paper deals with the Wilcoxon two-sample statistic. Among the desirable features of this statistic, when defined on independent random samples, is its asymptotically normal distribution, which for large samples facilitates a test of the hypothesis that $F \equiv G$ and a calculation of the power for any alternative $(F, G)$. It shall be seen that these aspects are true also when the samples arise from stochastic processes belonging to a wide class, including strictly stationary strongly mixing processes. Assume that the samples $\{X_1, \cdots, X_m\}$ and $\{Y_1, \cdots, Y_n\}$ are independent of each other, but let the random variables within a sample be possibly dependent. Assume that the functions $F(\cdot)$ and $G(\cdot)$ are continuous. The hypothesis $H: F \equiv G$ may be tested (conservatively) by testing the hypothesis $H_0: \gamma = 0$, where $\gamma = 2P\{Y > X\} - 1$. A representation of the Wilcoxon two-sample statistic is the $U$-statistic with sign function as kernel, \begin{equation*}\tag{1.1}U = (mn)^{-1}\sum^m_{i=1} \sum^n_{j=1} s(Y_j - X_i),\end{equation*} where $s(u) = -1, 0, 1$ according as $u < 0, = 0, > 0$. Since $Es(Y - X) = \gamma$, the statistic $U$ affords a natural basis for testing $H_0$. Under appropriate conditions, the statistic $Z = m^{\frac{1}{2}}(U - \gamma)$ has a limiting normal distribution with mean 0 and variance \begin{equation*}\tag{1.2}A^2 = 4 \lim_{k\rightarrow\infty} k^{-1} \operatorname{Var}\lbrack\sum^k_{i=1} G(X_i)\rbrack + 4c \lim_{k\rightarrow\infty}k^{-1} \operatorname{Var}\lbrack\sum^k_{i=1} F(Y_i)\rbrack,\end{equation*} as $m$ and $n \rightarrow \infty$ such that $m/n$ has a limit $c \neq 0$. The main conclusions of this nature are given in Theorems 3.1 and 3.2. Some areas of application are indicated in Section 4. The business of dealing with the quantity $A^2$ is discussed in Section 5. The limiting behavior of $Z$ is obtained by consideration of a statistic asymptotically equivalent in distribution but more amenable to the direct application of central limit theory, an approach put forth by Hoeffding [3] in dealing with a wide class of $U$-statistics as defined on a single sample of mutually independent rv's. The present contribution adapts the method to a single, but important, (two-sample) $U$-statistic with dependence allowed within samples. Define: \begin{equation*}\tag{1.3}W = m^{-\frac{1}{2}} \sum^m_{i=1} \lbrack f_{10}(X_i) - \gamma\rbrack + m^{\frac{1}{2}}n^{-1} \sum^n_{j=1} \lbrack f_{01} (Y_j) - \gamma\rbrack,\end{equation*} where $f_{10}(t) = Es(Y - t) = 1 - 2G(t)$ and $f_{01}(t) = Es(t - X) = 2F(t) - 1$. Since $Ef_{10}(X) = Ef_{01}(Y) = \gamma$, we have $EW = E(Z - W) = 0$. In Section 2 we find conditions such that $E(Z - W)^2 \rightarrow 0$, in which case it follows by Chebyshev's inequality that $(Z - W) \rightarrow 0$ in probability and hence that the statistics $Z$ and $W$ have the same limiting distribution (if any). The application of central limit theory to $W$ is through the sums $\sum^m_1 f_{10}(X_i)$ and $\sum^n_1 f_{01}(Y_j)$, or equivalently through $m^{-\frac{1}{2}} \sum^m_1 G(X_i)$ and $n^{-\frac{1}{2}}\sum^n_1 F(Y_j)$. If each of these independent normed sums has a limiting normal distribution, then $W$ is asymptotically normal, as $m$ and $n \rightarrow \infty$ such that $m/n \rightarrow c \neq 0$. Relevant central limit theorems for sums of dependent variables are utilized in Section 3

Publications Info

To contact an RTI author, request a report, or for additional information about publications by our experts, send us your request.

publications@rti.org

RTI shares its evidence-based research - through peer-reviewed publications and media - to ensure that it is accessible for others to build on, in line with our mission and scientific standards.

Recent Publications

Article

Patient-reported outcome improvements following scalp hair regrowth among patients with Alopecia Areata: analysis of the ALLEGRO-2b/3 trial

December 2025

Article

Plain language summary of mortality rates of patients with Parkinson’s disease psychosis who were treated either with pimavanserin or with different second-generation (atypical) antipsychotics

December 2025

Article

Biological parenthood rates among men with sickle cell disease

December 2025

Article

Patterns of felt stigma among rural-dwelling people who use drugs: A latent class analysis

December 2025

Article

One voice and vision: How the RISE network built a collective identity as the foundation for strategic dissemination

December 2025

Article

Estimating community-level prevalence of opioid use disorder: Extrapolating from Medicaid claims data and other publicly available data sources in Ohio, USA

December 2025

Article

Experiences of parents who receive a false-positive CK-MM screening for their newborn

December 2025

Article

Evaluating the efficacy and safety of milrinone for prevention of post-patent ductus arteriosus closure syndrome (the MIDAS trial) in extremely preterm infants: A multicentre, double-masked, randomised, placebo-controlled trial

December 2025

View All Publications