Abstract
The multisample version of the Cucconi rank test for the two-sample location-scale problem is proposed. Even though little known, the Cucconi test is of interest for several reasons. The test is compared with some Lepage-type tests. It is shown that the multisample Cucconi test is slightly more powerful than the multisample Lepage test. Moreover, its test statistic can be computed analytically whereas several others cannot. A practical application example in experimental nutrition is presented. An R function to perform the multisample Cucconi test is given.
Similar content being viewed by others
References
Adamson GCD, Nash DJ (2013) Long-term variability in the date of monsoon onset over western India. Clim Dyn 40:2589–2603
Akkouchi M (2005) On the convolution of gamma distributions. Soochow J Math 31:205–211
Baumgartner W, Weiss P, Schindler H (1998) A nonparametric test for the general two-sample problem. Biometrics 54:1129–1135
Bausch J (2012) On the efficient calculation of a linear combination of chi-square random variables with an application in counting string vacua, arXiv:1208.2691v2
Boos DD, Zhang J (2000) Monte Carlo evaluation of resampling-based hypothesis tests. J Am Stat Assoc 95:486–492
Buning H, Thadewald T (2000) An adaptive two-sample location-scale test of Lepage-type for symmetric distributions. J Stat Comput Simul 65:287–310
Castano-Martinez A, Lopez-Blazquez F (2005) Distribution of a sum of weighted noncentral chi-square variables. Test 14:397–415
Cucconi O (1968) Un nuovo test non parametrico per il confronto tra due gruppi campionari. Giornale degli Economisti 27:225–248
Gerhard D, Hothorn LA (2010) Rank transformation in haseman-elston regression using scores for location-scale alternatives. Hum Hered 69:143–151
Hajek J, Sidak Z, Sen PK (1998) Theory of rank tests, 2nd edn. Academic Press, New York
Kruskal WH, Wallis WA (1952) Use of ranks in one criterion variance analysis. J Am Stat Assoc 47:583–621
Lepage Y (1971) A combination of Wilcoxon’s and Ansari-Bradley’s statistics. Biometrika 58:213–217
Lindsay BG, Pilla RS, Basak P (2000) Moment-based approximations of distributions using mixtures: theory and applications. Ann Inst Math Stat 52:215–230
Lunde A, Timmermann A (2004) Duration dependence in stock prices: an analysis of bull and bear markets. J Bus Econ Stat 22:253–273
Marozzi M (2007) Multivariate tri-aspect non-parametric testing. J Nonparametr Stat 19:269–282
Marozzi M (2009) Some notes on the location-scale Cucconi test. J Nonparametr Stat 21:629–647
Marozzi M (2012a) A combined test for differences in scale based on the interquantile range. Stat Pap 53:61–72
Marozzi M (2012b) A modified Hall–Padmanabhan test for the homogeneity of scales. Commun Stat – Theory Methods 41(16–17):3068–3078
Marozzi M (2012c) A modified Cucconi test for location and scale change alternatives. Colomb J Stat 35:369–382
Marozzi M (2013) Nonparametric simultaneous tests for location and scale testing: a comparison of several methods. Commun Stat–Simul Comput 42(6):1298–1317
Manly BFJ, Francis RICC (2002) Testing for mean and variance differences with samples from distributions that may be non-normal with unequal variances. J Stat Comput Simul 72(8):633–646
Moore DS, McCabe GP (2009) Introduction to the practice statstics, 6th edn. Freeman, New York
Muccioli C, Belford R, Podgor M, Sampaio P, de Smet M, Nussenblatt R (1993) The diagnosis of intraocular inflammation and cytomegalovirus retinitis in HIV-infected patients by laser flare photometry. Ocul Immunol Inflamm 4(2):75–81
Murakami H (2007) Lepage-type statistic based on the modified Baumgartner statistic. Comput Stat Data Anal 51:5061–5067
Murakami H (2008) A multisample rank test for location-scale parameters. Commun Stat–Simul Comput 37:1347–1355
Neuhauser M (2000) An exact two-sample test based on the Baumgartner–Weiss–Schindler statistic and a modification of Lepage’s test. Commun Stat–Theory and Methods 29:67–78
Neuhauser M (2001) An adaptive location-scale test. Biom J 43:809–819
Neuhauser M, Kotzmann J, Walier M, Poulin R (2010) The comparison of mean crowding between two groups. J Parasitol 96:477–481
Oden NL (1991) Allocation of effort in Monte Carlo simulation for power of permutation tests. J Am Stat Assoc 86:1074–1076
Pesarin F, Salmaso L (2010) Permutation tests for complex data. Wiley, Chichester
Podgor MJ, Gastwirth JL (1994) On non-parametric and generalized tests for the two-sample problem with location and scale change alternatives. Stat Med 13:747–758
Palar K, Sturm R (2009) Potential societal savings from reduced sodium consumption in the U.S. adult population. Am J Health Promot 24:49–57
Puri ML (1965) On some tests of homogeneity of variances. Ann Inst Stat Math 17:323–330
Rublik F (2005) The multisample version of the Lepage test. Kybernetika 41:713–733
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1
Cucconi (1968) does not organize the formal results on \(\sum _{i=1}^{n_{k}}R_{ki}^{2}\) and \(\sum _{i=1}^{n_{k}}( n+1-R_{ki})^{2}\) in lemmas and theorems but gives an outline of some derivations. Here we reorganize the results on the sums of squared ranks and squared antiranks in a clearer manner by reporting the whole proof of all results. Well known results about sum of powers of the first \(n\) natural numbers will be used in this section.
Theorem 1
Under the null hypothesis
Proof
Consider the population of the first \(n\) squared natural numbers \( 1,2^{2},...,n^{2}\). \(\frac{1}{n_k} \sum _{i=1}^{n_{k}}R_{ki}^{2}\) may be seen as the random variable defined by the mean of a random sample of \(n_{k}\) values drawn without replacement from this population. Since the mean of the sample means is equal to the population mean it follows that
\(\forall k\), and the thesis follows immediately. \(\square \)
Theorem 2
Under the null hypothesis
Proof
With simple algebra we first compute the variance \(\tau ^2\) of the population of the first \(n\) squared natural numbers
Now, since the sampling is without replacement
\(\forall k\), and the thesis follows immediately. \(\square \)
The following lemmas will be used for proving Theorem 3.
Lemma 1
Under the null hypothesis
Proof
Straightforward (we deliberately not factor the result). \(\square \)
Lemma 2
Under the null hypothesis
Proof
It is \(E\left( \,\sum _{i=1}^{n_{k}}R_{ki}^{3}\right) =\frac{n_k}{n}\sum _{j=1}^{n}j^{3}=n_kn\left( n+1\right) ^{2}/4\) and it is
Finally it follows with simple algebra that
\(\forall k\). \(\square \)
Theorem 3
Under the null hypothesis
Proof
It is
It remains to compute
Using Lemma 1 it follows that
Using Theorem 1, Theorem 2 and Lemma 2 it follows with simple algebra that
and finally that
\(\forall k\).\(\square \)
Appendix 2
An R function for performing the multisample Cucconi test follows.
To analyze the data considered in Sect. 6 run the following code.
Rights and permissions
About this article
Cite this article
Marozzi, M. The multisample Cucconi test. Stat Methods Appl 23, 209–227 (2014). https://doi.org/10.1007/s10260-014-0255-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10260-014-0255-x