Abstract
In 1933, A.N. Kolmogorov (1933a) published a short but landmark paper in the Italian Giornale dell’Istituto Italiano degli Attuari. He formally defined the empirical distribution function (EDF) and then enquired how close this would be to the true distribution F(x) when this is continuous. This leads naturally to the definition of what has come to be known as the Kolmogorov statistic (or sometimes the Kolmogorov- Smirnov statistic) D, and Kolmogorov not only then demonstrates that the difference between the EDF and F(x) can be made as small as we please as the sample size n becomes larger, but also gives a method for calculating the distribution of D at specified points, for finite n, and uses this to give the asymptotic distribution of D. The ideas in this paper have formed a platform for a vast literature, both of interesting and important probability problems, and also concerning methods of using the Kolmogorov statistic (and other statistics) for testing fit to a distribution. This literature continues with great strength today, after over 50 years, showing no signs of diminishing. It is evident that the ideas set in motion by Kolmogorov are of paramount importance in statistical analysis, and variations on the probabilistic problems, including modern methods of treating them, continue to hold attention.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anderson. T.W.. and Darling, D.A. (1952). Asymptotic theory of certain goodness of fit criteria based on stochastic processes. Ann. Math. Statist., 23. 193–212
Birnbaum. Z.W. and Tingey. F.H. (1951). One-sided confidence contours for distribution functions. Ann. Math. Statist., 22. 592–596.
Birnbaum, Z.W. (1952) Numerical tabulation of the distribution of Kolmogorov’s statistics for finite sample size, J. Amer. Statist. Assoc., 47, 425–441
Cramér. H. (1928). On the composition of elementary errors. Second paper Statistical Applications. Skand. Aktuarietidskrift, 11, 171–180.
Chang. L.C. (1955). On the ratio of an empirical distribution function to the theoreti¬cal distribution function. Acta Math. Sinica, 5. 347–368 [also Selected Translations in Math. Statist. Proh.. 4, (1963), 17-38].
Cheng. P. (1958). Non-negative jump points of an empirical distribution function relative to a theoretical distribution function. Acta Math. Sinica. 8. 333–347 [also Selected Translations in Math. Statist. Proh„ 3. (1962). 205–224],
Darling. DA. (1955). The Cramer-Smirnov test in the parametric case. Ann. Math. Statist.. 26. 1–20.
David. F.N, and Johnson N.L. (1948). The probability integral transformation when parameters arc estimated from the sample. Biometrika, 35. 182–192.
Donsker. M.D. (1952). Justification and extension of Doob’s heuristic approach to the Kolmogorov-Smirnov theorems. Ann. Math Statist. 23, 277–281.
Doob. J.L. (1949). Heuristic approach to the Kolmogorov Smirnov theorems, Ann. Math. Statist.. 20. 393–403.
Drion. E.F. (1952). Some distribution-free tests for the difference between two empirical cumulative distribution functions, Ann. Math. Statist., 23, 563–574.
Durbin, J. (1973). Distribution Theory for Tests Based on the Sample Distribution Function. CBMS-NSF Regional Conference Series in Applied Mathematics. Society for Industrial and Applied Mathematics, Philadelphia, Pa.
Durbin, J. (1975). Kolmogorov-Smirnov tests when parameters arc estimated with applications to tests of exponentiality and tests on spacings. Biometrika. 62. (1). 5–22.
Feller. W. (1948). On the Kolmogorov-Smirnov limit theorems for empirical distributions, Ann. Math. Statist.. 19. 177–189.
Gibbons. J.D. (1983). Kolmogorov-Smirnov symmetry test, in Encyclopedia of Statistical Sciences (S. Kotz. N.L. Johnson, and C.B. Read, cds.) vol. 4. Wiley, New York. pp. 396–398.
Gnedenko. B.V. (1952), Some results on the maximum discrepancy between two empirical distributions. Dokl. Akad. Sauk SSSR, 82, 661–663 [also Selected Translations in Math. Statist. Prob.. 1, (1961), 73–76],
Gnedenko. B.V., and Korolyuk, V.S. (1951): On the maximum discrepancy between two empirical distributions. Dokl. Akad. Nauk SSSR. 80. 525–528 [also Selected Translations in Math. Statist. Prob., 1 (1961). 13–22].
Gnedenko, B.V. and Mihalevic, V.S. (1952a). On the distribution of the number of excesses of one empirical distribution function over another. Dokl. Akad. Nauk SSSR. 82, 841–843 [also Selected Translations in Math. Statist. Prob., 1 (1961). 83–85].
Gnedenko, B. V.. and Mihalevic, V.S. (1952b). Two theorems on behaviour of empirical distribution functions. Dokl. Akad. Nauk SSSR. 85. 25–27 [also Selected Translations in Math. Statist. Prob., 1 (1961). 55–58].
Gnedenko. B.V., and Rvaceva, E.L. (1952). On a problem of the comparison of two empirical distributions. Dokl. Akad. Nauk SSSR. 82. 513–516 [also Selected Translations in Math. Statist. Prob., 1 (1961), 69–72].
Guilbaud, O. (1988). Exact Kolmogorov-type tests for left truncated and,’or right- censored data, J. Amer. Statist. Assoc.. 83, 213–221.
Hall, W.J, and Wellner. JA. (1980). Confidence band for a survival curve from censored data. Biometrika, 67. 133–143.
Kac, M., Kiefer, J., and Wolfowitz. J. (1955). On tests of normality and other tests of goodness of fit based on distance methods. Ann. Math. Statist., 26. 189–211.
Khmaladze, E.V. (1986). Introduction to Kolmogorov (1933). In Teoriia Veroiatnostei I Maiematicheskaia Statistika. Nauka: Moscow.
Kendall, M.G., and Stuart, A. (1979). The Advanced Theory of Statistics, 4th ed. McMillan. New York.
Kolmogorov, A. (1931). Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung. Math. Ann., 104. 415–458.
Kolmogorov, A. (1933a). Sulla determinazione empirica di una legge di distribuzionc, 1st. Ital. Attuari. G.. 4. 1–11.
Kolmogorov, A. (1933b) Über die Grenzwertsätze der Wahrscheinlichkeitsrechnung. Bull. (Izvestija) Acad. Sei. URSS, 363–372.
Koroljuk, V.S. (1955). On the discrepancy of empiric distributions for the case of two independent samples, Izv. Akad. Nauk SSSR Ser. Mat., 19, 91–96 [also Selected Translations in Math. Statist. Prob, 4 (1963). 105–122].
Kotz, S., Johnson. N.L, and Read C-B. (eds.) (1989). Kolmogorov. Andrei Nikoleyevteh. in Encyclopedia of Statistical Sciences, Suppl. Volume. Wiley, New York, 78–80.
Kuiper, N.H. (1960): Tests concerning random points on a circle. Proc. Koninkl. Neder. Akad. van. Wetenschappen. A. 63. 38–47.
Lockhart. R A., and Stephens, M.A. (1985a). Goodness-of-fit tests for the gamma distribution. Technical report. Department of Mathematics and Statistics. Simon Fraser University.
Lockhart. R.A.. and Stephens. M.A. (1985b). Goodness-of-fit tests for the von Mises distribution. Biometrika. 72. 647– 652.
Massey, F.J. (1950). A note on the estimation of a distribution function by confidence limits, Ann. Math. Statist., 21. 116–119.
Massey, FJ. (1951a) The Kolmogorov-Smirnov tests for goodness of fit, J. Amer. Statist. Assoc., 46. 68–78.
Massey, F.J. (1951b). The distribution of the maximum deviation between two sample cumulative step functions, Ann. Math. Statist., 22, 125–128.
Massey, K.J. (1952). Distribution table for the deviation between sample cumulatives. Ann. Math. Statist., 23, 435–441.
Miller. L.H. (1956). Table of percentage points of Kolmogorov statistics, J. Amer. Statist. Assoc., 51, 111–121.
Niederhausen, H. (1981a). Tables of significant points for the variance-weighted Kolmogorov-Smirnov statistics. Technical report. Department of Statistics. Stanford University.
Niederhausen. H. (1981b). Sheffer polynomials for computing exact Kolmogorov- Smirnov and Renyi type distributions, Ann. Statist. 5. 923–944.
Pettitt, A.N.. and Stephens, M.A. (1977): The Kolmogorov-Smirnov goodness-of-fit statistics with discrete and grouped data. Technometrics. 19. 205–210.
Pyke. R. (1959). The supremum and infimum of the Poisson process, Ann. Math. Statist.. 30. 568–576.
Renyi, A. (1953). On the theory of order statistics. Acta Math. Acad. Sci. Hungary. 4, 191–231.
Sahler, W. (1968). A survey of distribution-free statistics based on distances between distribution functions. Metrika, 13, 149–169.
Shiryaev, A.N. (1989). Kolmogorov’s life and creative activities. Ann. Prob. 17. 866–944.
Smirnov, N.V. (1936, 1937 ). Sur la distribution de w2 (criterium de M. R. von Mises) C. R. Acad. Sci. (Paris), 202, (1936), 449–452, [paper with the same title in Russian. Recueil Math.. 2 (1937), 973–993],
Smirnov, N.V. (1939a). Ob uklonenijah empiricheskoi krivoi rasprcdelenija. Recueil Math. Mat. Sbornik, N.S., 6 (48), 13–26.
Smirnov, N.V. (1939b). On the estimation of the discrepancy between empirical curves of distributions for two independent samples, Bull. Math. Univ. Moscou, 2, 2.
Smirnov, N.V. (1944). Approximate laws of distribution of random variables from empirical data, Uspehi Mat. Nauk, 10, 179–206.
Smirnov, N.V. (1947). Sur un critére de symetrie de la loi de distribution d’une variable aléatoire. Akad. Nauk SSSR. C.R. (Doklady), 56. 11–14.
Smirnov, N.V. (1948). Table for estimating the goodness of fit of empirical distributions, Ann. Math. Statist., 19, 279–281.
Smirnov, N.V. (1949). On the Cramér-von Mises criterion (in Russian), Uspehi. Mat. Nauk. 14. 196–197.
Stephens, MA. (1970). Use of the Kolmogorov-Smirnov, Cramer-von Mises and related statistics without extensive tables, J. Roy. Statist. Soc., Ser. B, 32, 115–122.
Stephens, M.A. (1974). EDF statistics for goodness-of-fit and some comparisons, J. Amer. Statist. Assoc., 69, 730–737.
Stephens. M.A. (1976). Asymptotic results for goodness-of-fit statistics with unknown parameters. Ann. Statist., 4, 357–369.
Stephens, M.A. (1983). Kolmogorov-Smirnov statistics; Kolmogorov-Smirnov tests of fit, in Encyclopedia of Statistical Sciences (S. Kotz. N.L. Johnson, and C.B. Read, eds.) vol. 4. Wiley. New York. 393–396; 398–402.
Stephens, M.A. (1986). Tests based on F.DF statistics, in Goodness-of-Fit Techniques (R.B. D’Agostino, and M.A. Stephens, eds). Marcel Dekker. New York, Chap. 4.
von Mises, R. (1931). Vorlesungen aus dem Gebiete der Angewandten Mathematik. 1, Wahrscheinlichkeitsrechnung und ihre Anwendung in der Statistik und theoretischen Physik. Springer: Wien.
Wald, A. and Wolfowitz, J. (1939). Confidence limits for continuous distribution functions. Ann. Math. Statist.. 10, 105–118.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1992 Springer-Verlag New York, Inc.
About this chapter
Cite this chapter
Stephens, M.A. (1992). Introduction to Kolmogorov (1933) On the Empirical Determination of a Distribution. In: Kotz, S., Johnson, N.L. (eds) Breakthroughs in Statistics. Springer Series in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-4380-9_9
Download citation
DOI: https://doi.org/10.1007/978-1-4612-4380-9_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-94039-7
Online ISBN: 978-1-4612-4380-9
eBook Packages: Springer Book Archive