Introduction to Kolmogorov (1933) On the Empirical Determination of a Distribution

Stephens, M. A.

doi:10.1007/978-1-4612-4380-9_9

Part of the book series: Springer Series in Statistics (PSS)

Abstract

In 1933, A.N. Kolmogorov (1933a) published a short but landmark paper in the Italian journal Giornale dell’Istituto Italiano degli Attuari. He formally defined the empirical distribution function (EDF) and then asked how close it would be to the true distribution F(x) when F(x) is continuous. This leads naturally to the definition of what has come to be known as the Kolmogorov statistic (or sometimes the Kolmogorov-Smirnov statistic) D. Kolmogorov not only demonstrated that the difference between the EDF and F(x) can be made as small as we please by taking the sample size n large enough, but also gave a method for calculating the distribution of D at specified points for finite n, and used this to obtain the asymptotic distribution of D. The ideas in this paper have formed a platform for a vast literature, covering both interesting and important probability problems and methods of using the Kolmogorov statistic (and other statistics) for testing fit to a distribution. This literature continues with great strength today, more than 50 years later, and shows no signs of diminishing. It is evident that the ideas set in motion by Kolmogorov are of paramount importance in statistical analysis, and variations on the probabilistic problems, including modern methods of treating them, continue to hold attention.
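
To make the quantities named in the abstract concrete, the following is a minimal Python sketch, not taken from the chapter. It assumes the usual definition D = sup over x of |F_n(x) - F(x)|, where F_n is the EDF of a sample of size n, and the standard series form of the limiting distribution of sqrt(n) D, namely K(lambda) = 1 - 2 * sum over k >= 1 of (-1)^(k-1) exp(-2 k^2 lambda^2). The function names `kolmogorov_statistic` and `kolmogorov_limit_cdf` and the truncation parameter `terms` are illustrative choices only.

```python
import math


def kolmogorov_statistic(sample, cdf):
    """Return D = sup_x |F_n(x) - F(x)| for a continuous hypothesised CDF.

    `cdf` is assumed to be the continuous distribution function F.
    """
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs, start=1):
        fx = cdf(x)
        # The EDF jumps from (i - 1)/n to i/n at the i-th order statistic,
        # so the supremum is attained just before or at one of these jumps.
        d = max(d, i / n - fx, fx - (i - 1) / n)
    return d


def kolmogorov_limit_cdf(lam, terms=100):
    """Approximate the limiting CDF of sqrt(n) * D at lam.

    Uses the truncated alternating series
    K(lam) = 1 - 2 * sum_{k>=1} (-1)^(k-1) * exp(-2 * k^2 * lam^2).
    The truncation is an assumption of this sketch; it is inaccurate
    only for lam very close to zero, where K(lam) is negligible anyway.
    """
    if lam <= 0:
        return 0.0
    s = sum(
        (-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
        for k in range(1, terms + 1)
    )
    # Clamp to [0, 1] to absorb rounding and truncation error.
    return min(1.0, max(0.0, 1.0 - 2.0 * s))
```

As a usage sketch, for a sample `xs` tested against the uniform distribution on [0, 1], `d = kolmogorov_statistic(xs, lambda x: x)` gives the statistic and `1 - kolmogorov_limit_cdf(math.sqrt(len(xs)) * d)` gives an approximate large-sample significance level. For small n the exact finite-sample distribution is the appropriate reference, and the asymptotic value is only a rough guide.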

References

Anderson, T.W., and Darling, D.A. (1952). Asymptotic theory of certain goodness of fit criteria based on stochastic processes. Ann. Math. Statist., 23, 193–212.
Birnbaum, Z.W., and Tingey, F.H. (1951). One-sided confidence contours for distribution functions. Ann. Math. Statist., 22, 592–596.
Birnbaum, Z.W. (1952). Numerical tabulation of the distribution of Kolmogorov’s statistics for finite sample size. J. Amer. Statist. Assoc., 47, 425–441.
Cramér, H. (1928). On the composition of elementary errors. Second paper: Statistical applications. Skand. Aktuarietidskrift, 11, 171–180.
Chang, L.C. (1955). On the ratio of an empirical distribution function to the theoretical distribution function. Acta Math. Sinica, 5, 347–368 [also Selected Translations in Math. Statist. Prob., 4 (1963), 17–38].
Cheng, P. (1958). Non-negative jump points of an empirical distribution function relative to a theoretical distribution function. Acta Math. Sinica, 8, 333–347 [also Selected Translations in Math. Statist. Prob., 3 (1962), 205–224].
Darling, D.A. (1955). The Cramér-Smirnov test in the parametric case. Ann. Math. Statist., 26, 1–20.
David, F.N., and Johnson, N.L. (1948). The probability integral transformation when parameters are estimated from the sample. Biometrika, 35, 182–192.
Donsker, M.D. (1952). Justification and extension of Doob’s heuristic approach to the Kolmogorov-Smirnov theorems. Ann. Math. Statist., 23, 277–281.
Doob, J.L. (1949). Heuristic approach to the Kolmogorov-Smirnov theorems. Ann. Math. Statist., 20, 393–403.
Drion, E.F. (1952). Some distribution-free tests for the difference between two empirical cumulative distribution functions. Ann. Math. Statist., 23, 563–574.
Durbin, J. (1973). Distribution Theory for Tests Based on the Sample Distribution Function. CBMS-NSF Regional Conference Series in Applied Mathematics. Society for Industrial and Applied Mathematics, Philadelphia, Pa.
Durbin, J. (1975). Kolmogorov-Smirnov tests when parameters are estimated with applications to tests of exponentiality and tests on spacings. Biometrika, 62 (1), 5–22.
Feller, W. (1948). On the Kolmogorov-Smirnov limit theorems for empirical distributions. Ann. Math. Statist., 19, 177–189.
Gibbons, J.D. (1983). Kolmogorov-Smirnov symmetry test, in Encyclopedia of Statistical Sciences (S. Kotz, N.L. Johnson, and C.B. Read, eds.), vol. 4. Wiley, New York, pp. 396–398.
Gnedenko, B.V. (1952). Some results on the maximum discrepancy between two empirical distributions. Dokl. Akad. Nauk SSSR, 82, 661–663 [also Selected Translations in Math. Statist. Prob., 1 (1961), 73–76].
Gnedenko, B.V., and Korolyuk, V.S. (1951). On the maximum discrepancy between two empirical distributions. Dokl. Akad. Nauk SSSR, 80, 525–528 [also Selected Translations in Math. Statist. Prob., 1 (1961), 13–22].
Gnedenko, B.V., and Mihalevic, V.S. (1952a). On the distribution of the number of excesses of one empirical distribution function over another. Dokl. Akad. Nauk SSSR, 82, 841–843 [also Selected Translations in Math. Statist. Prob., 1 (1961), 83–85].
Gnedenko, B.V., and Mihalevic, V.S. (1952b). Two theorems on behaviour of empirical distribution functions. Dokl. Akad. Nauk SSSR, 85, 25–27 [also Selected Translations in Math. Statist. Prob., 1 (1961), 55–58].
Gnedenko, B.V., and Rvaceva, E.L. (1952). On a problem of the comparison of two empirical distributions. Dokl. Akad. Nauk SSSR, 82, 513–516 [also Selected Translations in Math. Statist. Prob., 1 (1961), 69–72].
Guilbaud, O. (1988). Exact Kolmogorov-type tests for left-truncated and/or right-censored data. J. Amer. Statist. Assoc., 83, 213–221.
Hall, W.J., and Wellner, J.A. (1980). Confidence band for a survival curve from censored data. Biometrika, 67, 133–143.
Kac, M., Kiefer, J., and Wolfowitz, J. (1955). On tests of normality and other tests of goodness of fit based on distance methods. Ann. Math. Statist., 26, 189–211.
Khmaladze, E.V. (1986). Introduction to Kolmogorov (1933). In Teoriia Veroiatnostei i Matematicheskaia Statistika. Nauka: Moscow.
Kendall, M.G., and Stuart, A. (1979). The Advanced Theory of Statistics, 4th ed. Macmillan, New York.
Kolmogorov, A. (1931). Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung. Math. Ann., 104, 415–458.
Kolmogorov, A. (1933a). Sulla determinazione empirica di una legge di distribuzione. Ist. Ital. Attuari. G., 4, 1–11.
Kolmogorov, A. (1933b). Über die Grenzwertsätze der Wahrscheinlichkeitsrechnung. Bull. (Izvestija) Acad. Sci. URSS, 363–372.
Koroljuk, V.S. (1955). On the discrepancy of empiric distributions for the case of two independent samples. Izv. Akad. Nauk SSSR Ser. Mat., 19, 91–96 [also Selected Translations in Math. Statist. Prob., 4 (1963), 105–122].
Kotz, S., Johnson, N.L., and Read, C.B. (eds.) (1989). Kolmogorov, Andrei Nikolayevich, in Encyclopedia of Statistical Sciences, Suppl. Volume. Wiley, New York, 78–80.
Kuiper, N.H. (1960). Tests concerning random points on a circle. Proc. Koninkl. Neder. Akad. van Wetenschappen, A, 63, 38–47.
Lockhart, R.A., and Stephens, M.A. (1985a). Goodness-of-fit tests for the gamma distribution. Technical report, Department of Mathematics and Statistics, Simon Fraser University.
Lockhart, R.A., and Stephens, M.A. (1985b). Goodness-of-fit tests for the von Mises distribution. Biometrika, 72, 647–652.
Massey, F.J. (1950). A note on the estimation of a distribution function by confidence limits. Ann. Math. Statist., 21, 116–119.
Massey, F.J. (1951a). The Kolmogorov-Smirnov tests for goodness of fit. J. Amer. Statist. Assoc., 46, 68–78.
Massey, F.J. (1951b). The distribution of the maximum deviation between two sample cumulative step functions. Ann. Math. Statist., 22, 125–128.
Massey, F.J. (1952). Distribution table for the deviation between sample cumulatives. Ann. Math. Statist., 23, 435–441.
Miller, L.H. (1956). Table of percentage points of Kolmogorov statistics. J. Amer. Statist. Assoc., 51, 111–121.
Niederhausen, H. (1981a). Tables of significant points for the variance-weighted Kolmogorov-Smirnov statistics. Technical report, Department of Statistics, Stanford University.
Niederhausen, H. (1981b). Sheffer polynomials for computing exact Kolmogorov-Smirnov and Rényi type distributions. Ann. Statist., 5, 923–944.
Pettitt, A.N., and Stephens, M.A. (1977). The Kolmogorov-Smirnov goodness-of-fit statistics with discrete and grouped data. Technometrics, 19, 205–210.
Pyke, R. (1959). The supremum and infimum of the Poisson process. Ann. Math. Statist., 30, 568–576.
Rényi, A. (1953). On the theory of order statistics. Acta Math. Acad. Sci. Hungary, 4, 191–231.
Sahler, W. (1968). A survey of distribution-free statistics based on distances between distribution functions. Metrika, 13, 149–169.
Shiryaev, A.N. (1989). Kolmogorov’s life and creative activities. Ann. Prob., 17, 866–944.
Smirnov, N.V. (1936, 1937). Sur la distribution de ω² (critérium de M. R. von Mises). C. R. Acad. Sci. (Paris), 202 (1936), 449–452 [paper with the same title in Russian, Recueil Math., 2 (1937), 973–993].
Smirnov, N.V. (1939a). Ob uklonenijah empiricheskoi krivoi raspredelenija. Recueil Math. (Mat. Sbornik), N.S., 6 (48), 13–26.
Smirnov, N.V. (1939b). On the estimation of the discrepancy between empirical curves of distributions for two independent samples. Bull. Math. Univ. Moscou, 2, 2.
Smirnov, N.V. (1944). Approximate laws of distribution of random variables from empirical data. Uspehi Mat. Nauk, 10, 179–206.
Smirnov, N.V. (1947). Sur un critère de symétrie de la loi de distribution d’une variable aléatoire. Akad. Nauk SSSR, C.R. (Doklady), 56, 11–14.
Smirnov, N.V. (1948). Table for estimating the goodness of fit of empirical distributions. Ann. Math. Statist., 19, 279–281.
Smirnov, N.V. (1949). On the Cramér-von Mises criterion (in Russian). Uspehi Mat. Nauk, 14, 196–197.
Stephens, M.A. (1970). Use of the Kolmogorov-Smirnov, Cramér-von Mises and related statistics without extensive tables. J. Roy. Statist. Soc., Ser. B, 32, 115–122.
Stephens, M.A. (1974). EDF statistics for goodness-of-fit and some comparisons. J. Amer. Statist. Assoc., 69, 730–737.
Stephens, M.A. (1976). Asymptotic results for goodness-of-fit statistics with unknown parameters. Ann. Statist., 4, 357–369.
Stephens, M.A. (1983). Kolmogorov-Smirnov statistics; Kolmogorov-Smirnov tests of fit, in Encyclopedia of Statistical Sciences (S. Kotz, N.L. Johnson, and C.B. Read, eds.), vol. 4. Wiley, New York, 393–396; 398–402.
Stephens, M.A. (1986). Tests based on EDF statistics, in Goodness-of-Fit Techniques (R.B. D’Agostino and M.A. Stephens, eds.). Marcel Dekker, New York, Chap. 4.
von Mises, R. (1931). Vorlesungen aus dem Gebiete der Angewandten Mathematik. 1, Wahrscheinlichkeitsrechnung und ihre Anwendung in der Statistik und theoretischen Physik. Springer: Wien.
Wald, A., and Wolfowitz, J. (1939). Confidence limits for continuous distribution functions. Ann. Math. Statist., 10, 105–118.

Author information

Authors and Affiliations

Simon Fraser University, Canada
M. A. Stephens

Editor information

Editors and Affiliations

College of Business and Management, University of Maryland at College Park, 20742, College Park, MD, USA
Samuel Kotz
Department of Statistics Phillips Hall, The University of North Carolina at Chapel Hill, 27599, Chapel Hill, NC, USA
Norman L. Johnson

About this chapter

Cite this chapter

Stephens, M.A. (1992). Introduction to Kolmogorov (1933) On the Empirical Determination of a Distribution. In: Kotz, S., Johnson, N.L. (eds) Breakthroughs in Statistics. Springer Series in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-4380-9_9

DOI: https://doi.org/10.1007/978-1-4612-4380-9_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-94039-7
Online ISBN: 978-1-4612-4380-9
eBook Packages: Springer Book Archive