Hostname: page-component-8448b6f56d-cfpbc Total loading time: 0 Render date: 2024-04-24T08:32:33.763Z Has data issue: false hasContentIssue false

NONPARAMETRIC ESTIMATION OF REGRESSION FUNCTIONS WITH DISCRETE REGRESSORS

Published online by Cambridge University Press:  01 February 2009

Desheng Ouyang
Affiliation:
Shanghai University of Finance and Economics
Qi Li
Affiliation:
Texas A&M University and Tsinghua University
Jeffrey S. Racine*
Affiliation:
McMaster University
*
*Address correspondence to Jeffrey S. Racine, Department of Economics, McMaster University, Graduate Program in Statistics, McMaster University, Hamilton, ON L8S 4M4, Canada; e-mail: racinej@mcmaster.ca.

Abstract

We consider the problem of estimating a nonparametric regression model containing categorical regressors only. We investigate the theoretical properties of least squares cross-validated smoothing parameter selection, establish the rate of convergence (to zero) of the smoothing parameters for relevant regressors, and show that there is a high probability that the smoothing parameters for irrelevant regressors converge to their upper bound values, thereby automatically smoothing out the irrelevant regressors. A small-scale simulation study shows that the proposed cross-validation-based estimator performs well in finite-sample settings.

Type
ARTICLES
Copyright
Copyright © Cambridge University Press 2009

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

REFERENCES

Aitchison, J. & Aitken, C.G.G.. (1976) Multivariate binary discrimination by the kernel method. Biometrika 63, 413420.CrossRefGoogle Scholar
Bierens, H. (1983) Uniform consistency of kernel estimators of a regression function under generalized conditions. Journal of the American Statistical Association 78, 699707.CrossRefGoogle Scholar
Bowman, A.W., Hall, P. & Titterington, T.D.M. (1984) Cross-validation in nonparametric estimation of probabilities and probability densities. Biometrika 71, 341351.CrossRefGoogle Scholar
Fahrmeir, L. & Tutz, G. (1994) Multivariate Statistical Modeling Based on Generalized Linear Models. Springer-Verlag.CrossRefGoogle Scholar
Grund, B. & Hall, P. (1993) On the performance of kernel estimators for high-dimensional sparse binary data. Journal of Multivariate Analysis 44, 321344.CrossRefGoogle Scholar
Guerre, E., Perrigne, I. & Vuong, Q. (2000) Optimal nonparametric estimation of first price auction. Econometrica 68, 525574.CrossRefGoogle Scholar
Hahn, J. (1998) On the role of propensity score in efficient semiparametric estimation of average treatment effects. Econometrica 66, 315331.CrossRefGoogle Scholar
Hall, P. (1981) On nonparametric multivariate binary discrimination. Biometrika 68, 287294.CrossRefGoogle Scholar
Hall, P., Li, Q. & Racine, J.S. (2007) Nonparametric estimation of regression functions in the presence of irrelevant regressors. Review of Economics and Statistics 89, 784789.CrossRefGoogle Scholar
Hall, P., Racine, J. & Li, Q. (2004) Cross-validation and the estimation of conditional probability densities. Journal of the American Statistical Association 99, 10151026.Google Scholar
Hall, P. & Wand, M. (1988) On nonparametric discrimination using density differences. Biometrika 75, 541547.CrossRefGoogle Scholar
Härdle, W., Hall, P. & Marron, J.S. (1988) How far are automatically chosen regression smoothing parameters from their optimum? Journal of the American Statistical Association 83, 8699.Google Scholar
Härdle, W., Hall, P. & Marron, J.S. (1992) Regression smoothing parameters that are not far from their optimum. Journal of the American Statistical Association 87, 227233.Google Scholar
Härdle, W. & Marron, J.S. (1985) Optimal bandwidth selection in nonparametric regression function estimation. Annals of Statistics 13, 14651481.CrossRefGoogle Scholar
Hirano, K., Imbens, G.W. & Ridder, G. (2003) Efficient estimation of average treatment effects using the estimated propensity score. Econometrica 71, 11611189.CrossRefGoogle Scholar
Hong, Y. & Lee, T.H. (2003) Inference on predictability of exchange rates via generalized spectrum and nonlinear time series models. Review of Economics and Statistics 85, 10481062.CrossRefGoogle Scholar
Hong, Y. & Li, T.H. (2005) Nonparametric specification testing for continuous-time models with applications to term structures of interest rates. Review of Financial Studies 18, 3784.CrossRefGoogle Scholar
Lee, J. (1990) U-Statistics: Theory and Practice. Marcel Dekker.Google Scholar
Li, Q. & Racine, J. (2004) Cross-validated local linear nonparametric regression. Statistica Sinica 14, 485512.Google Scholar
Li, Q. & Racine, J.S. (2007) Nonparametric estimation of conditional CDF and quantile functions with mixed categorical and continuous data. Journal of Business & Economic Statistics, forthcoming.Google Scholar
Li, T., Perrigne, I. & Vuong, Q. (2002) Structural estimation of the affiliated private value auction model. RAND Journal of Economics 33, 171193.CrossRefGoogle Scholar
Li, T., Perrigne, I. & Vuong, Q. (2003) Semiparametric estimation of the optimal reserve prices in first-price auctions. Journal of Business & Economics Statistics 21, 5364.CrossRefGoogle Scholar
Masry, E. (1996) Multivariate local polynomial regression for time series: Uniform strong consistency and rates. Journal of Time Series Analysis 17, 571599.CrossRefGoogle Scholar
Racine, J.S., Hart, J. & Li, Q. (2006) Testing the significance of categorical predictor variables in nonparametric regression models. Econometric Reviews 25, 523544.CrossRefGoogle Scholar
Racine, J.S. & Li, Q. (2004) Nonparametric estimation of regression functions with both categorical and continuous data. Journal of Econometrics 119, 99130.CrossRefGoogle Scholar
Rosenthal, H.P. (1970) On the subspace of Lp (p ≥ 1) spanned by sequences of independent random variables. Israel Journal of Mathematics 8, 273303.CrossRefGoogle Scholar
Scott, D. (1992) Multivariate Density Estimation: Theory, Practice, and Visualization. Wiley.Google Scholar
Simonoff, J.S. (1996) Smoothing Methods in Statistics. Springer-Verlag.CrossRefGoogle Scholar
Titterington, D.M. (1980) A comparative study of kernel-based density estimates for categorical data. Technometrics 22, 259268.CrossRefGoogle Scholar
Wang, M.C. & van Ryzin, J. (1981) A class of smooth estimators for discrete distributions. Biometrika 68, 301309.CrossRefGoogle Scholar