Abstract
We aim at proposing a new model for binary rare events, i.e. binary dependent variable with a very small number of ones. We extend the Generalized Extreme Value (GEV) regression model proposed by Calabrese and Osmetti (Journal of Applied Statistics 40(6):1172–1188, 2013) to a Generalized Additive Model (GAM). We suggest to consider the quantile function of the GEV distribution as a link function in a GAM, so we propose the Generalized Extreme Value Additive (GEVA) model. In order to estimate the GEVA model, a modified version of the local scoring algorithm of GAM is proposed. Finally, to model default probability, we apply our proposal to empirical data on Italian Small and Medium Enterprises (SMEs). The results show that the GEVA model has a higher predictive accuracy to identify the rare event than the logistic additive model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agresti, A. (2002). Categorical data analysis. New York: Wiley.
Altman, E., & Sabato, G. (2006). Modeling credit risk for SMEs: evidence from the US market. Abacus, 19(6), 716–723.
Ansell, J., Lin, S., Ma, Y., & Andreeva, G. (2009, August). Experimenting with modeling default of small and medium sized enterprises (SMEs). In Credit Scoring and Credit Control XI Conference.
Basel Committee on Banking Supervision. (2005). International convergence of capital measurement and capital standards: A revised framework. Basel: Bank for International Settlements.
Calabrese, R., & Osmetti, S. A. (2013). Modelling small and medium enterprise loan defaults as rare events: the generalized extreme value regression model. Journal of Applied Statistics, 40(6), 1172–1188.
Cerved Group. (2011, February). Caratteristiche delle imprese, governance e probabilità di insolvenza. Report. Milan.
Ciampi, F., & Gordini, N. (2008). Using economic-financial ratios for small enterprize default prediction modeling: an empirical analysis. In Oxford Business & Economics Conference, Oxford.
Falk, M., Haler, J., & Reiss, R. (2010). Laws of small numbers: extremes and rare events (3rd ed.). Basel: Springer.
Fantazzini, D., & Figini, S. (2009). Random survival forests models for SME credit risk measurement. Methodology and Computing in Applied Probability, 11, 29–45.
Fantazzini, D., Figini, S., De Giuli, E., & Giudici P. (2009). Enhanced credit default models for heterogeneous SME segments. Journal of Financial Transformation, 25(N.1), 31–39.
Giudici, P. (2003). Applied data mining: statistical methods for business and industry. London: Wiley.
Hand, D. J. (2009). Measuring classifier performance: a coherent alternative to the area under the ROC curve. Machine Learning, 77, 103–123.
Hastie, T. J., & Tibshirami, R. J. (1990). Generalized additive models. Boca Raton: Chapman & Hall.
King, G., & Zeng, L. (2001). Logistic regression in rare events data. Political Analysis, 9, 321–354.
Kotz, S., & Nadarajah, S. (2000). Extreme value distributions. Theory and applications. London: Imperial Colleg Press.
Thomas, L., Edelman, D., & Crook, J. C. (2002). Credit scoring and its applications. Philadelphia: Society for Industrial and Applied Mathematics.
Vozzella, P., & Gabbi, G. (2010). Default and asset correlation: An empirical study for Italian SMEs. Working Paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Calabrese, R., Osmetti, S.A. (2014). A Generalized Additive Model for Binary Rare Events Data: An Application to Credit Defaults. In: Vicari, D., Okada, A., Ragozini, G., Weihs, C. (eds) Analysis and Modeling of Complex Data in Behavioral and Social Sciences. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-06692-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-06692-9_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06691-2
Online ISBN: 978-3-319-06692-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)