On Joint Dimension Reduction and Clustering of Categorical Data

Iodice D’Enza, Alfonso; Van de Velden, Michel; Palumbo, Francesco

doi:10.1007/978-3-319-06692-9_18

Alfonso Iodice D’Enza²²,
Michel Van de Velden²³ &
Francesco Palumbo²⁴

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

2370 Accesses
1 Citations

Abstract

There exist several methods for clustering high-dimensional data. One popular approach is to use a two-step procedure. In the first step, a dimension reduction technique is used to reduce the dimensionality of the data. In the second step, cluster analysis is applied to the data in the reduced space. This method may be referred to as the tandem approach. An important drawback of this method is that the dimension reduction may distort or hide the cluster structure. As an alternative, various authors have proposed joint dimension reduction and clustering approaches. In this paper we review some of these existing joint dimension reduction and clustering methods for categorical data in a unified framework that facilitates comparison.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Arabie, P., & Hubert, L. (1994). Cluster analysis in marketing research. IEEE Transactions on Automatic Control, 19, 716–723.
Google Scholar
Gifi, A. (1990). Nonlinear multivariate analysis. (579 pp). New York: John Wiley & Sons. ISBN 0-471-92620-5.
Google Scholar
Hwang, H., Dillon, W. R., & Takane, Y. (2006). An extension of multiple correspondence analysis for identifying heterogenous subgroups of respondents. Psychometrika, 71, 161–171.
Article MathSciNet Google Scholar
Iodice D’ Enza, A., & Palumbo, F. (2013). Iterative factor clustering of binary data. Computational Statistics, 28(2), 789–807.
Article MathSciNet Google Scholar
Lauro C. N., & D’Ambra, L. (1984). L’analyse non symétrique des correspondances. Data Analysis and Informatics, III, 433–446.
Google Scholar
MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. In L. M. L. Cam & J. Neyman (Eds.), Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability (Vol. 1, pp. 281–297).
Google Scholar
Nenadic, O., & Greenacre, M. (2007). Correspondence analysis in R, with two- and three-dimensional graphics: the ca package, Journal of Statistical Software, 20(3).
Google Scholar
Van Buuren, S., & Heiser, W. J. (1989). Clustering n objects in k groups under optimal scaling of variables. Psychometrika, 54, 699–706.
Article MathSciNet Google Scholar
Vichi, M., & Kiers, H. (2001). Factorial k-means analysis for two way data. Computational Statistics & Data Analysis, 37, 49–64.
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Università di Cassino, Cassino, (FR), Italy
Alfonso Iodice D’Enza
Erasmus University of Rotterdam, PA Rotterdam, The Netherlands
Michel Van de Velden
Università degli Studi di Napoli Federico II, Napoli, Italy
Francesco Palumbo

Authors

Alfonso Iodice D’Enza
View author publications
You can also search for this author in PubMed Google Scholar
Michel Van de Velden
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Palumbo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alfonso Iodice D’Enza .

Editor information

Editors and Affiliations

Department of Statistical Science, University of Rome "La Sapienza", Rome, Italy
Donatella Vicari
and Information Sciences, Tama University Graduate School of Management, Tokyo, Japan
Akinori Okada
Department of Political Science, University of Naples "Federico II", Naples, Italy
Giancarlo Ragozini
Fakultät Statistik, Technische Universität Dortmund, Dortmund, Germany
Claus Weihs

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Iodice D’Enza, A., Van de Velden, M., Palumbo, F. (2014). On Joint Dimension Reduction and Clustering of Categorical Data. In: Vicari, D., Okada, A., Ragozini, G., Weihs, C. (eds) Analysis and Modeling of Complex Data in Behavioral and Social Sciences. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-06692-9_18

Download citation

DOI: https://doi.org/10.1007/978-3-319-06692-9_18
Published: 17 June 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06691-2
Online ISBN: 978-3-319-06692-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics