Skip to main content

On Joint Dimension Reduction and Clustering of Categorical Data

  • Conference paper
  • First Online:
Analysis and Modeling of Complex Data in Behavioral and Social Sciences

Abstract

There exist several methods for clustering high-dimensional data. One popular approach is to use a two-step procedure. In the first step, a dimension reduction technique is used to reduce the dimensionality of the data. In the second step, cluster analysis is applied to the data in the reduced space. This method may be referred to as the tandem approach. An important drawback of this method is that the dimension reduction may distort or hide the cluster structure. As an alternative, various authors have proposed joint dimension reduction and clustering approaches. In this paper we review some of these existing joint dimension reduction and clustering methods for categorical data in a unified framework that facilitates comparison.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Arabie, P., & Hubert, L. (1994). Cluster analysis in marketing research. IEEE Transactions on Automatic Control, 19, 716–723.

    Google Scholar 

  • Gifi, A. (1990). Nonlinear multivariate analysis. (579 pp). New York: John Wiley & Sons. ISBN 0-471-92620-5.

    Google Scholar 

  • Hwang, H., Dillon, W. R., & Takane, Y. (2006). An extension of multiple correspondence analysis for identifying heterogenous subgroups of respondents. Psychometrika, 71, 161–171.

    Article  MathSciNet  Google Scholar 

  • Iodice D’ Enza, A., & Palumbo, F. (2013). Iterative factor clustering of binary data. Computational Statistics, 28(2), 789–807.

    Article  MathSciNet  Google Scholar 

  • Lauro C. N., & D’Ambra, L. (1984). L’analyse non symétrique des correspondances. Data Analysis and Informatics, III, 433–446.

    Google Scholar 

  • MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. In L. M. L. Cam & J. Neyman (Eds.), Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability (Vol. 1, pp. 281–297).

    Google Scholar 

  • Nenadic, O., & Greenacre, M. (2007). Correspondence analysis in R, with two- and three-dimensional graphics: the ca package, Journal of Statistical Software, 20(3).

    Google Scholar 

  • Van Buuren, S., & Heiser, W. J. (1989). Clustering n objects in k groups under optimal scaling of variables. Psychometrika, 54, 699–706.

    Article  MathSciNet  Google Scholar 

  • Vichi, M., & Kiers, H. (2001). Factorial k-means analysis for two way data. Computational Statistics & Data Analysis, 37, 49–64.

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alfonso Iodice D’Enza .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Iodice D’Enza, A., Van de Velden, M., Palumbo, F. (2014). On Joint Dimension Reduction and Clustering of Categorical Data. In: Vicari, D., Okada, A., Ragozini, G., Weihs, C. (eds) Analysis and Modeling of Complex Data in Behavioral and Social Sciences. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-06692-9_18

Download citation

Publish with us

Policies and ethics