Skip to main content

Parametric Cluster Analysis and Mixture Regression

  • Chapter
  • First Online:
Modern Psychometrics with R

Part of the book series: Use R! ((USE R))

  • 5744 Accesses

Abstract

This chapter is about advanced parametric clustering techniques based on the concept of mixture distributions. The first section introduces mixture distributions from a general perspective, followed by two popular applications in clustering: normal mixture models (latent profile analysis) for metric input variables and multinomial mixture models (latent class analysis) for categorical variables. Subsequently, these ideas are extended to mixed input scale levels. In the following section, the mixture distribution concept is embedded into a regression framework. In mixture regression models, clustering and estimation of regression parameters are performed simultaneously. By means of Dirichlet process regression, we add another complexity layer to the modeling framework by letting an algorithm determine the optimal number of clusters. Finally, the focus is on latent Dirichlet allocations: topic models for clustering text data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Note that in mclust a maximum-BIC strategy is used; that is, the higher the BIC, the better the fit.

  2. 2.

    In practice, the user should again try out different numbers of clusters and pick the one with the lowest BIC.

  3. 3.

    For an overview see the corresponding task view on CRAN (URL: https://cran.r-project.org/web/views/NaturalLanguageProcessing.html).

  4. 4.

    Other packages for topic modeling in R are mallet (Mimno, 2013) and lda (Chang, 2015).

References

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mair, P. (2018). Parametric Cluster Analysis and Mixture Regression. In: Modern Psychometrics with R. Use R!. Springer, Cham. https://doi.org/10.1007/978-3-319-93177-7_12

Download citation

Publish with us

Policies and ethics