Skip to main content

Principal Component Analysis and Extensions

  • Chapter
  • First Online:
Modern Psychometrics with R

Part of the book series: Use R! ((USE R))

Abstract

This chapter introduces principal component analysis (PCA), a technique for dimension reduction in multivariate datasets. At its core there is a matrix decomposition technique called singular value decomposition, which is introduced at the beginning of this chapter. This is followed by PCA model formulation, computation, and an application. Relationships with exploratory factor analysis are discussed as well. Subsequently, some PCA variants such as robust and sparse PCA are briefly discussed. The final two sections introduce two extensions of PCA. The first one is three-way PCA for three-way input data structures. The second one is called independent component analysis and illustrated using electroencephalography (EEG) data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    For readers who are more attracted to a sonic introduction to SVD rather than a formal one, Michael Greenacre composed an SVD song (“It had to be U”), available on YouTube.

  2. 2.

    In this section we follow a notation for indices of the data and components as well as the number of them which is different from that used before but which is standard in the literature of three-way analysis (see Kiers, 2000).

  3. 3.

    We omit the matrix formulation for all three-way PCA models since it requires some 3D matrix operations that are beyond scope of this book (see, e.g., Kroonenberg, 2008, for corresponding expressions).

  4. 4.

    Biplots are introduced in more detail in Chap. 10.

  5. 5.

    Thanks for Hrag Pailian for sharing this dataset.

References

  • Arend, A. M., & Zimmer, H. D. (2011). What does ipsilateral delay activity reflect? Inferences from slow potentials in a lateralized visual working memory task. Journal of Cognitive Neuroscience, 23, 4048–4056.

    Article  Google Scholar 

  • Carroll, J. D., & Arabie, P. (1980). Multidimensional scaling. Annual Review of Psychology, 31, 607–649.

    Article  Google Scholar 

  • Carroll, J. D., & Chang, J. J. (1970). Analysis of individual differences in multidimensional scaling via an N-way generalization of Eckart-Young decomposition. Psychometrika, 35, 283–319.

    Article  Google Scholar 

  • Eckart, C., & Young, G. (1936). The approximation of one matrix by another of lower rank. Psychometrika, 1, 211–218.

    Article  Google Scholar 

  • Fox, J., & Weisberg, S. (2011). An R companion to applied regression (2nd ed.). Thousand Oaks: Sage.

    Google Scholar 

  • Giordani, P., Kiers, H., & Del Ferraro, M. (2014). Three-way component analysis using the R package ThreeWay. Journal of Statistical Software, 57(1), 1–23. https://www.jstatsoft.org/index.php/jss/article/view/v057i07

    Google Scholar 

  • Harshman, R. A. (1970). Foundations of the PARAFAC procedure: Models and conditions for an “explanatory” multi-modal factor analysis (Technical report 16, UCLA Working Papers in Phonetics).

    Google Scholar 

  • Helwig, N. E. (2015a). eegkit: Toolkit for electroencephalography data. R package version 1.0-2. https://CRAN.R-project.org/package=eegkit

  • Helwig, N. E. (2015b). ica: Independent component analysis. R package version 1.0-1. https://CRAN.R-project.org/package=ica

  • Helwig, N. E. (2017). multiway: Component models for multi-way data. R package version 1.0-3. https://CRAN.R-project.org/package=multiway

  • Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417–441.

    Article  Google Scholar 

  • Hyvärinen, A., Karhunen, J., & Oja, E. (2001). Independent component analysis. New York: Wiley.

    Book  Google Scholar 

  • James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning with applications in R. New York: Springer.

    Book  Google Scholar 

  • Joliffe, I. T. (2002). Principal component analysis (2nd ed.). New York: Springer.

    Google Scholar 

  • Karatzoglou, A., Smola, A., Hornik, K., & Zeileis, A. (2004). kernlab: An S4 package for kernel methods in R. Journal of Statistical Software, 11(9), 1–20. http://www.jstatsoft.org/v11/i09/

    Article  Google Scholar 

  • Kiers, H. A. L. (2000). Towards a standardized notation and terminology in multiway analysis. Journal of Chemometrics, 14, 105–122.

    Article  MathSciNet  Google Scholar 

  • Kroonenberg, P. M. (2008). Applied multiway data analysis. Hoboken: Wiley.

    Book  Google Scholar 

  • Leibovici, D. (2010). Spatio-temporal multiway data decomposition using principal tensor analysis on k-modes: The R package PTAk. Journal of Statistical Software, 34(1), 1–34. https://www.jstatsoft.org/index.php/jss/article/view/v034i10

    Google Scholar 

  • Marchini, J. L., Heaton, C., & Ripley, B. D. (2013). fastICA: FastICA algorithms to perform ICA and projection pursuit. R package version 1.2-0. https://CRAN.R-project.org/package=fastICA

  • Mevik, B. H., Wehrens, R., & Liland, K. H. (2016). pls: Partial least squares and principal component regression. R package version 2.6-0. https://CRAN.R-project.org/package=pls

  • Pearson, K. (1901). On lines and planes of closest fit to systems of points in space. Philosophical Magazine, 2, 559–572.

    MATH  Google Scholar 

  • Revelle, W. (2017). psych: Procedures for psychological, psychometric, and personality research. R package version 1.7.8. http://CRAN.R-project.org/package=psych

  • Sharma, S. (1996). Applied multivariate techniques. New York: Wiley.

    Google Scholar 

  • Sidanius, J., Levin, S., van Laar, C., & Sears, D. O. (2010). The diversity challenge: Social identity and intergroup relations on the college campus. New York: The Russell Sage Foundation.

    Google Scholar 

  • Sigg, C. D., & Buhmann, J. M. (2008). Expectation-maximization for sparse and non-negative PCA. In Proceedings of the 25th International Conference on Machine Learning.

    Google Scholar 

  • Stacklies, W., Redestig, H., Scholz, M., Walther, D., & Selbig, J. (2007). pcaMethods: A bioconductor package providing PCA methods for incomplete data. Bioinformatics, 23, 1164–1167.

    Article  Google Scholar 

  • Stone, J. V. (2004). Independent component analysis: A tutorial introduction. Cambridge: The MIT Press.

    Google Scholar 

  • Timmerman, M. E. (2001). Component analysis of multisubject multivariate longitudinal data. PhD thesis, University of Groningen, Groningen.

    Google Scholar 

  • Treiblmaier, H. (2006). Datenqualität und individualisierte Kommunikation [Data Quality and Individualized Communication]. Wiesbaden: DUV Gabler Edition Wissenschaft.

    Google Scholar 

  • Treiblmaier, H., Bentler, P. M., & Mair, P. (2011). Formative constructs implemented via common factors. Structural Equation Modeling: A Multidisciplinary Journal, 18, 1–17.

    Article  MathSciNet  Google Scholar 

  • Tucker, L. R. (1966). Some mathematical notes on three-mode factor analysis. Psychometrika, 31, 279–311.

    Article  MathSciNet  Google Scholar 

  • Venables, W. N., & Ripley, B. D. (2002). Modern applied statistics with S (4th ed.). New York: Springer.

    Book  Google Scholar 

  • Vogel, E. K., & Machizawa, M. G. (2004). Neural activity predicts individual differences in visual working memory capacity. Nature, 428, 748–751.

    Article  Google Scholar 

  • Willerman, L., Schultz, R., Rutledge, J. N., & Bigler, E. (1991). In vivo brain size and intelligence. Intelligence, 15, 223–228.

    Article  Google Scholar 

  • Zou, H., & Hastie, T. (2012). elasticnet: Elastic-net for sparse estimation and sparse PCA. R package version 1.1. https://CRAN.R-project.org/package=elasticnet

  • Zou, H., Hastie, T., & Tibshirani, R. (2006). Sparse principal component analysis. Journal of Computational and Graphical Statistics, 15, 262–286.

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mair, P. (2018). Principal Component Analysis and Extensions. In: Modern Psychometrics with R. Use R!. Springer, Cham. https://doi.org/10.1007/978-3-319-93177-7_6

Download citation

Publish with us

Policies and ethics