Anna Konstorum

Research Data Scientist

nipalsMCIA: Flexible Multi-Block Dimensionality Reduction in R via Non-linear Iterative Partial Least Squares


Journal article


Max Mattessich, Joaquin Reyna, Edel Aron, F. Ay, M. Kilmer, S. Kleinstein, A. Konstorum
bioRxiv, 2024

Semantic Scholar DOI
Cite

Cite

APA   Click to copy
Mattessich, M., Reyna, J., Aron, E., Ay, F., Kilmer, M., Kleinstein, S., & Konstorum, A. (2024). nipalsMCIA: Flexible Multi-Block Dimensionality Reduction in R via Non-linear Iterative Partial Least Squares. BioRxiv.


Chicago/Turabian   Click to copy
Mattessich, Max, Joaquin Reyna, Edel Aron, F. Ay, M. Kilmer, S. Kleinstein, and A. Konstorum. “NipalsMCIA: Flexible Multi-Block Dimensionality Reduction in R via Non-Linear Iterative Partial Least Squares.” bioRxiv (2024).


MLA   Click to copy
Mattessich, Max, et al. “NipalsMCIA: Flexible Multi-Block Dimensionality Reduction in R via Non-Linear Iterative Partial Least Squares.” BioRxiv, 2024.


BibTeX   Click to copy

@article{max2024a,
  title = {nipalsMCIA: Flexible Multi-Block Dimensionality Reduction in R via Non-linear Iterative Partial Least Squares},
  year = {2024},
  journal = {bioRxiv},
  author = {Mattessich, Max and Reyna, Joaquin and Aron, Edel and Ay, F. and Kilmer, M. and Kleinstein, S. and Konstorum, A.}
}

Abstract

Motivation With the increased reliance on multi-omics data for bulk and single cell analyses, the availability of robust approaches to perform unsupervised analysis for clustering, visualization, and feature selection is imperative. Joint dimensionality reduction methods can be applied to multi-omics datasets to derive a global sample embedding analogous to single-omic techniques such as Principal Components Analysis (PCA). Multiple co-inertia analysis (MCIA) is a method for joint dimensionality reduction that maximizes the covariance between block- and global-level embeddings. Current implementations for MCIA are not optimized for large datasets such such as those arising from single cell studies, and lack capabilities with respect to embedding new data. Results We introduce nipalsMCIA, an MCIA implementation that solves the objective function using an extension to Non-linear Iterative Partial Least Squares (NIPALS), and shows significant speed-up over earlier implementations that rely on eigendecompositions for single cell multi-omics data. It also removes the dependence on an eigendecomposition for calculating the variance explained, and allows users to perform out-of-sample embedding for new data. nipalsMCIA provides users with a variety of pre-processing and parameter options, as well as ease of functionality for down-stream analysis of single-omic and global-embedding factors. Availability nipalsMCIA is available as a BioConductor package at https://bioconductor.org/packages/release/bioc/html/nipalsMCIA.html, and includes detailed documentation and application vignettes. Supplementary Materials are available online.