Show simple item record

dc.contributor.authorHsu, Lauren L
dc.contributor.authorCulhane, Aedín C
dc.date.accessioned2024-07-23T08:04:11Z
dc.date.available2024-07-23T08:04:11Z
dc.date.issued2023-01-21
dc.identifier.pmid36681709
dc.identifier.doi10.1038/s41598-022-26434-1
dc.identifier.urihttp://hdl.handle.net/10147/642427
dc.descriptionEffective dimension reduction is essential for single cell RNA-seq (scRNAseq) analysis. Principal component analysis (PCA) is widely used, but requires continuous, normally-distributed data; therefore, it is often coupled with log-transformation in scRNAseq applications, which can distort the data and obscure meaningful variation. We describe correspondence analysis (CA), a count-based alternative to PCA. CA is based on decomposition of a chi-squared residual matrix, avoiding distortive log-transformation. To address overdispersion and high sparsity in scRNAseq data, we propose five adaptations of CA, which are fast, scalable, and outperform standard CA and glmPCA, to compute cell embeddings with more performant or comparable clustering accuracy in 8 out of 9 datasets. In particular, we find that CA with Freeman-Tukey residuals performs especially well across diverse datasets. Other advantages of the CA framework include visualization of associations between genes and cell populations in a "CA biplot," and extension to multi-table analysis; we introduce corralm for integrative multi-table dimension reduction of scRNAseq data. We implement CA for scRNAseq data in corral, an R/Bioconductor package which interfaces directly with single cell classes in Bioconductor. Switching from PCA to CA is achieved through a simple pipeline substitution and improves dimension reduction of scRNAseq datasets.en_US
dc.language.isoenen_US
dc.rights© 2023. The Author(s).
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectsingle cellen_US
dc.subjectpipelineen_US
dc.titleCorrespondence analysis for dimension reduction, batch integration, and visualization of single-cell RNA-seq data.en_US
dc.typeArticleen_US
dc.typeOtheren_US
dc.identifier.eissn2045-2322
dc.identifier.journalScientific reportsen_US
dc.source.journaltitleScientific reports
dc.source.volume13
dc.source.issue1
dc.source.beginpage1197
dc.source.endpage
refterms.dateFOA2024-07-23T08:04:14Z
dc.source.countryUnited States
dc.source.countryEngland


Files in this item

Thumbnail
Name:
41598_2022_Article_26434.pdf
Size:
3.903Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record

© 2023. The Author(s).
Except where otherwise noted, this item's license is described as © 2023. The Author(s).