A clustering-independent method for finding differentially expressed genes in single-cell transcriptome data.
IF: 17.694
Cited by: 13


A common analysis of single-cell sequencing data includes clustering of cells and identifying differentially expressed genes (DEGs). How cell clusters are defined has important consequences for downstream analyses and the interpretation of results, but is often not straightforward. To address this difficulty, we present singleCellHaystack, a method that enables the prediction of DEGs without relying on explicit clustering of cells. Our method uses Kullback-Leibler divergence to find genes that are expressed in subsets of cells that are non-randomly positioned in a multidimensional space. Comparisons with existing DEG prediction approaches on artificial datasets show that singleCellHaystack has higher accuracy. We illustrate the usage of singleCellHaystack through applications on 136 real transcriptome datasets and a spatial transcriptomics dataset. We demonstrate that our method is a fast and accurate approach for DEG prediction in single-cell data. singleCellHaystack is implemented as an R package and is available from CRAN and GitHub.


Spatial Transcriptomics

MeSH terms

Bone Marrow
Cluster Analysis
Computational Biology
Data Mining
Gene Expression
Gene Expression Profiling
Gene Regulatory Networks
Single-Cell Analysis


Vandenbon, Alexis
Diez, Diego

Recommend literature

Similar data