A Bayesian mixture model for clustering droplet-based single-cell transcriptomic data from population studies

Basic information
Cell
8,922
Sample
4

Technology
10X Genomics
Omics
scRNA-seq
Source
PBMCs

Dataset ID
30967541
Platform
Illumina NextSeq 500
Species
Human
Disease
Healthy
Age range
0 - 0
Update date
2019-04-09
Summary

The recently developed droplet-based single-cell transcriptome sequencing (scRNA-seq) technology makes it feasible to perform a population-scale scRNA-seq study, in which the transcriptome is measured for tens of thousands of single cells from multiple individuals. Despite the advances of many clustering methods, there are few tailored methods for population-scale scRNA-seq studies. Here, we develop a Bayesian mixture model for single-cell sequencing (BAMM-SC) method to cluster scRNA-seq data from multiple individuals simultaneously. BAMM-SC takes raw count data as input and accounts for data heterogeneity and batch effect among multiple individuals in a unified Bayesian hierarchical model framework. Results from extensive simulation studies and applications of BAMM-SC to in-house experimental scRNA-seq datasets using blood, lung and skin cells from humans or mice demonstrate that BAMM-SC outperformed existing clustering methods with considerable improved clustering accuracy, particularly in the presence of heterogeneity among individuals.

Overall design

Peripheral blood mononuclear cells (PBMC) from healthy donors, skin cells from the dorsal mid-forearm of healthy donors, and lung cells from streptococcus pneumonia (SP) infected and naïve mice

Contributors

To be supplemented.

Contact

To be supplemented.

snRNA-Seq
Sample nameSample titleDiseaseGenderAgeSourceTreatmentTechnologyPlatformOmicsSample IDDataset IDAction
No data available