High-content CRISPR screens link coronary artery disease genes to endothelial cell programs [Pilot scRNA-seq]
Source: NCBI BioProject (ID PRJNA875320)

0 0

Project name: High-content CRISPR screens link coronary artery disease genes to endothelial cell programs [Pilot scRNA-seq]
Description: Genome-wide association studies (GWAS) have discovered thousands of risk loci for common, complex diseases, each of which could point to genes and gene programs that influence disease. For some diseases, it has been observed that GWAS signals converge on a smaller number of biological programs, and that this convergence can help to identify causal genes. However, identifying such convergence remains challenging: each GWAS locus can have 2-20 candidate genes, the cellular programs a gene participates in are difficult to define in an unbiased fashion, and it remains unclear which genes and programs would be likely to influence disease risk. Here, we explored a new approach to address this challenge, by creating an unbiased catalog of gene programs and their regulators in endothelial cells to link variants to functions for coronary artery disease (CAD). To do so, we applied CRISPRi-Perturb-seq to knock down all expressed genes within 500 Kb of all CAD GWAS loci (2,285 genes in total) and measure their effects on the transcriptome using single-cell RNA-seq. We used consensus non-negative matrix factorization to define 60 gene expression programs—including core cellular programs, such as ribosome biogenesis, and endothelial cell-specific programs, such as flow response and angiogenesis—and link these programs to upstream regulators including transcription factors, chromatin regulators, metabolic enzymes, and signaling cascades. By combining this gene-to-program catalog with variant-to-gene maps, we find that candidate CAD genes converge onto 6 interrelated gene programs, together involving known and novel genes in 39 of 229 CAD GWAS loci. Analysis of these programs revealed that the cerebral cavernous malformations (CCM) complex—whose potential connection to CAD has not been previously explored—acts upstream to regulate other CAD genes involved in cytoskeletal organization, extracellular matrix remodeling, and cell migration. The strongest regulator of these programs is TLNRD1, a highly conserved but poorly studied gene that we show acts in the CCM pathway and regulates actin organization and endothelial cell barrier function. Together, our study nominates new genes that likely influence risk for CAD, identifies convergence of CAD risk loci into certain gene programs in endothelial cells, and demonstrates a generalizable strategy to catalog gene programs to connect disease variants to functions.Overall design: This series contains pilot Perturb-seq studies, which were used to optimize conditions for and cross validate the larger Perturb-seq study reported in GSE210681. Human immortalized endothelial cells engineered to contain doxycycline inducible CRISPR interference machinery (CRISPRi TeloHAEC or CRISPRi Eahy926), were transduced with one of two guide RNA libraries targeting gene promoters (referred to as the 50gene & 200gene libraries), in a CROP-seq vector. After 5 days of doxycycline treatment, cells were run on a single lane of a 10X Chromium Controller and scRNA-seq libraries generated. For Eahy926, the maximum number of cells recommended in the 10x protocol was used (16k), or the lane was "superloaded" with ~10x more cells (150k). For TeloHAEC, only the superloading condition was used. polyA+ cDNA surrounding the guide sequences was amplified to make "dialout" libraries, used to link guide RNA sequences to scRNAseq cell barcodes.
Data type: Transcriptome or Gene expression
Sample scope: Multiisolate
Relevance: Medical
Organization: Broad Institute
Last updated: 2022-08-31
Statistics: 15 samples; 15 experiments; 48 runs