Benchmark datasets for technical evaluation of SARS-CoV-2 bioinformatic analysis workflows
Source: NCBI BioProject (ID PRJNA763390)

Project name: Severe acute respiratory syndrome coronavirus 2
Description: This project is a collaborative effort to provide benchmark datasets that help public health laboratories build genome sequencing capacity, including assessment of bioinformatics infrastructure, comparison of different workflows, and evaluation of control metrics, to ensure sequencing quality for timely outbreak investigation and surveillance of SARS-CoV-2. Many of the included raw sequencing datasets fail one or more common, accepted quality tests and thus serve to calibrate bioinformatics tools designed to detect low coverage or contamination, or to compute other descriptive metrics evaluated during routine analyses or submission to public repositories. The provided sample information has been sanitized to appear as generic sequencing run data, and specific quality failures are indicated in the BioSample records. All human genetic data in the contaminated sequencing reads have been derived from a public reference genome assembly.
Data type: genome sequencing
Sample scope: Multiisolate
Relevance: Medical
Organization: Centers for Disease Control and Prevention
Last updated: 2021-09-15
Statistics: 19 samples; 19 experiments; 19 runs
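
Example: The description mentions tools that identify low coverage. As a minimal sketch of what such a check might look like, the Python below computes coverage breadth from a list of per-position read depths and flags a sample that falls below a cutoff. The function names, the depth threshold of 10, and the breadth threshold of 0.9 are hypothetical values chosen for illustration; they are not taken from this project or from any specific workflow.

```python
from typing import Sequence


def coverage_breadth(depths: Sequence[int], min_depth: int = 10) -> float:
    """Return the fraction of positions whose depth is at least min_depth.

    min_depth=10 is an illustrative assumption, not a project-defined cutoff.
    """
    if not depths:
        return 0.0
    covered = sum(1 for d in depths if d >= min_depth)
    return covered / len(depths)


def passes_coverage(
    depths: Sequence[int], min_depth: int = 10, min_breadth: float = 0.9
) -> bool:
    """Return True when coverage breadth meets the (hypothetical) min_breadth."""
    return coverage_breadth(depths, min_depth) >= min_breadth


if __name__ == "__main__":
    # Toy per-position depths; real input would come from an alignment.
    toy_depths = [0, 5, 12, 30, 30, 30, 30, 30, 30, 30]
    print(coverage_breadth(toy_depths))
    print(passes_coverage(toy_depths))
```

In practice the per-position depths would be produced by an aligner and a depth-reporting tool, and a benchmark sample marked as a coverage failure in its BioSample record would be expected to fail a check of this kind.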