China National GeneBank DataBases (CNGBdb) is a unified platform for biological big data sharing and application service. Based on underlying big data and cloud computing, it provides a variety of services including convenient submission and storage, automatic archive and management, full retrieval and download, and intelligent computing of biological data. The database is freely available to all, which will promote the effective use of data and the development of life sciences and bioindustry.
CNGBdb accepts biological data such as projects, samples, experiments, raw data, assembly, other supporting data, sequence variations and annotated sequences. Submitting data to CNGBdb represents that you have acquiesced to the open data protocol.
The data submission of CNGBdb includes CNGB Nucleotide Sequence Archive (CNSA), Pan immune repertoire database (PIRD) and GigaDB. Data of project, sample, experiment/run, assembly，variations can be submitted to the CNSA. PIRD accepts raw and processed sequences of immunoglobulins (IGs) and T cell receptors (TCRs) of human and other vertebrate species with different phenotypes. The support data can be submitted to GigaDB.
Currently, with respect to open data, only the ENA Accession ID can be used as a reference for articles, etc. The CNSA ID is currently only available for data archiving administration of China National GeneBank and users for data query retrieval.
Submitters can control the data information disclosure by setting up release date in each CNGBdb data module. The controlled time limits to two years by default. If the controlled time is more than two years, submitters should manually modify the time before it is about to expire. If it is expired, the modification is forbidden and the data will be released automatically.