FAQ

The CNGB Sequence Archive (CNSA) is a convenient and fast online submission system for biological research data, including projects, samples, experiments, runs, assemblies, and variations. It follows the International Nucleotide Sequence Database Collaboration (INSDC) and DataCite standards and accepts sequencing data, related information, and analysis results from researchers worldwide. Its submission service can complement the literature publishing process by supporting early data sharing. CNSA is committed to storing and sharing biological sequencing data, information, and analysis results, and aims to give researchers worldwide comprehensive data and information resources that are easy to access and reuse.

So far, CNSA has archived XXXX TB of data, XXX TB of public data, XXXX projects, XXXXX samples, XXXXX experiments, and XXXXX runs.

CNSA has also supported XX articles, XX journals, and XX organizations.

See Statistics for detailed figures.

CNSA currently accepts metadata and data files for projects, samples, experiments/runs, assemblies, and variations.

Metadata is data that describes an information resource or a data object. For example, metadata of experiment/run refers to a description of experiment/run data, such as sequencing platform, library strategy, etc.; data files of experiment/run refer to files of sequencing reads.

Click "Submit" on the homepage or "Submission portal" in the navigation bar, select the data type, and follow the page prompts to submit the metadata or data files. If you need to upload data files, the data upload options are listed in "My service". For the FTP upload method, refer to "FTP data upload". For calculating MD5 values, refer to "MD5 check". For a short overview of the submission process, refer to the "CNSA Handbook (simple version in English)". For the detailed submission process, refer to "Data Submission".

CNSA offers three data management forms: Public, Controlled, and Private. The data submitter chooses a data management form when submitting a project.

Public: The metadata and data files associated with the project will be public. The data submitter sets a release date during project submission, and all metadata and data files associated with the project become public on that date. Public data is displayed on the China National GeneBank DataBase (CNGBdb) and is open to the world; users can access and use it freely at CNGBdb.

Controlled: The metadata associated with the project will be public, and the data files will be controlled. The data submitter sets a metadata release date during project submission, and all metadata associated with the project becomes public on that date. For controlled data, only the metadata for the project, samples, and other data types is displayed on CNGBdb; the data files are not displayed on the platform. Other registered users can apply for access to controlled data. Applicants may use the data only after the data submitter has reviewed and approved the request, and the data submitter grants the applicant access to the data or provides the data files.

Private: Both the metadata and the data files associated with the project are controlled. Private data is not displayed on CNGBdb, and no access or download requests are accepted for it.

First download an FTP client, such as FileZilla. Then log in with the FTP server address, username, and password provided by CNSA, and upload the data files. The FTP server address, username, and password are shown during the data submission process and in "My service". For details on how to upload, refer to "FTP data upload".
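For users who prefer a script to a graphical client, the upload step can be sketched with Python's standard ftplib module. This is only an illustration, assuming the server accepts plain FTP logins: the host, username, password, and file names below are placeholders rather than real CNSA values, and the function names are hypothetical.

```python
from ftplib import FTP
from pathlib import Path


def upload_files(ftp, paths):
    """Upload each local file to the current remote directory in binary mode.

    `ftp` is an already logged-in ftplib.FTP connection.
    Returns the list of remote file names that were sent.
    """
    uploaded = []
    for path in map(Path, paths):
        with path.open("rb") as handle:
            # STOR stores the file on the server under its local base name.
            ftp.storbinary(f"STOR {path.name}", handle)
        uploaded.append(path.name)
    return uploaded


def connect_and_upload(host, username, password, paths):
    """Log in and upload; all arguments come from the details in "My service"."""
    with FTP(host) as ftp:
        ftp.login(user=username, passwd=password)
        return upload_files(ftp, paths)


# Example call with placeholder values (not real credentials):
# connect_and_upload("ftp.example.org", "your_username", "your_password",
#                    ["sample_R1.fq.gz", "sample_R2.fq.gz"])
```

Keeping the login separate from the transfer loop makes it easy to reuse one connection for many files.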

MD5 (Message Digest Algorithm 5) is a hash function that calculates a hash value for a given file. The MD5 value is a 32-character string of digits and letters (hexadecimal). Computing the MD5 checksum of a file before and after transfer and comparing the two values verifies that the file was transmitted without corruption. For details on how to generate MD5 values, refer to "MD5 check".
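As an illustration of the check described above, an MD5 value can be computed with Python's standard hashlib module. This is a minimal sketch, not the procedure from "MD5 check"; the function name and the chunk size are choices made for this example.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the MD5 hex digest (32 characters) of the file at `path`.

    The file is read in chunks so that large sequencing files do not
    have to fit in memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Example (the file name is a placeholder):
# print(md5_of_file("sample_R1.fq.gz"))
```

If the value computed locally before upload matches the value computed for the transferred copy, the file arrived intact.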

Controlled data is data whose project metadata is public while its data files are controlled. Other registered users can apply for access to controlled data through CNGB data access. Applicants may use the data only after the data submitter has reviewed and approved the request, and the data submitter grants the applicant access to the data or provides the data files.

Click "My Submission" to view your submissions. In the "Status" column you can view the corresponding submission ID or download the metadata file with accessions.

CNSA automatically assigns accession numbers with the following prefixes: CNP (project), CNS (sample), CNSebb (EBB sample), CNX (experiment), CNR (run), CNA (assembly), varc (variation), etc. Refer to "Numbering rules" for the specific numbering rules.
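The prefixes listed above can be used to tell data types apart in a script. The sketch below checks only the prefix, since the exact format of the rest of an accession is defined in "Numbering rules" and is not assumed here; the function name and the sample accessions in the comments are hypothetical.

```python
# Prefix-to-type mapping taken from the list above.
ACCESSION_PREFIXES = {
    "CNP": "project",
    "CNS": "sample",
    "CNSebb": "EBB sample",
    "CNX": "experiment",
    "CNR": "run",
    "CNA": "assembly",
    "varc": "variation",
}


def accession_type(accession):
    """Return the data type implied by an accession prefix, or None.

    Longer prefixes are tried first so that "CNSebb" is not mistaken
    for the shorter "CNS" prefix.
    """
    for prefix in sorted(ACCESSION_PREFIXES, key=len, reverse=True):
        if accession.startswith(prefix):
            return ACCESSION_PREFIXES[prefix]
    return None


# Examples with made-up accessions:
# accession_type("CNP0000001")  -> "project"
# accession_type("CNSebb00001") -> "EBB sample"
```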

Status: Explanation
Unfinished: The data submission process has not yet reached the final step.
Processing: The data submission process has been completed, and the data has not yet been reviewed or is under review.
Processed: The data has been reviewed and has a release date, but the release date has not yet been reached.
Controlled: The data has been reviewed, has no release date, and cannot be made public.
Public: The data has been reviewed and was made public on the release date set by the data submitter.

Click "My submission", find the object that needs to be modified, and click the pencil icon to modify it. The modification process shows which fields can be changed and the requirements for each field. Please do not create a new submission with similar information in order to make a modification. Modifications do not affect references to the assigned accession. If you need to modify an object that has no pencil icon, or to delete objects, email datasubs@cngb.org to request the modification or deletion and provide the corresponding accession.

CNSA offers three data management forms: Public, Controlled, and Private. You choose a data management form when submitting a project. The release date of public data and the metadata release date of controlled data are set during project submission. To modify the release date, click "My submission" and find the submission you need to change.

1)   If the status of the project is "Unfinished", click the pencil icon in the project's "Status" column to enter the submission process and modify the release date.

2)   If the status of the project is "Processing" or "Processed", click the date in the "Release date" column or the pencil icon to modify the release date.

3)   If the status of the project is "Public" or "Controlled", send an email to datasubs@cngb.org to request the modification, stating the project accession and the reason for the change.
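The three cases above amount to a lookup from project status to the way the release date can be changed. The following sketch only restates those rules as code; the function name and the wording of the returned strings are illustrative and are not part of any CNSA interface.

```python
def release_date_change_route(status):
    """Return how to change the release date for a project in `status`."""
    routes = {
        "Unfinished": "Click the pencil icon in the Status column.",
        "Processing": "Click the release date or the pencil icon.",
        "Processed": "Click the release date or the pencil icon.",
        "Public": "Email datasubs@cngb.org with the accession and reason.",
        "Controlled": "Email datasubs@cngb.org with the accession and reason.",
    }
    if status not in routes:
        raise ValueError(f"Unknown project status: {status!r}")
    return routes[status]
```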

To change the data management form, send an email to datasubs@cngb.org with the project accession and the reason for the change. To change controlled data to public, you will also need to fill out the Data Submission Review Application Form and prepare the review materials it requires.

1)   Legality and compliance review: After the project is submitted, the system sends the Data Submission Review Application Form to your email address. Complete the form and send it, together with the review materials it requests, to the CNGB Bioresource Sharing Compliance Center (BSCC) at cngb-ebb@cngb.org. The BSCC reviews the legality and compliance of the submitted data, for example ethics review and human genetic resources review.

2)   Data review: CNSA's data administrators review the completeness, correctness, and relevance of the submitted data. If the submitted data is incorrect, a data administrator will notify you by email (datasubs@cngb.org) so that you can correct it. If the submitted data is correct, the review time depends on the number of samples and the amount of data submitted. It generally does not exceed 3 working days.

If your data has been submitted to CNGBdb-CNSA, you can add the following statement to your manuscript to cite the accession number in CNGBdb:

The data that support the findings of this study have been deposited in the CNSA (https://db.cngb.org/cnsa/) of CNGBdb with accession number CNPXXXXXXX.

If your data is controlled, send an email with the project accession to datasubs@cngb.org to request a reviewer link. If the data is public, enter the CNSA project accession in the search box on the CNSA homepage and send the link to the resulting data detail page to the journal. Reviewer links are currently available for projects, samples, experiments/runs, assemblies, and variations. A reviewer link is valid for 2 months. If you need it for longer, email datasubs@cngb.org to apply for an extension and include the reviewer link in the email.

Public data can be searched by entering keywords, such as the accession, in the search box on the CNSA homepage.

Only public data can be downloaded freely. Click the "Download" button in the navigation bar to open the CNSA FTP download page and download data. You can also enter the data accession in the search box on the homepage and download from the search details page. When downloading and using public data, please follow the CNSA User Instructions.

If you have any questions or suggestions, feel free to contact datasubs@cngb.org.

Address: China National GeneBank, Jinsha Road, Dapeng District, Shenzhen, China

Tel: 0755-33945586

QQ group: 894343659
