CODEPLOT aims to provide a reliable and highly efficient computing platform for users to conduct bioinformatics analysis automatically even without programming background.
Version 1.0 release features include:
1）Assembly and gene annotation of the 1000 plant transcriptomes
The 1000 plants Project (1KP) is an international multidisciplinary alliance project that has conducted large-scale transcriptome sequencing of more than 1,000 plants.
The CoVID-19 Novel Coronavirus Sequence Database collected data from CNGB, GenBank, GSAID and other sources, providing the fundamental information to study the evolutionary relationship of COVID-19 collected from different regions worldwide and infer the potential spread routine.
Single-cell database integrates complex single cell sequencing datasets, and provides various relevant analysis tools including result visualization services, which will facilitate researchers to access and explore published single-cell datasets easily.
1)Blast, sequence database screen based on homologous sequence alignment
2) single_cell_scanpy, single cell sequencing data analysis
3)HMMER, gene family member mining
4)edgeR, differential expression gene analysis
Codeplot employs blockchain to produce fingerprint for all confidential files and calculation process to ensure that all relevant calculation processes and histories can be traced back and the records cannot be tampered with. Users can browse and retrieve the fingerprint information on the whole block chain for your ana.
CDCP (Cell-omics Data Coordinate Platform) is a shared and integrated set of complex single cell data, providing users with integrated services such as single cell data search, analysis tools and visualization.
Single Cell Analysis of Macaca Fascicularis project 210,000 + fascicularis fascicularis;
Single Cell Analysis of Human Project 100,000 + cellular omics data and visualization;
In order to comply with the new national laws and regulations on life science data archiving, the "Privacy and Security Policy" has been updated this time and will be updated online on September 23, 2020.
The updated documents are as follows:
"Privacy and Security Policy" https://db.cngb.org/policy/
In order to comply with the new national laws and regulations on life science data archiving, part of the "Terms and Conditions" has been updated this time and will be updated online on September 2, 2020.
In order to comply with the new national laws and regulations on life science data archiving, management, and open sharing, CNGBdb has amended processes of data submission and access applications and established related documents for its data services. The new processes were officially updated and implemented on July 11, 2020. To ensure a smooth transition, if you have already submitted your request, you can complete the process with the old version documents.
The updated documents are as follows:
Please read carefully the above documents to fully understand our new processes before continuing to visit CNGBdb and use its data services.
If you have any questions, please contact firstname.lastname@example.org.
The new version of CNGBdb comes with a much better user interface, which has been re-revised, fully upgraded, becomes more easier to use and user-friendly.
CNGBdb's advanced search allows users to run queries over all fields in the database according to the data retrieval needs. For example, users can do a title search with keyword ‘rice’ in the Title field of the literature database, or search ‘Gigascience’ in the Journal field. BOOLEAN operations also can be used to improve search accuracy and search efficiency.
Users are offered ‘auto complete’ search service by CNGBdb. If users type partial keywords into the CNGBdb search bar, auto-complete will finish typing it for users.
Users are offered ‘auto correction’ search service by CNGBdb. With this search service, users will receive suggested spelling search results if they type their search queries incorrectly, which can help users find what they are looking for in an easy and fast way. For example, users type ‘lang cancer’ into the search bar, based on the algorithms analysis, CNGBdb predicts ‘lung cancer’ may be the term users think about actually, and provides search results of ‘lung cancer’.
Search and filter functions of CNGBdb in all databases can help users narrow down searches that brought back too many results and find results accurately and rapidly. For example, users can filter out full-text articles by selecting ‘Free full text’ in the search result page of literature database, and also can search for the newest papers by the year data of publication.
In search results, CNGBdb will give priority to recommend excellent datasets that match the user's search terms. For example, if users search for ‘the Ruili Botanical Garden’, the search results will give priority to recommend the Ruili Botanical Garden dataset, project ID of which is CNPhis0000538, containing 42TB sequencing data and 738 plant samples.
CNGBdb version 1.0 has increased data resources compared to the beta version. Up to February 2019, data resources of all databases are as follows:
Data resources will be updated periodically to ensure timeliness, such as daily updates of the literature database.
The average submission time is reduced by 70% after optimization. Testing case: submission time-consuming(file size: 99kb; article number: 768) is 27s and 5s before and after optimization.
Based on big data and cloud computing technologies, China National GeneBank DataBase (CNGBdb) provides integrated data services such as data archiving, computational analysis, knowledge search, management authorization and visualization.
CNGBdb constructs multiple databases, including Literature, Gene, Variation, Protein, Sequence, Project, Sample, Experiment, Assembly, and allows cross-reference among those data sources to form data interconnection. CNGBdb offers a number of important performance advantages:3 billion data items; full-text search; second response time; retrieval keywords both in Chinese and English.