The new version of CNGBdb comes with a much better user interface, which has been re-revised, fully upgraded, becomes more easier to use and user-friendly.
CNGBdb's advanced search allows users to run queries over all fields in the database according to the data retrieval needs. For example, users can do a title search with keyword ‘rice’ in the Title field of the literature database, or search ‘Gigascience’ in the Journal field. BOOLEAN operations also can be used to improve search accuracy and search efficiency.
Users are offered ‘auto complete’ search service by CNGBdb. If users type partial keywords into the CNGBdb search bar, auto-complete will finish typing it for users.
Users are offered ‘auto correction’ search service by CNGBdb. With this search service, users will receive suggested spelling search results if they type their search queries incorrectly, which can help users find what they are looking for in an easy and fast way. For example, users type ‘lang cancer’ into the search bar, based on the algorithms analysis, CNGBdb predicts ‘lung cancer’ may be the term users think about actually, and provides search results of ‘lung cancer’.
Search and filter functions of CNGBdb in all databases can help users narrow down searches that brought back too many results and find results accurately and rapidly. For example, users can filter out full-text articles by selecting ‘Free full text’ in the search result page of literature database, and also can search for the newest papers by the year data of publication.
In search results, CNGBdb will give priority to recommend excellent datasets that match the user's search terms. For example, if users search for ‘the Ruili Botanical Garden’, the search results will give priority to recommend the Ruili Botanical Garden dataset, project ID of which is CNPhis0000538, containing 42TB sequencing data and 738 plant samples.
CNGBdb version 1.0 has increased data resources compared to the beta version. Up to February 2019, data resources of all databases are as follows:
Data resources will be updated periodically to ensure timeliness, such as daily updates of the literature database.
The average submission time is reduced by 70% after optimization. Testing case: submission time-consuming(file size: 99kb; article number: 768) is 27s and 5s before and after optimization.
Based on big data and cloud computing technologies, China National GeneBank DataBase (CNGBdb) provides integrated data services such as data archiving, computational analysis, knowledge search, management authorization and visualization.
CNGBdb constructs multiple databases, including Literature, Gene, Variation, Protein, Sequence, Project, Sample, Experiment, Assembly, and allows cross-reference among those data sources to form data interconnection. CNGBdb offers a number of important performance advantages:3 billion data items; full-text search; second response time; retrieval keywords both in Chinese and English.