Introduction
The digital revolution has provoked huge challenges towards the management of massive research data across disciplines and subjects. And to help better facilitate the open data work, the state-level rules, "Measures for Managing Scientific Data" has been released in March 2018. As a follow-up action, 20 national data centers are selected by the Ministry of Science and Technology to ensure long-term data accessibility and reusability. Eleven out of the twenty national data centers are managed by the Chinese Academy of Sciences and have close cooperation with the China Science and Technology Cloud (CSTCloud).
List of 20 National Data Centers in China
Name | Affiliation | Website |
National High Energy Physics Science Data Center | Institute of High Energy Physics, CAS | http://www.hep.nsdc.cn/ |
National Genome Science Data Center | Beijing Institute of Genomics, CAS | https://bigd.big.ac.cn/ |
National Microbial Science Data Center | Institute of Microbiology,CAS | https://gcmeta.wdcm.org/ |
National Space Science Data Center | National Space Science Center, CAS | http://www.cssdc.ac.cn/ |
National Astronomical Sciences Data Center | National Astronomical Observatories, Chinese Academy of Sciences | https://nadc.china-vo.org/?locale=en |
National Earth Observation Science Data Center | Aerospace Information Research Institute, CAS | http://www.chinageoss.cn/en/index.htm |
National Arctic and Antarctic Data Center | Polar Research Institute of China | https://www.chinare.org.cn/en/ |
National Qinghai Tibet Plateau Science Data Center | The Institute of Tibetan Plateau Research, CAS | https://data.tpdc.ac.cn/en/ |
National Ecological Science Data Center | Institute of Geographics Sciences and Natural Resources Research, CAS | http://www.cnern.org/ |
National Material Corrosion and Protection Science Data Center | The University of Science and Technology Beijing | http://corrdata.org.cn/ |
National Cryosphere Desert Data Center | Cold and Arid Regions Environmental and Engineering Research Institute, CAS | http://www.ncdc.ac.cn/portal/?lang=en&clear_cache=1 |
National Metrological Science Data Center | National Institute of Metrology, China | www.nms.org.cn |
National Earth System Science Data Center | Institute of Geographics Sciences and Natural Resources Research, CAS | http://www.geodata.cn/ |
National Population Health Science Data Center | Chinese Academy of Medical Sciences | https://www.ncmi.cn/index.html |
National Basic Sciences Public Science Data Center | Computer Network Information Center, CAS | http://www.nsdata.cn/ |
National Agricultural Science Data Center | Agricultural Information Institute of CAAS | http://www.agridata.cn/ |
National Forestry and Grassland Science Data Center | Institute of Forest Resource Information Techniques, CAF | http://www.forestdata.cn/index-en.html |
National Meteorological Science Data Center | National Meteorological Information Center | http://data.cma.cn/en |
National Earthquake Science Data Center | China Earthquake Networks Center | https://data.earthquake.cn/index.html |
National Marine Science Data Center | National Marine Data and Information Service | http://mds.nmdis.org.cn/ |
Tailored services in CSTCloud
To meet the needs of centralized data storage, backup, disaster recovery, and high-speed circulation in the National Data Centers, China Science and Technology Cloud adapts to the "cold, warm, and hot" stages of scientific data storage with a total storage capacity of over 150PB. Given the characteristics of disciplinary scientific data, multiple cloud services are provided by the CSTCloud.
Cloud storage service
CSTCloud cloud storage service is based on a distributed file system with elastic expansion, high reliability, and robustness. Supported by the CSTNET, the data transmission adopts standard transmission protocols and adapts to various operating systems with IPV4 and IPV6 dual-stack access. The data durability could reach 99.99999%.
Object storage service
To ensure the long-term security of data to the greatest extent, and to release dozens of petabytes level research data, the CSTCloud object storage service is based on a distributed system of error-tolerant with on-demand data storage and can support most popular interface access, such as the web, FTP, REST API, and S3, etc.
Cloud backup service
Cloud backup service has been developed by the CSTCloud for archiving massive research data. FTP and HTTP protocols are both supported with a standard file system for seamless data access, mainstream applications, and workloads.
Cooperation with selected data centers
In response to the challenges faced by the National Science Data Center, the "China Science and Technology Cloud 2020 Application Promotion Plan" has been launched and some featured use cases are selected. For example, 500TB backup storage space and professional technical support are prepared for selected national data centers, such as the National Basic Sciences Public Science Data Center, the National Cryosphere Desert Data Center as well as the National High Energy Physics Science Data Center.
In addition, facing the special needs of massive data access from the National Microbial Science Data Center (NMDC) and her sub-branches, China Science and Technology Cloud provided professional technical support with a compound solution which has helped facilitate the 100TB data transfer from 62 databases in more than 7 categories for national researchers.