CBS Data Storage

Projects utilising microdata sometimes result in large files that are of interest to other projects. However, storing the data derived from those projects leads to significant costs for the researchers who develop those files, which limits their reuse potential. ODISSEI, in collaboration with CBS, supports the reuse of data that is complex and/or requires a lot of computational resources to create. The CBS Data Storage allows researchers to store the files derived from their microdata projects. This reduces their storage costs while making valuable datasets accessible to others.

The facility is offered as a collaboration between CBS and ODISSEI and is open to all researchers who work with CBS microdata and produce data files that are of great value to the (ODISSEI) research community.

From 1 December 2024, 20 TB will be available for a period of six months and can be requested through the CBS Microdata Environment. In May 2025, CBS and ODISSEI will review this procedure and determine whether the facility will continue and in what form.

What are the requirements?

  • The file made available by the researchers was created using CBS microdata only.
  • It is a large file, which means that storing it within the project environment is costly.
  • All other conditions implied by the CBS law apply.
  • CBS will stay in control of the created dataset.
  • The file is valuable to other users.
  • The script used to create the files and the metadata describing them must be prepared by the researcher, be of sufficient quality, and be suitable for public availability.
  • The link to the script and the metadata will be published in the ODISSEI Code Library and Portal.
  • CBS will place the metadata in the RA environment alongside the file.
  • The researchers who made the file take responsibility for answering any questions about the file.

What are the steps?

  1. Fill in the form to request the CBS Data Storage (available in the CBS Microdata Environment). In your H drive, create a folder called ‘Data Storage’, place the filled-in form there, and inform your CBS microdata advisor that you have done so.
  2. After CBS agrees to use the facility to store your dataset, you can place the data file, metadata file and scripts in H:/Data Storage. The metadata template will be made available in that folder. CBS will move the data to the Data Storage.
  3. CBS will perform an output check for metadata and scripts.
  4. Publish the code in a code registry of your choice (e.g., Zenodo or the Open Science Framework, OSF) and include the link to it in the metadata form.
  5. Send the metadata form to the ODISSEI Data Manager, Angelica Maineri, so that the ODISSEI Coordination Team can publish the metadata via the ODISSEI Portal to make your dataset findable, and add the link to the code via the ODISSEI Code Library.

If you have questions about the publication of the metadata or code, you can reach out to Angelica Maineri, ODISSEI Data Manager.

For any questions regarding the CBS Data Storage and the procedure, please contact CBS at microdata@cbs.nl.