University of Helsinki Databank

Do you have data that are not in active research use and for which you cannot find a suitable storage location? University of Helsinki Databank offers a location for 5 to 15 years for digital research datasets produced at the University. Research data are stored in Databank in frozen form, which means that their content cannot be edited in the service.
When regular storage space is not enough

Databank is a curated storage service that is also suited to large datasets. The service is not intended for the most sensitive datasets. The suitability of data is ensured in the service process.

In matters related to Databank, please reach out to the University’s Data Support: datasupport@helsinki.fi.

Careful preparation ensures preservation

In the case of data stored in Databank, it is important to use persistent file formats, as the data preserved in the service are not curated during storage. In other words, file formats will not be updated in sync with software updates.

The service is free of charge to researchers.

What kind of research data is the Databank suitable for?​

The Databank is suitable for research data, which can be formed as a dataset that can be described. Descriptive metadata (title, short description of the data, authors, etc.) will be published in the Data Catalogue which will be launched in spring 2025. Information about the usage rights are requested on the Databank’s subscription form. This ensures that there are no agreements preventing the retention of the dataset.​

The table below specifies what kind of data the Databank is suitable for from the perspective of data security. Data support is happy to help you if you are unsure whether your research data is suitable to Databank. Please ask: datasupport@helsinki.fi

 

Data type​
A detailed description of the data type​
Is the dataset suited for Databank?​
Personal data​

Dataset containing ordinary personal data

Ordinary personal data includes, but is not limited to, name, address, profession, e-mail, marital status, voice, image, video, etc. ​
The dataset does NOT contain data belonging to special categories of personal data.

Suitable for Databank!​

Pseudonymised data belonging to special categories of personal data.

All personal data in the dataset is protected by a code. The data does NOT contain direct identifiers, such as human voice, image or videos.​

Suitable for Databank if the pseudonymisation code is stored in a different location.​

Special categories of personal data are pseudonymised, but the data itself is indirectly identifiable

All strong identifiers (name, personal identity code, etc.) of the data are protected by code, but the data itself is unique, such as genetic information (sequences) or brain images.​

May be suitable for the Databank if it is impossible to identify a person from the data. The risk level of the data is assessed on a case-by-case basis.​

Anonymized data

All personal data has been deleted/modified in such a way that individual persons cannot be identified.​

May be suitable for Databank. The risk level of the data is assessed on a case-by-case basis. The anonymity of the dataset is checked.​

Confidential information​

Confidential information​

Confidential data include, for example, the habitats of endangered plants or the location of endangered animals, as well as some information related to national defence.​

High-risk data cannot be stored in the Databank. The Databank is suitable for low- and moderate-risk data.

Trade secrets​

The dataset contains trade secrets of a third party (such as another university or company). ​

If the economic value associated with a trade secret is significant, a Databank may not be the best option. ​
The Databank is suitable if the economic value related to the trade secret is not significant.​