Research Data Management

As the GSSC is a publicly funded institution, its research data has to be publicly available, unless the right of publicity is applicable. To this end, a two-stage research data archival storage infrastructure is being implemented.

On the one hand, there is a GSSC-administered cloud storage solution based on OwnCloud with a sophisticated backup system, run by the RRZK. It allows all members of the GSSC to safely and privately store and share their research data, as well as to write funding applications or papers collaboratively. The structure of this storage is managed by the individual researchers, and it is intended for internal use only. OwnCloud runs on an Ubuntu LTS server on the RRZK's VM solution.

On the other hand, the back end of a fully OAIS-compliant archival system has been set up, and the whole system should be fully usable by members of the GSSC by the end of 2016. This system will provide not only state-of-the-art archival storage but also comprehensive presentation of dissemination packages, in accordance with the rights set on the corresponding archival packages. Metadata is stored in basic Dublin Core, and researchers can add their own metadata in their preferred format.
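As an illustration of what a basic Dublin Core record can look like, here is a minimal Python sketch that serialises the fifteen elements of the Dublin Core Metadata Element Set to XML. It is not the GSSC's actual ingest workflow: the helper `build_dc_record`, the `metadata` wrapper element, and all field values are hypothetical and chosen only for the example.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set, version 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"

# The fifteen elements of simple ("basic") Dublin Core.
DC_ELEMENTS = {
    "title", "creator", "subject", "description", "publisher",
    "contributor", "date", "type", "format", "identifier",
    "source", "language", "relation", "coverage", "rights",
}


def build_dc_record(fields):
    """Return a simple Dublin Core record as an XML string.

    `fields` maps element names to a string or a list of strings
    (every Dublin Core element is optional and repeatable).
    The <metadata> wrapper element is an assumption for this sketch.
    """
    ET.register_namespace("dc", DC_NS)
    root = ET.Element("metadata")
    for name, value in fields.items():
        if name not in DC_ELEMENTS:
            raise ValueError(f"not a basic Dublin Core element: {name}")
        values = value if isinstance(value, list) else [value]
        for item in values:
            element = ET.SubElement(root, f"{{{DC_NS}}}{name}")
            element.text = item
    return ET.tostring(root, encoding="unicode")


# Hypothetical example values, for illustration only.
record = build_dc_record({
    "title": "Interview transcripts, pilot study",
    "creator": ["Doe, Jane", "Roe, Richard"],
    "date": "2016-05-01",
    "rights": "Restricted: internal use only",
})
print(record)
```

Metadata in a researcher's preferred format would sit alongside such a record rather than replace it.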

The archival system is Archivematica, while the dissemination system is based on its companion project, Access to Memory (AtoM); both also run on dedicated VMs. The deep storage back end is the University's IBM Tivoli Storage Manager (TSM) tape robot system.

The decision to use only free and open source software (FOSS) stems from the idea that only FOSS is capable of genuinely long-term storage and archival solutions, as the life cycle of proprietary software always depends on the life cycle of the company behind it. The exception is the tape library, for which there is no FOSS alternative.

The main remaining challenge will be to safeguard the continuity of data curation beyond the funding period.

Diagram of GSSC Digital Infrastructure
