Swestore

From SNIC Documentation

Swestore is Research Data Storage Infrastructure operated by the Swedish National Infrastructure for Computing (SNIC).

The resources provided by Swestore are made available through open procedures such that the best Swedish research is supported and new research is facilitated. The purpose of Swestore allocations, granted by Swedish National Allocations Committee (SNAC), is to provide large scale data storage for “live” or “working” research data, also known as active research data.

Due to the VR funding, the free allocations on Swestore have some usage limitations.

Swestore is NOT supposed to be used for backups and such requests for allocation will be rejected. Please, check with your university (home institution/organisation) IT department about backup services, strategies and policies in place. If such services do not exist or if you can’t access them for different reasons, please contact us at support@swestore.se;

Swestore is NOT supposed to be used as archiving service, long-term storage or repository for “static” data. Once data is no longer in the process of change, and decision on which data should be retained shared and/or preserved has been taken, data should be moved toward appropriate data services. The higher education institutions are responsible for archiving and long-term preservation of research data produced by researchers employed by them;

Glossary

Active (Research) data

is data that is being worked on as part of research project and therefore subject to change. The files containing data will need to be accessed and amended or updated as new data is gathered or processed.

Static (Research) data

is data that is no longer in the process of change and it can be prepared for preservation and reuse.

Backup

is a copy of the digital data to be stored and used as a replacement in case the main copy is either deleted or corrupted.

Archive

is a service to record, organise, and store (digital) items in optimal conditions, with standardised labelling to ensure their longevity and continued access. The service is based on application of metadata, archiving policies, records management, and digital preservation actions. Archivists make decisions on selection and retention of items which are usually governed by supporting policies.

Swestore is distributed across the SNIC centres C3SE, HPC2N, Lunarc, NSC and Uppmax. Data is stored in two copies with each copy at a different SNIC centre. This enables the system to cope with a multitude of issues ranging from a simple crash of a storage element to losing an entire site while still providing access to the stored data.

One of the major advantages to the distributed nature of Swestore is the excellent aggregated transfer rates possible. This is achieved by bypassing a central node and having transfers going directly to/from the storage elements if the selected transfer protocol allows it. Swestore can achieve aggregated transfer rates in excess of 100 Gigabit per second, but in practice this is limited by connectivity to the end user, each university or a limited number of files (typically max 1 Gbit/s per file/connection).

To protect against silent data corruption the dCache storage system checksums all stored data and periodically verifies the data using this checksum.

The dCache system does NOT yet provide protection against user errors like inadvertent file deletions.

Register your eScience client certificate in SUPR (for all users)

All project members have to register in SUPR and be added to the approved project by the PI. All users also have to register their certificate in SUPR. This information is used by Swestore to authenticate the users when accessing the storage area. Registering the certificate is easy though. Make sure your certificate is stored in your browser, log in to SUPR , click "Personal Information" in the left menu, click "Register Client Certificate" and follow the instructions. Please wait for up to 10 minutes for this information to be distributed to Swestore.

Using Swestore

Download and upload data

From the command line

There are several command line tools capable of using the protocols provided by Swestore. For interactive usage on SNIC clusters we recommend using the ARC tools which should be installed on all SNIC resources.

As an integration point for building scripts and automated systems we suggest using the curl program and library.

Using a GUI client

From a web browser

Swestore is accessible in your web browser as a simple directory index interface at https://webdav.swestore.se/. To browse private data you need to have your certificate installed in your browser (see above). Projects are organized under the /snic directory as https://webdav.swestore.se/snic/YOUR_PROJECT_NAME/.

Enabled access protocols

A design criteria for Swestore is to provide the storage over a number of standardized and public protocols. There is no vendor specific client needed for access.