Loading...
 

Storage

Storage

Options

When working with huge data files, the question of where to store your input/output files becomes an issue. Here is a list of what is available.

Name/Location Local/Shared Use Advantage Disadvantage
/tmp local+ Fast local storage on the cluster nodes Very Fast Only local; very temporary
/tmpdata/< username >* local Fast local storage only on spinoza and schroedinger Very Fast Only local; temporary, see policy
/scratch/< username > shared Fast shared storage on all servers Available on all servers Temporary, see policy
tier2 e.g. /g/tier2/< groupname >/ shared Cheap permanent storage Cheap and safe (no backup though!) Not accessible on the cluster; Slow
tier1 e.g. /g/< groupname >/ shared Backed-up shared storage Available on all servers; Backed-up Expensive; Not as fast as /scratch or /tmpdata

* if /tmpdata/< username > does not exists, try creating it. Remember to use your unix account or ask GBCS.

+Local meaning that the /tmp on schroedinger is not the same as on spinoza or any other node.

On the GB servers, Spinoza and Schroedinger, all of the above are available. Tier2 is however not accessible from the cluster nodes.

Policies

  • /tmp is exclusively reserved for real temporary data and is cleaned up automatically.
  • /tmpdata on spinoza and schroedinger is used to store a copy of your data locally with the underlying goal to speed up computing and reduce the I/O load due to NFS access.
    • Please use /tmpdata/< username > . Files and directories not matching this format are deleted on a regular basis.
    • Cleanup your data regularly.
    • /tmpdata/< alumni > directories are deleted without notice after a 6 month grace period from the time the user left EMBL.
  • /scratch is cleaned every day automatically by the IT. If the overall usage is above 90% then everything older than 45 days (approx. 6 weeks) is deleted.

See also New at GB