Chapter 6

Storage Systems

Each cluster provides several different storage systems that can be placed into the following rough categories:

CategorySharedSpeedSizeDurabilityBackupsPurpose
HOMEyesmediummediumrobustyesUser config files, software, etc.
SCRATCH/WORKyesfastbigfragilenoActive data
ARCHIVE/PERMyesvery slowmediumrobustyesArchival to tape
LOCALnofastsmallfragilenoJob temporary data

Many of them are shared parallel filesystems accessible from many nodes, rather than node local storage. Many of these filesystems also have quotas on the use of space and/or inodes (every file and directory has a “header” using one or more inodes).

Note

It is important to use the right storage system for the right purpose to get the best performance, have the fewest surprises, and cause the least impact on other users. For example, a job that has to do many IO operations on temporary data could perform terribly if those temporary files (e.g. opening a file, writing a small amount of data, closing the file, deleting the file repeatedly millions of times) were stored on HOME or in a SCRATCH/WORK filesystem, and could easily slow down the filesystem for everyone on the cluster.

See the pages below (also can be found in the nav bar to the left) for information about the different parts of the storage systems and how to use them.