Storage Systems
Each cluster provides several different storage systems that can be placed into the following rough categories:
Category | Shared | Speed | Size | Durability | Purpose |
---|---|---|---|---|---|
HOME | yes | medium | medium | robust+backups | User HOMEs |
Project | yes | medium | medium | robust+backups | Medium term data |
SCRATCH/WORK | yes | fast | big | fragile | Active data |
ARCHIVE/PERM | yes | very slow | medium | robust+backups | Archival to tape |
Many of them are shared parallel filesystems accessible from many nodes, rather than node local storage. Many of these filesystems also have quotas on the use of space and/or inodes (every file and directory has a “header” using one or more inodes).
It is important to use the right storage system for the right purpose to get the best performance, have the fewest surprises, and cause the least impact on other users. For example, a job that has to do many IO operations on temporary data could perform terribly if those temporary files (e.g. opening a file, writing a small amount of data, closing the file, deleting the file repeatedly millions of times) were stored on HOME or in a SCRATCH/WORK filesystem, and could easily slow down the filesystem for everyone on the cluster.
See the pages below (also can be found in the nav bar to the left) for information about the different parts of the storage systems and how to use them.
Last modified: 2024-11-20 10:49:29