Resources at RUB
HPC Cluster Elysium
Node Specifications
Type | Count | CPU | Memory | Local NVMe Storage | GPU |
---|---|---|---|---|---|
Thin-CPU | 284 | 2xAMD EPYC 9254 (24 core) | 384 GB | 960 GB | - |
Fat-CPU | 13 | 2xAMD EPYC 9454 (48 core) | 2304 GB | 1.92 TB | - |
Thin-GPU | 20 | 2xAMD EPYC 9254 (24 core) | 384 GB | 1.92 TB | 3xNVIDIA A30 Tensor Core GPU 24GB, 933GB/s |
Fat-GPU | 7 | 2xAMD EPYC 9454 (48 core) | 1152 GB | 1.92 TB + 15.36 TB | 8xNVIDIA H100 SXM5 GPUs 80GB, 3.35TB/s |
File Systems
The following file systems are available:
/home
: For your software and scripts. High availability, but no backup. Quota: 50 GB per user.

/lustre
: Parallel file system to use for your jobs. High availability, but no backup. Not for long-term storage. Quotas: 1 TB and 1,000,000 files per user.

/tmp
: Fast storage on each node for temporary data. Limited in space, except on Fat-GPU nodes, where multiple TB are available. Data is removed when the job ends.
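To keep an eye on the /lustre quotas, a small script can total the size and number of files under a directory and compare them with the limits above. This is a minimal sketch, not an official tool: `scan_tree` and `quota_report` are hypothetical helper names, 1 TB is assumed to mean 10^12 bytes, and the script sums apparent file sizes, so the file system's own accounting remains authoritative.

```python
import os

# Quotas quoted above for /lustre. Treating 1 TB as 10**12 bytes is an
# assumption; the file system may count in binary units instead.
LUSTRE_MAX_BYTES = 10**12
LUSTRE_MAX_FILES = 1_000_000


def scan_tree(root):
    """Return (total_bytes, file_count) for all non-directory entries below root."""
    total_bytes = 0
    file_count = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                # lstat does not follow symlinks out of the tree.
                total_bytes += os.lstat(path).st_size
                file_count += 1
            except OSError:
                # The file vanished or is unreadable; skip it.
                continue
    return total_bytes, file_count


def quota_report(total_bytes, file_count,
                 max_bytes=LUSTRE_MAX_BYTES, max_files=LUSTRE_MAX_FILES):
    """Return the used fraction of each quota and an over-quota flag."""
    return {
        "bytes_used_fraction": total_bytes / max_bytes,
        "files_used_fraction": file_count / max_files,
        "over_quota": total_bytes > max_bytes or file_count > max_files,
    }
```

Calling `quota_report(*scan_tree(path))` with your own directory on /lustre gives a rough picture of how close you are to either limit. Walking a very large tree on a parallel file system can be slow, so run it sparingly.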
Partition Overview
Two partitions are available for each type of compute node: the filler partitions are designed for short jobs, while the standard partitions support longer-running tasks.
Jobs in a filler partition have a lower priority and start only if no job from the corresponding standard partition requests the resources. Running a job in a filler partition costs only a fraction of the fair share charged in the standard partition.
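The choice between a standard and a filler partition can be sketched as a simple walltime check against the filler time limits listed in the partition table. The helper names `parse_slurm_time` and `choose_partition` are hypothetical, and the parser only handles the `D-HH:MM:SS` and `H:MM:SS` forms that appear in the table, not every time format Slurm accepts.

```python
def parse_slurm_time(text):
    """Convert a time string to seconds.

    Handles only the forms used in the partition table:
    "D-HH:MM:SS" and "H:MM:SS".
    """
    days = 0
    if "-" in text:
        day_part, text = text.split("-", 1)
        days = int(day_part)
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


# Filler time limits copied from the partition table, keyed by the
# name of the matching standard partition.
FILLER_LIMITS = {
    "cpu": "3:00:00",
    "fat_cpu": "3:00:00",
    "gpu": "1:00:00",
    "fat_gpu": "1:00:00",
}


def choose_partition(standard, walltime):
    """Return the filler partition if the walltime fits, else the standard one."""
    limit = parse_slurm_time(FILLER_LIMITS[standard])
    if parse_slurm_time(walltime) <= limit:
        return standard + "_filler"
    return standard
```

For example, `choose_partition("cpu", "2:30:00")` selects `cpu_filler`, while the same walltime on `gpu` exceeds the one-hour filler limit and falls back to `gpu`. Because filler jobs have lower priority, a job that fits the filler limit may still be better placed in the standard partition when a prompt start matters more than share cost.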
The vis partition is special since the visualization nodes are intended for interactive use.
Partition | Timelimit | Nodelist | Max Tasks per Node | Share-Cost² |
---|---|---|---|---|
cpu | 2-00:00:00¹ | cpu[001-284] | 48 | 1.000 / core |
cpu_filler | 3:00:00 | cpu[001-284] | 48 | 0.050 / core |
fat_cpu | 2-00:00:00 | fatcpu[001-013] | 96 | 1.347 / core |
fat_cpu_filler | 3:00:00 | fatcpu[001-013] | 96 | 0.067 / core |
gpu | 2-00:00:00 | gpu[001-020] | 48 | 1.000 / core, 49.374 / GPU |
gpu_filler | 1:00:00 | gpu[001-020] | 48 | 1.000 / core, 12.344 / GPU |
fat_gpu | 2-00:00:00 | fatgpu[001-007] | 96 | 1.000 / core, 169.867 / GPU |
fat_gpu_filler | 1:00:00 | fatgpu[001-007] | 96 | 1.000 / core, 49.217 / GPU |
vis | 1-00:00:00 | vis[001-003] | | 2.000 / core, 29.401 / GPU |
¹ Time limits of up to 7 days are possible on this partition but not recommended. Only 2 days are guaranteed; jobs running longer than that may be cancelled if important maintenance work requires it.
² Cost does not refer to money, but to the factor by which computing time is weighted when it is added to a project's used share in order to compute job priorities. The costs are based on the relative monetary costs of the underlying hardware.
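To compare what a job would add to a project's used share in different partitions, the factors from the table can be applied to a job's resources. The sketch below assumes that the per-core and per-GPU factors add up and that the result scales linearly with runtime in hours; the exact formula and time unit used by the scheduler are not stated here, so treat the numbers as relative rather than absolute. `share_cost` is a hypothetical helper name.

```python
# Share-cost factors copied from the partition table: (per core, per GPU).
SHARE_COST = {
    "cpu": (1.000, 0.0),
    "cpu_filler": (0.050, 0.0),
    "fat_cpu": (1.347, 0.0),
    "fat_cpu_filler": (0.067, 0.0),
    "gpu": (1.000, 49.374),
    "gpu_filler": (1.000, 12.344),
    "fat_gpu": (1.000, 169.867),
    "fat_gpu_filler": (1.000, 49.217),
    "vis": (2.000, 29.401),
}


def share_cost(partition, cores, hours, gpus=0):
    """Estimate the weighted computing time charged for a job.

    Assumption: per-core and per-GPU factors are additive and the
    total scales linearly with the runtime in hours.
    """
    per_core, per_gpu = SHARE_COST[partition]
    return (cores * per_core + gpus * per_gpu) * hours
```

Under these assumptions, a two-hour job on all 48 cores of a Thin-CPU node comes to roughly 96 in `cpu` and roughly 4.8 in `cpu_filler`, which illustrates how much cheaper short filler jobs are in terms of fair share.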