Overview#
The HPC environment became available to MCW researchers in March 2021. The cluster consists of 71 compute nodes, 3,400 CPU cores, and 40 GPUs. The cluster is connected by 7 100Gbps switches running RoCEv2 (ethernet equivalent to Infiniband). Additionally, a 215TB NVMe provides scratch storage, and a 1.77PB scale-out NAS provides persistent storage.
Cluster#
Detailed information is available below. Please note, the table is wide and might require side scrolling to view all data.
Nodes | Type | Cores/node | Mem/node (Gb) | Disk/node (Gb) | Total GPUs | Sockets/node | Cores/socket | Threads/core | CPU Vendor | CPU Model | CPU Base Freq (GHz) | CPU Turbo Freq (GHz) | GPU Vendor | GPU Model | GPU Mem (Gb) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
60 | CPU | 48 | 384 | 440 | 2 | 24 | 1 | Intel | 6240R | 2.4 | 4 | ||||
6 | GPU | 48 | 384 | 440 | 4 | 2 | 24 | 1 | Intel | 5220R | 2.2 | 4 | NVIDIA | Tesla V100 | 32 |
2 | GPU | 48 | 512 | 440 | 4 | 2 | 24 | 1 | Intel | 6336Y | 2.4 | 3.6 | NVIDIA | Ampere A40 | 48 |
1 | GPU | 40 | 512 | 7000 | 8 | 2 | 20 | 1 | Intel | E5-2698 v4 | 2.2 | 3.6 | NVIDIA | Tesla V100 SXM2 | 32 |
2 | Large mem | 1536 | 440 | 2 | 24 | 1 | Intel | 6240R | 2.4 | 4 |