System Architecture
Hardware and Component Specifications
System Component | Configuration |
---|---|
Supermicro X12 Gaudi Training Nodes | |
CPU Type | Intel Xeon Gold 6336 |
Habana Gaudi processors | 336 |
Nodes | 42 |
Training processors/Node | 8 |
Host x86 processors/node | 2 |
Sockets | 2 |
Memory capacity | 512 GB DDR4 DRAM |
Memory/training processor | 32 GB HBM2 |
Local Storage | 6.4 TB local NVMe |
Max CPU Memory bandwidth | ** GB/s |
Intel First Generation Habana Inference Nodes | |
CPU Type | Xeon Gold 6240 |
First-Generation Habana Inference Processors | 16 |
Nodes | 2 |
First-Generation Habana Inference Cards/node | 8 |
Cores/socket | 20 |
Sockets | 2 |
Clock speed | 2.5 GHz |
Flop speed | 34.4 TFlop/s |
Memory capacity | 384 GB DDR4 DRAM |
Local Storage | 1.6 TB Samsung PM1745b NVMe PCIe SSD |
Max CPU Memory bandwidth | 281.6 GB/s |
Standard Compute Nodes | |
CPU Type | Intel x86 |
Nodes | 36 |
x86 processors/node | 2 |
Memory Capacity | 384 GB |
Local NVMe | 3.2 TB |
Interconnect | |
Topology | Full bi-section bandwidth switch |
Per Node bandwidth | 6 × 400 Gb/s (bidirectional) |
Disk I/O Subsystem | |
File Systems | Ceph |
Ceph Storage | 1 PB |
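
The aggregate figures in the table can be cross-checked from the per-node values. The following is a minimal sketch, not an official tool: the constant and function names are illustrative. The memory-bandwidth check assumes six DDR4-2933 channels per socket with 8 bytes per transfer, which the table does not state.

```python
# Per-node values copied from the specification table above.
TRAINING_NODES = 42
GAUDI_PER_NODE = 8
HBM_PER_GAUDI_GB = 32
INFERENCE_NODES = 2
INFERENCE_CARDS_PER_NODE = 8
LINKS_PER_NODE = 6
LINK_SPEED_GBIT = 400

# Assumptions (not in the table): memory configuration of the inference hosts.
SOCKETS = 2
CHANNELS_PER_SOCKET = 6       # assumed
TRANSFERS_PER_SEC_M = 2933    # assumed DDR4-2933, in mega-transfers per second
BYTES_PER_TRANSFER = 8        # 64-bit channel


def system_totals():
    """Derive system-wide totals from the per-node figures."""
    node_gbit = LINKS_PER_NODE * LINK_SPEED_GBIT
    mem_bw_gb = (SOCKETS * CHANNELS_PER_SOCKET
                 * TRANSFERS_PER_SEC_M * BYTES_PER_TRANSFER) / 1000
    return {
        "gaudi_processors": TRAINING_NODES * GAUDI_PER_NODE,
        "aggregate_hbm_gb": TRAINING_NODES * GAUDI_PER_NODE * HBM_PER_GAUDI_GB,
        "inference_cards": INFERENCE_NODES * INFERENCE_CARDS_PER_NODE,
        "node_bandwidth_gbit": node_gbit,
        "node_bandwidth_gbyte": node_gbit / 8,
        "cpu_mem_bandwidth_gb": round(mem_bw_gb, 1),
    }


if __name__ == "__main__":
    for name, value in system_totals().items():
        print(f"{name}: {value}")
```

Under these assumptions the derived processor counts (336 and 16) and the 281.6 GB/s CPU memory bandwidth agree with the table entries.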