ENTERPRISE240 min200 min180 min
AI Infrastructure Networking
High-bandwidth, low-latency fabrics for GPU clusters: RoCE, InfiniBand concepts, spine-leaf at scale, and observability for ML platforms.
36 hours · 3 modules
Module 1
AI Cluster Fabrics
Master ai cluster fabrics through guided lessons, labs, and assessments.
- • Fat-tree
- • Rail-optimized
- • ECN/PFC
4 lessons
Module 2
RDMA & RoCE
Master rdma & roce through guided lessons, labs, and assessments.
- • Lossless Ethernet
- • PFC/ECN tuning
4 lessons
Module 3
AI Network Observability
Master ai network observability through guided lessons, labs, and assessments.
- • Telemetry
- • Bottleneck analysis
4 lessons 1 labs