~/leocamacho.co

High Performance Computing

Made Jun 22, 2025modified Jun 22, 20251 min read

Basically all of (Machine Learning) Models nowadays are behemoths of engineering challenges.

nVidia’s GPU’s have NVLink → which allows extremely high throughput for GPU-GPU communication at training time over the network.

There’s different ways to parallelise DNN training: