What: Data parallelism — splitting the training data across GPUs
- Each GPU holds a full copy of the model and computes the loss and gradients on its own shard of the data
- Gradients are averaged across GPUs to produce a single model update
- Allows scaling to multiple GPUs with minimal code changes
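The averaging step above can be sketched without any framework. This toy example (a hypothetical 1-D linear model, not from the original notes) splits a batch into equal shards, computes each shard's gradient as one simulated "GPU" would, averages the gradients, and applies one shared update; real training would delegate the averaging to a primitive such as PyTorch's DistributedDataParallel.

```python
# Toy, framework-free sketch of data-parallel training: equal data shards,
# per-shard gradients, averaged into one update. Names and the model
# (y_hat = w * x with squared-error loss) are illustrative assumptions.

def shard_gradient(w, xs, ys):
    """Gradient of mean squared error for the model y_hat = w * x."""
    n = len(xs)
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / n

def data_parallel_step(w, xs, ys, num_workers, lr=0.1):
    """One SGD step with the batch split evenly across workers."""
    shard_size = len(xs) // num_workers
    grads = []
    for k in range(num_workers):  # each iteration plays the role of one GPU
        lo, hi = k * shard_size, (k + 1) * shard_size
        grads.append(shard_gradient(w, xs[lo:hi], ys[lo:hi]))
    avg_grad = sum(grads) / num_workers  # the averaging ("all-reduce") step
    return w - lr * avg_grad

# Synthetic data generated from y = 3x; every worker starts from the same w.
xs = [0.5, 1.0, 1.5, 2.0]
ys = [3 * x for x in xs]
w = 0.0
for _ in range(50):
    w = data_parallel_step(w, xs, ys, num_workers=2)
print(round(w, 3))  # converges toward the true weight 3.0
```

With equal-sized shards, the averaged gradient is identical to the full-batch gradient, which is why data parallelism leaves the optimisation itself unchanged.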