Multi-GPU Training in Pytorch

Data and Model Parallelism

Rachel Draelos, MD, PhD
Towards Data Science
6 min readMar 4, 2020

--

“Harvesters” by Anna Ancher. Link: Wikipedia.

This post will provide an overview of multi-GPU training in Pytorch, including:

  • training on one GPU;
  • training on multiple GPUs;
  • use of data parallelism to accelerate training by processing more examples at once;
  • use of model parallelism to enable training models that…

--

--

CEO at Cydoc | Physician Scientist | MD + Computer Science PhD | AI/ML Innovator