this post was submitted on 16 Aug 2023
1 points (100.0% liked)

Machine Learning

1765 readers
1 users here now

founded 4 years ago
MODERATORS
 

When I train my PyTorch Lightning model on two GPUs on jupyter lab with strategy="ddp_notebook", only two CPUs are used and their usages are 100%. How can I overcome this CPU bottleneck?

Edit: I tested with PyTorchProfiler and it was because of old ssds used on the server

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here