Setting up multi-GPUs in the notebook

gwanyeong · March 18, 2021, 1:22am

Dear OCP team,
Thank you for building such a huge database for the community.

Currently, I am trying to understand the details of OCP and am testing a few models in a Jupyter notebook by following the tutorial. However, I ran into a problem with the setup.

How can I set up multiple GPUs to train the model in the notebook?

From the command line, I was able to obtain the results as guided (torch.distributed.launch with --nproc_per_node=4 and --num-gpus 4).
However, personally, I would find it easier to work in a notebook if possible.

Thank you in advance.

Sincerely,
Gwan-Yeong

sidgoyal · March 18, 2021, 1:53am

Hi Gwan-Yeong

Thanks for your query!

Setting up multi-GPU training in a Jupyter notebook is pretty tricky. I would recommend looking at this response from the PyTorch team: DistributedDataParallel on terminal vs jupyter notebook - #4 by pritamdamania87 - distributed - PyTorch Forums

We are unable to extend support for multi-GPU training in Jupyter notebooks in the OCP codebase at this point, since there are already open issues in (upstream) PyTorch.

We would recommend setting up multi-GPU training via the terminal.
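
For anyone who wants to keep the launch settings next to their notebook, here is a minimal sketch that only assembles the terminal command as an argument list. It assumes a single node; `build_launch_command` is a hypothetical helper written for illustration, `main.py` is a placeholder for whatever training script you launch, and `--num-gpus` is taken from the command quoted in the question rather than verified against the OCP code.

```python
import shlex
import sys


def build_launch_command(script, num_gpus, script_args=()):
    """Return the argv list for a single-node torch.distributed.launch run.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return [
        sys.executable, "-m", "torch.distributed.launch",
        f"--nproc_per_node={num_gpus}",  # one worker process per GPU
        script,                          # placeholder training script
        "--num-gpus", str(num_gpus),     # flag as quoted in the question (assumed)
        *script_args,                    # any further script arguments
    ]


# "main.py" is a placeholder, not a confirmed entry point.
cmd = build_launch_command("main.py", 4)

# Print the part after the interpreter path so it can be pasted into a terminal.
print(shlex.join(cmd[1:]))
```

The printed line can be pasted into a terminal. Passing `cmd` to `subprocess.run(cmd, check=True)` from a notebook cell would start the workers as separate processes outside the kernel, which is still the terminal launch path and not in-notebook multi-GPU training.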

Best
Siddharth

gwanyeong · March 18, 2021, 2:25am

Dear Siddharth,

Thank you very much for your quick reply. I wasn't aware of that issue :)
In that case, I will use the terminal.

Thanks again!

BR,
Gwan-Yeong
