I am evaluating the GemNet-OC optimisation for the IS2RS task using different test sets. It took me 3 days to finish one test set (test-id) with one 3090 GPU. I saw that structures are optimised one by one, and I am curious is there a way to accelerate the optimisation process (e.g. in-batch or multi-GPU acceleration)?