Good day, OCP team,
I am interested in using the GemNet versions in the fairchem repository due to their optimization for multiple GPUs. I would appreciate your guidance on using both versions of the model (gemnet_gp and gemnet_oc). My intention is to use them both for making predictions of energy and forces as well as for performing molecular dynamics. To be more concise, allow me to pose the following questions:
- What is the relevance of the LMDB files required for both models in the base.yml file?
- What information do both the s2ef files and the LMDB files contain? Do your versions of GemNet incorporate the use of information that Gasteiger’s original version did not use, or is the exact same information used for both performing inferences and dynamics?
- Are there specific LMDB files for s2ef to train and test the model? This question arises because, in the download_data.py file, the only files of this format for s2ef are apparently those marked as ‘test’.
- If to use your versions of GemNet I need to convert my data into .XYZ format as used by the original model version or my s2ef data to LMDB data, after performing this conversion is it only necessary to set its path in the ‘base.yml’ file to make full use of both versions? (This is considering using the model for any molecular structure that requires investigation with my own data.) Do you have publicly available code to perform these conversions into the data format?
- What would be the reasons to choose the gemnet_gp model over the gemnet_oc model or vice versa?
- Is the command to use the gemnet_oc model analogous to that for gemnet_gp? (python main.py --mode train --config-yml configs/s2ef/all/gp_gemnet/gp-gemnet-xl.yml --distributed --num-nodes 32 --num-gpus 8 --gp-gpus 4)
- If its use is not analogous, how can I use the gemnet_oc model?
Thank you for your time and attention.