Hi there, thanks for providing this amazing huge dataset! I’m still getting oriented and understanding how everything corresponds to everything else, and I have a couple questions I haven’t been able to figure out the answers to:
Should every entry in the LMDB files have an entry in the `sid` field? It seems that when I load it I don’t see `sid` values for the `train` portions, obviously a large segment of the dataset… I’m not sure if it’s supposed to be that way or if I’ve done something wrong, or perhaps have a damaged version of the file…
In the pickle files with mappings, do those system IDs correspond to the same `sid` from the LMDB (prepended by `random`, I guess) or to something else? When I tried just prepending `random` to one of the `sid` values I could see, I got a `KeyError`, so I suspect I’m doing something wrong… maybe the trajectory files have different IDs than the LMDBs? If that’s the case, how do I match them to each other…?
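To make the lookup concrete, here is a minimal sketch of the attempt described above. A made-up dictionary stands in for the mapping pickle, the `random<sid>` key format is the guess being tested rather than a confirmed convention, and `make_key` and `lookup_system` are hypothetical helpers, not part of any library:

```python
def make_key(sid, prefix="random"):
    """Build a candidate mapping key from an LMDB sid.

    Assumption: keys look like prefix + integer, e.g. "random12345".
    int() normalizes plain ints, numeric strings, and scalar array values.
    """
    return f"{prefix}{int(sid)}"


def lookup_system(mapping, sid, prefix="random"):
    """Look up a sid in the mapping, showing example keys on a miss."""
    key = make_key(sid, prefix)
    if key not in mapping:
        # Printing a few real keys makes a format mismatch easy to spot.
        sample = list(mapping)[:3]
        raise KeyError(f"{key!r} not found; example keys: {sample}")
    return mapping[key]


# Toy stand-in for the mapping pickle (entries are invented for illustration).
toy_mapping = {"random12345": {"note": "made-up entry"}}

print(lookup_system(toy_mapping, 12345))
```

With the real files, `toy_mapping` would be replaced by the dictionary loaded from the pickle, and the example keys in the error message would show whether the key format matches.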
Any insights would be greatly appreciated – thanks in advance!
(For additional context: the LMDB’s I’m referring to are the IS2RE task, and I’m loading them in with the