I am working with a chemical dataset that contains many small molecules, each with a different number of atoms. How can I use TensorDict to store such non-aligned data, for example the atomic coordinates? What should I set batch_size to?
For example, the data looks something like this:
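import torch

ndims = 3
data = {
    # coordinates of all atoms, concatenated over molecules (10 atoms, 7 atoms, ...)
    'xyz': torch.concatenate([torch.rand(10, ndims), torch.rand(7, ndims), ...]),
    # one energy per molecule
    'energy': torch.tensor([1, 2, ...]),
    # index of the molecule each atom belongs to
    'batch_id': torch.tensor([0] * 10 + [1] * 7),
}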
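To make the layout concrete, here is a minimal runnable sketch in plain PyTorch, restricted to the two molecules shown above (10 and 7 atoms). The names `num_atoms` and `per_molecule_xyz` are only for illustration and are not part of any library. It shows why I am unsure about batch_size: the per-atom entries have a leading dimension of 17, while the per-molecule entry has a leading dimension of 2.

```python
import torch

ndims = 3
# Illustrative atom counts for two molecules (10 and 7 atoms).
num_atoms = torch.tensor([10, 7])

# Coordinates of all atoms stacked along the first axis: shape (17, 3).
xyz = torch.cat([torch.rand(int(n), ndims) for n in num_atoms])
# One energy per molecule: shape (2,).
energy = torch.tensor([1.0, 2.0])
# Molecule index of every atom: shape (17,).
batch_id = torch.repeat_interleave(torch.arange(len(num_atoms)), num_atoms)

# Recover the per-molecule coordinate blocks from the flat layout.
per_molecule_xyz = torch.split(xyz, num_atoms.tolist())

print(xyz.shape, energy.shape, batch_id.shape)
print([block.shape for block in per_molecule_xyz])
```

So the question is whether a single container can hold both the per-atom and the per-molecule entries, or whether they need to be stored separately.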
Thanks a lot!