(LibTorch -> PyTorch tensors) How to save a C++ LibTorch tensor and load it into a Python PyTorch project?

Hey, I’m simply trying to save a vector of LibTorch (C++) tensors to file and then load those tensors back into PyTorch (Python) for post-processing reasons.

On the C++ side, I have the following sample code:

  #include <torch/torch.h>

  // Two example tensors to serialize from C++.
  const auto new_tensor = torch::rand({2, 3, 4});
  const auto new_tensor2 = torch::rand({1, 125, 13, 13});

  // Save them together as a vector of tensors.
  torch::save(std::vector<torch::Tensor>{new_tensor, new_tensor2}, "tensor_vector.pt");

I then copy the generated file into my python project directory and call the following script:

import torch

# Attempt to load the file produced by LibTorch's torch::save.
tensors = torch.load('./tensor_vector.pt')

This fails with the following Python error:

Traceback (most recent call last):
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 189, in nti
    n = int(s.strip() or "0", 8)
ValueError: invalid literal for int() with base 8: 'ZZZZZZZZ'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 2297, in next
    tarinfo = self.tarinfo.fromtarfile(self)
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 1093, in fromtarfile
    obj = cls.frombuf(buf, tarfile.encoding, tarfile.errors)
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 1035, in frombuf
    chksum = nti(buf[148:156])
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 191, in nti
    raise InvalidHeaderError("invalid header")
tarfile.InvalidHeaderError: invalid header

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:\Users\admin\.conda\envs\py37_torch\lib\site-packages\torch\serialization.py", line 524, in _load
    return legacy_load(f)
  File "C:\Users\admin\.conda\envs\py37_torch\lib\site-packages\torch\serialization.py", line 448, in legacy_load
    with closing(tarfile.open(fileobj=f, mode='r:', format=tarfile.PAX_FORMAT)) as tar, \
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 1589, in open
    return func(name, filemode, fileobj, **kwargs)
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 1619, in taropen
    return cls(name, mode, fileobj, **kwargs)
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 1482, in __init__
    self.firstmember = self.next()
  File "C:\Users\admin\.conda\envs\py37_torch\lib\tarfile.py", line 2309, in next
    raise ReadError(str(e))
tarfile.ReadError: invalid header

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:/Users/admin/PycharmProjects/TestProject/importLibtorchTensor.py", line 2, in <module>
    tensors = torch.load('./tensor_vector.pt')
  File "C:\Users\admin\.conda\envs\py37_torch\lib\site-packages\torch\serialization.py", line 368, in load
    return _load(f, map_location, pickle_module)
  File "C:\Users\admin\.conda\envs\py37_torch\lib\site-packages\torch\serialization.py", line 528, in _load
    raise RuntimeError("{} is a zip archive (did you mean to use torch.jit.load()?)".format(f.name))
RuntimeError: ./tensor_vector.pt is a zip archive (did you mean to use torch.jit.load()?)

Additionally, if I instead try to load the file with torch.jit:

import torch  
tensors = torch.jit.load('./tensor_vector.pt')

Then I get the following error:

Traceback (most recent call last):
  File "C:/Users/admin/PycharmProjects/TestProject/importLibtorchTensor.py", line 2, in <module>
    tensors = torch.jit.load('./tensor_vector.pt')
  File "C:\Users\admin\.conda\envs\py37_torch\lib\site-packages\torch\jit\__init__.py", line 132, in load
    torch._C.import_ir_module(module_lookup, f, map_location)
RuntimeError: INVALID_ARGUMENT:tensors[1]: Cannot find field. (deserialize at ..\torch\csrc\jit\import.cpp:95)
(no backtrace available)

Is there any way to save a C++ LibTorch tensor and load it as a Python PyTorch tensor? I don’t need to save/load entire models, just specific tensors.


We are working on support for saving a C++ LibTorch tensor and loading it into the PyTorch Python frontend. There is an open issue tracking it (https://github.com/pytorch/pytorch/issues/20356), and we will post an update as soon as possible.


Is there any update on this? I am also trying to load tensors dumped from LibTorch into PyTorch, but it has failed for me as well.


I see that there’s been some progress on this. On the issue linked above (https://github.com/pytorch/pytorch/issues/20356), a comment from Dec 2019 shows a way to export a torch tensor from C++ to disk and then load it back in Python.
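For anyone landing here, a minimal sketch of that direction (C++ -> Python) could look like the snippet below. It relies on torch::pickle_save, which exists in recent LibTorch releases but may differ in older ones, so please check against your version; the file name tensor_cpp.pt is just a placeholder I made up:

  #include <torch/torch.h>

  #include <fstream>
  #include <vector>

  int main() {
    // Tensor we want to hand over to the Python side.
    const torch::Tensor tensor = torch::rand({2, 3, 4});

    // Serialize it with the pickle-based format that Python's torch.load understands.
    const std::vector<char> bytes = torch::pickle_save(tensor);

    // Dump the raw bytes to disk.
    std::ofstream out("tensor_cpp.pt", std::ios::out | std::ios::binary);
    out.write(bytes.data(), static_cast<std::streamsize>(bytes.size()));
    return 0;
  }

On the Python side, torch.load('tensor_cpp.pt') should then return the tensor directly. As far as I understand, a std::vector<torch::Tensor> can be passed to pickle_save in the same way and comes back as a Python list, but see the linked comment for the exact API in your version.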

@solvingPuzzles And what about the other direction: how do you load a PyTorch (Python) tensor in C++ LibTorch?
https://github.com/pytorch/pytorch/issues/36554
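Not an authoritative answer, but based on that issue the reverse direction appears to work with torch::pickle_load, roughly as sketched below. This assumes the file was written from Python with torch.save using the zipfile serialization format (the default since PyTorch 1.6); tensor_py.pt is a placeholder name:

  #include <torch/torch.h>

  #include <fstream>
  #include <iterator>
  #include <vector>

  int main() {
    // Read the whole file written by Python's torch.save into memory.
    std::ifstream in("tensor_py.pt", std::ios::binary);
    std::vector<char> bytes((std::istreambuf_iterator<char>(in)),
                            std::istreambuf_iterator<char>());

    // pickle_load returns an IValue; a single saved tensor converts directly.
    const torch::IValue ivalue = torch::pickle_load(bytes);
    const torch::Tensor tensor = ivalue.toTensor();
    return 0;
  }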

Is there a way to do this operation (C++ tensor to PyTorch Python) directly in GPU memory? Would using pybind11 be useful?
Thanks!
Edit: Would pickle_load and pickle_save work on the GPU?